Intertwined Cultural and Relational
Environments of Organizations*


FRANCISCO J. GRANADOS, Southern Methodist University 





Abstract 


I examine fundamental elements of the cultural and relational environments of 
the organizations that produce electoral forecasts in the Spanish parliamentary 
elections. The analysis reveals how these elements shape the interests and decisions 
in a collectivity of organizations that share a common technology. I investigate 
the reasons for the coincidence of incorrect forecasts across the polling firms by 
considering neoinstitutional propositions about the social construction of institutions 
and isomorphism by mimesis. The coincidence of forecasts contributes to preserving 
the firms’ individual and collective social legitimacy despite repeated forecasting 
failures. To reach coincidence, the firms’ decisions about what forecast to present are
decoupled from the ineffective technological process that legitimates the forecasts.
Cultural conceptions about the production and technology of the forecasting activity
generate interdependence among the polling firms. This relational aspect compels 
firms to conform their forecasts to those of other firms. 


Production by organizations takes place within cultural and relational
environments (DiMaggio 1994; Zelizer 2002; Granovetter 1985; White 2002).
Producers and other actors involved in the same kind of production activity share
understandings about what they produce. This common cultural environment
may be relevant to how the relationships among those actors are structured.
Organizations producing any kind of forecasts presented to the public are
paradigmatic with regard to the relevance of their cultural environment to their


*I thank Jesús M. de Miguel, Juan Diez-Medrano, Joseph Galaskiewicz, David Knoke, John W.
Meyer, Charles Perrow, and Evan Schofer for comments and encouraging reactions. An earlier version
of this article received the 2003 Ronald Burt Student Paper Award from the Economic Sociology
Section of the American Sociological Association. Direct correspondence to Francisco J. Granados,
Department of Sociology, Southern Methodist University, 3225 Daniel Avenue, Dallas, TX 75275.
E-mail: Granados@smu.edu


© The University of North Carolina Press Social Forces, March 2005, 83(3):883-918
production. As I explain below, the production of forecasts always implies, besides
the public expectations about their efficacy, a series of shared understandings 
that generate interdependences among the organizations forecasting a specific 
event. The manifestation of such interdependence becomes especially relevant 
for these organizations if their production process does not by itself fulfill public 
expectations about an effective and coherent performance, thus requiring that 
the set of organizations react to overcome public concerns about their activity. I 
analyze the activity of the polling firms that have incorrectly forecasted the results 
of the parliamentary elections held in Spain since 1977, after the restoration of 
democracy. The production of these publicly notorious and highly demanded 
forecasts presents much uncertainty to the firms because the Spanish electoral 
system makes ineffective the use of survey research techniques for the prediction 
of the distribution of parliamentary seats among the contending political 
parties. 

The result of such uncertainty is the polling firms' conformity to an institution 
that they socially construct. In the more recent Spanish parliamentary elections, 
the polling firms collectively constructed a generic forecast (i.e., a set of convergent
individual forecasts), which generated order across the firms' productions and thus
allowed them to meet some key cultural expectations about forecasting. Such social 
construction acts as an institutionalized convention to which all firms conform 
their subsequent forecasts, resulting in the coincidence of these forecasts over 
time, paradoxically, despite the firms’ incorrect prediction of the electoral result. I 
analyze key elements in the firms’ common cultural and relational environments 
affecting the forecast production and resulting in that institutionalized generic 
forecast. Public cultural conceptions about the polling firms’ common source of 
social legitimacy for producing forecasts (i.e., the scientific forecasting technology) 
generate interdependence among this collectivity, affect the individual behavior 
of the firms, and make the coincidence of their forecasts a necessary condition 
to preserve that legitimacy. 

In the article, I first review some aspects of the neoinstitutional literature 
about isomorphism by mimesis, pointing to the importance of emphasizing 
the collective aspects of this phenomenon and to the interaction of cultural 
and relational elements. Second, I explain the technical limitations that prevent 
the polling firms from correctly forecasting the distribution of parliamentary 
seats among political parties in the Spanish parliamentary elections. Third, I 
examine the extent of the failures and coincidences of the electoral forecasts in 
eight parliamentary elections held in Spain since the restoration of democracy. 
Fourth, I present and refute the polling firms’ own explanation of their failures. 
Fifth, I explain the social construction and institutionalization of a generic 
forecast in the more recent elections since 1989 and the institutionalization of the 
population of polling firms. This last process is related to the historical change 
from divergent forecasts in earlier elections to convergent forecasts in the more 
recent elections. 
Cultural and Relational Aspects


COLLECTIVE ASPECTS OF INSTITUTIONAL ISOMORPHISM
BY MIMESIS


The neoinstitutional theory of organizations indicates that belief systems shared 
by organizations significant to each other create imperatives to conform to the 
prevalent conventions in the cultural environment, thus accounting for the 
similarity of organizational structures and actions (Meyer and Rowan 1977; Meyer 
and Scott 1983; DiMaggio and Powell 1991; see Zucker 1987 and Scott 2001 for 
reviews). Of special relevance to this article are neoinstitutional propositions about 
isomorphism by mimesis (Meyer and Rowan 1977; DiMaggio and Powell 1983) 
and the related processes of emergence and maintenance of institutions (Zucker 
1977; DiMaggio 1988; Weick 1995; Tolbert and Zucker 1996). The theory indicates 
that organizations facing high uncertainty in technical aspects of their production, 
having ambiguous and questioned goals, and subject to public evaluation are 
highly dependent on appearing legitimate. Hence, they are especially prone to seek 
conformity to the specific social and cultural conventions in their environment.¹
In an effort to fulfill public expectations regarding their structural characteristics 
and actions, organizations avoid standing out and being noticed as different from 
public expectations. Importantly, adopting such conventions does not necessarily 
produce an increase of organizational efficacy or efficiency, but it may result in 
increased individual legitimacy, which betters chances of individual survival and 
success. One consequence of the generalized adoption of conventions is greater 
homogenization—that is, isomorphism—of organizational structures, productive 
processes, and behavior among organizations. 

One mechanism leading to institutional isomorphism is mimesis by 
organizations that purposively model themselves on other similar organizations 
(especially those regarded as superior or more successful) by adopting similar 
or identical decisions and structures. Conversely, decisions and structures 
can be imposed, induced, or authorized in a set of organizations by powerful 
agents of an organizational field (typically the state and important suppliers 
or customers). Finally, professional knowledge and socialization aspects that 
members of a profession acquire during their standard education may result 
in common normative elements and common organizational structures among 
organizations that incorporate the same kind of professionals (DiMaggio and 
Powell 1983; Scott 1987). 

As I show below, the convergence of failed forecasts across polling firms is an 
instance of institutional isomorphism by mimesis, although a series of subsidiary 
normative and coercive aspects are also present. Thus, as professionals, the 
pollsters share and apply the same formal scientific knowledge, which conveys 
legitimacy to the polling activity.² However, as explained in the next section,
this knowledge is invalid in forecasting the distribution of seats in the Spanish
Parliament among the contending parties. Therefore, the pollsters cannot base 
their forecasts on that knowledge, and consequently this knowledge is not the 
cause of the observed convergence of electoral forecasts among polling firms. 
The existing pollsters’ professional associations and networks, as well as firms’ 
occasional collaborations in making polls, may facilitate communication and 
mimesis among polling firms. Also, legal dispositions concerning electoral polls 
pose some restrictions to their publication. Finally, the media that publish the 
polls they order from the polling firms demand that these firms provide the kind 
of forecasts that are more eloquent for the media audiences. These forecasts refer 
to the distribution of parliamentary seats among parties. 

A few empirical studies have shown cases of isomorphism by mimesis in 
the context of a broader research objective (e.g., Fligstein 1985; Levitt and Nass 
1989; Galaskiewicz and Wasserman 1989; Haveman 1993; Haunschild and Miner 
1997), although some of the studies analyze this phenomenon rather marginally. 
My investigation contributes to neoinstitutional theory of organizations by 
comprehensively analyzing an instance of mimetic isomorphism. I consider 
the social construction of the institution that becomes a convention adopted 
by the polling firms. I also expand the usual theoretic scope of the origin and 
consequences of institutional isomorphism. In this regard, while previous 
literature has considered the survival and success consequences for individual 
organizations adopting organizational structures or decisions prevalent in their 
environment, my research also considers the analysis of collective aspects of 
institutional isomorphism. 

I explain how, in the more recent elections, the polling firms have collectively 
constructed generic forecasts in order to generate consistency across their 
production and, in this way, meet cultural expectations about that production 
(Berger and Luckmann 1967; Weick 1979, 1995). Another collective aspect is the
interdependence among polling firms resulting from sharing the same source 
of legitimacy to forecast the electoral results, namely, the scientific forecasting 
technology. This technology consists of survey research methods to obtain data 
on voting preferences and statistical techniques to analyze them. I argue that, in 
the context of repeated incorrect forecasts in consecutive elections, preserving the 
legitimacy of that technology requires that the polling firms present convergent 
forecasts. This conformity of individual firms’ predictions to a generic forecast 
so constructed allows the firms to articulate an explanation of their failure that 
is consistent with cultural expectations about their interdependent production. 
Importantly, that explanation exculpates the ineffective forecasting technology 
from failure, thereby preventing the delegitimation of the firms’ production 
activity and, for that matter, of the polling firms. Thus, a third collective aspect 
of isomorphism that I investigate is the beneficial effect of convergent forecasts 
to preserve the social legitimacy of the collectivity of polling firms. Such effect 
is distinct from, but compatible with, the positive consequences that being 
isomorphic may have on the survival and success of the individual polling firms
(Meyer and Rowan 1977; DiMaggio and Powell 1983).³


INTERDEPENDENCE BY CULTURE AND INTERDEPENDENT CULTURE 


Emphasis on collective aspects of institutional isomorphism makes evident 
the relevance of studying intertwined cultural and relational environments of 
organizations.⁴ The key aspect of the polling firms’ relational environment is their
mutual interdependence resulting from cultural beliefs about the technology they
share.⁵ Organizations with a common technology make the same kind of products
(in this case, the electoral forecasts) and become interdependent because of 
cultural expectations about the characteristics of products and their production. 
Moreover, these cultural elements shape organizational interests in specific ways, 
affecting how organizations relate to each other as well as the possible courses 
of individual and collective action. 

In the case of the Spanish polling firms, what are those cultural expectations, 
and how do they make these organizations interdependent? All polling firms 
are culturally identified by the public as part of the same scientific knowledge 
phenomenon, in the sense that the statistical technological process they use should 
yield coincident electoral forecasts across firms. Since divergent forecasts would 
make that technology lose its validity as the source of social legitimacy for polling, 
that cultural expectation compels the firms to act in a way that allows them to 
display publicly their expected interdependence. Moreover, three other features 
related to that shared technology contribute to the firms' interdependence. First, 
the public (as well as the polling firms themselves) culturally recognizes these 
organizations as constituting a single production phenomenon. Second, the 
public can assess the consistency of forecasts across firms by comparing them 
and observing whether they are coincident. Third, the firms’ culturally defined 
social legitimacy to make forecasts is their most valued asset. This collective 
asset of the firms results from the scientific character of their shared forecasting 
technology. 

Importantly, in the case of the Spanish polling firms, their crucial 
interdependence arises from the cultural belief that forecasting is a standard 
process derived from a rigorous scientific technology, rather than from the firms’ 
actual application of this technology. Although the specific features of the technical 
process and the quality of the polling firms’ survey research are relevant elements, 
my analyses indicate that the cultural meaning of that technology is what makes 
it key to defining how those organizations relate to one another. In such cultural 
and relational context, the convergence of forecasts across polling firms results 
from the fact that the pollsters’ decisions about what forecast to present are 
decoupled from the ineffective technological process for forecasting (Meyer and 
Rowan 1977). Instead, the pollsters base those decisions on previously published 
forecasts. Ironically, the firms’ common technology, the cultural meanings of
which cause the firms' interdependence, must be disregarded in deciding the
forecasts so that the forecasts converge and conform to the cultural expectation 
of the firms' interdependence that results from sharing that technology. 

Complete understanding of the social organization phenomenon investigated 
and of the interplay between its cultural and relational aspects requires considering 
the interests of the polling firms (Scott 1987; DiMaggio 1988). Interests are socially
constructed and embedded in cultural institutions and in structures of social 
relationships; these institutions and structures constitute the individual actors 
(Meyer and Jepperson 2000; White 1992; Weber [1922] 1947).⁶ My investigation
indicates that the interests (and subsequent courses of action) of the polling firms 
are greatly shaped by two cultural aspects—the institutionalized generic forecast 
that the firms socially construct and the cultural meanings about the forecasting 
technology—as well as by the relational interdependence that results from sharing 
this technology. Moreover, I suggest that the intertwining of these elements 
mutually affects the specific relevance of each element in the social construction of 
the firms’ interests. Thus, the cultural meaning of the firms’ technology generates 
the firms’ interdependence, and this relational aspect constricts the firms’ decisions 
so that their subsequent forecasts conform to the generic forecast. This conformity 
prevents firms’ individual attempts at quality differentiation of their product from 
the rest of firms, thus lowering competition among them. Most important, the 
firms’ interdependence requires convergent forecasts so that they can articulate 
an explanation of their failure that buffers the ineffective forecasting technology 
from any responsibility for the failed forecasts—thus preserving that technology 
as a valid source of social legitimacy. 


Technical Deficiencies of the Forecasts 


With the restoration of democracy in Spain, after a four-decade dictatorial regime 
starting in 1939, eight democratic elections were held from 1977 to 2000. In 
parliamentary elections, 350 deputies are elected to serve four-year terms in the 
Spanish Parliament. After 15 days of official electoral campaigning, the voters 
vote for one of the blocked lists that the political parties propose in each of the 
52 electoral districts. These districts correspond to administrative divisions within 
each of the 17 Autonomous Communities (i.e., the political and administrative 
sections of the country) and two autonomous cities. Each district is assigned a 
number of parliamentary seats that varies according (although nonproportionally) 
to its population. The total of seats a party obtains in the Parliament is the sum 
of the seats it wins in each electoral district. Although from 11 to 16 political 
parties have obtained parliamentary representation in the eight elections under 
discussion, the majority of seats have always gone to two main parties—one 
social-democratic and the other conservative. Historically, the winning party has 
obtained from 146 to 202 (41.7% to 57.7%) parliamentary seats, and the second 
party from 101 to 141 (28.9% to 40.3%). The next party in number of seats has
obtained from 12 to 23 (3.4% to 6.8%) seats. 

Most relevant for my research are some technical aspects about forecasting and 
the Spanish electoral system. The polling firms’ surveys have high levels of both 
nonresponse and respondents who report not having made their voting decisions at the
time of the interview.⁷ Moreover, although the only data of statistical reliability
in an electoral poll are the declared vote intention and the number of undecided 
interviewees, the media ordering the polls demand that the polling firms forecast 
the estimated vote intention for each political party (i.e., the sum of both the 
declared vote intentions and the pollsters' estimate of the electoral choices of the 
undecided respondents). This kind of forecast requires that the pollsters adjust 
the declared vote intention data by applying a projective formula to estimate the 
most probable vote intention of the undecided. This overt manipulation of the 
raw survey data implies that the pollsters insert into their forecasts speculations 
about the interviewees' future choices, thereby undermining the polls' statistical 
reliability.⁸ Given the high number of undecided respondents, these projections
hold the key to the accuracy of the forecasted vote distribution among political 
parties. 

The media, especially newspapers, demand that the polling firms estimate 
vote intentions so that their audiences can more easily get an idea of the eventual 
electoral voting results. Nevertheless, the media's most valued forecast is the 
distribution of parliamentary seats among political parties. These forecasts are the 
most suggestive for the media audience and allow journalists to speculate about 
the alternative governments that might result from the election (e.g., majorities 
and governmental alliances among parties). The polling firms accept the media 
demand even knowing that they run a high risk of failure (Díez-Nicolás 1993; 
El País 1996).⁹ The media's demand, election after election, for such forecasts
that are doomed to fail is a main reason for the persistence of this failing polling 
organizational and professional activity (Meyer and Zucker 1989; Clarke 1999). 

I focus on forecasts of the distribution of parliamentary seats among parties. 
The technical basis of these forecasts is even more ineffective than that of the 
estimation of vote intention. The first difficulty lies in deriving the forecasts of 
parliamentary seats from those rough estimated vote intentions. Most importantly, 
independently of the quality of the survey data used to estimate vote intentions, 
distinctive characteristics of the Spanish electoral system render practically 
impossible any accurate forecasts of the distribution of parliamentary seats across 
the contending political parties (Díez-Nicolás 1996). 

Two characteristics of the Spanish electoral system are relevant.


(1) The number of parliamentary seats disputed in each electoral district is 
not proportional to the electoral population. Therefore, the polling firms 
must obtain a sample and statistical estimates of the vote distribution for 
each district. To be reliable, a polling firm should select 52 representative 
samples, one for each electoral district, with a minimum of 1,000 cases
(i.e., a national total of 52,000 interviews). The sampling error for a point
estimate in each sampled electoral district would approximate ± 2.5% for
a 95% confidence level (Díez-Nicolás 1996).¹⁰ However, the sample sizes
drawn for electoral polls range from 1,500 to 16,000 cases for the whole
country and thereby greatly increase the sampling error.¹¹


(2) The Spanish electoral system is proportional (it applies the d'Hondt method
to allocate the parliamentary seats of each electoral district to the party 
lists). In proportional electoral systems, the distribution of seats won in 
each electoral district by contending parties is proportional to the number 
of votes cast for each party list in the district. This proportionality of the 
Spanish electoral system implies that the last parliamentary seat in each 
electoral district is allocated to one of the contending parties having a slight 
vote advantage. 
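
The seat-allocation rule named in point (2) can be sketched in a few lines. The following is a minimal illustration of the d'Hondt highest-averages rule, not the pollsters' or the electoral authority's actual procedure. The party labels and vote counts are hypothetical, and the sketch ignores any legal vote threshold and any tie-breaking rule.

```python
def dhondt(votes, seats):
    """Allocate `seats` among party lists with the d'Hondt highest-averages rule."""
    allocation = {party: 0 for party in votes}
    for _ in range(seats):
        # A party's current quotient is its votes divided by (seats already won + 1).
        winner = max(votes, key=lambda p: votes[p] / (allocation[p] + 1))
        allocation[winner] += 1
    return allocation


# Hypothetical five-seat district; the vote counts are illustrative only.
baseline = dhondt({"A": 100_000, "B": 80_000, "C": 30_000}, seats=5)
# Same district after party C gains 4,000 votes (under 2% of the votes cast).
shifted = dhondt({"A": 100_000, "B": 80_000, "C": 34_000}, seats=5)

print(baseline)
print(shifted)
```

In this made-up district the fifth seat moves from party A to party C after a shift of 4,000 votes, which is the kind of last-seat sensitivity the text describes.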


Díez-Nicolás (1996) shows that for the 1996 election, in 33 of the 52 
electoral districts, the number of votes needed to change the assignation of the 
last parliamentary seat from one party to another was less than 2.5% of the
district's electoral population, that is, a quantity smaller than the sampling error 
of a hypothetically very accurate survey. Importantly, this circumstance makes 
forecasting the distribution of parliamentary seats among parties extremely 
uncertain because a pollster has no statistical basis for deciding which party 
would be most likely to obtain the final seat to be allocated for a district. 
Specifically, a researcher performing a statistical significance test cannot reject 
the null hypothesis of no difference in the number of votes likely to be won 
by two parties competing for the last parliamentary seat to be allocated in an 
electoral district. Therefore, the nonstatistically grounded decisions that pollsters 
make about the allocation of the last seat of each electoral district determine 
the forecast of the total number of parliamentary seats that parties obtain in 
the Spanish Parliament. Each of the 52 electoral districts represents a possible 
forecasting failure with regard to last-seat allocation. The overall national forecast
of the distribution of parliamentary seats across parties shows the forecasting 
inaccuracy amplified because the national forecast accumulates all the failures 
of the district-level forecasts. 
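
The relation between sample size and sampling error discussed above can be sketched as follows. This is a rough illustration under stated assumptions: simple random sampling, the normal approximation for a single proportion, and a national sample split evenly across the 52 districts. The function names and the even split are hypothetical and are not taken from the polling firms' methods. Comparing a vote-share gap with a single-proportion margin is also a simplification, since the margin for a difference between two shares is larger.

```python
import math


def margin_of_error(p, n, z=1.96):
    """Half-width of an approximate 95% confidence interval for a proportion p."""
    return z * math.sqrt(p * (1 - p) / n)


def last_seat_is_callable(gap, p, n):
    """True only if the vote-share gap deciding the last seat exceeds the margin."""
    return gap > margin_of_error(p, n)


# Hypothetical party share of 20% in a district.
share = 0.20
per_district_ideal = 1_000            # the 1,000 cases per district cited in the text
per_district_large = 16_000 // 52     # largest national sample, split evenly (assumption)
per_district_small = 1_500 // 52      # smallest national sample, split evenly (assumption)

for n in (per_district_ideal, per_district_large, per_district_small):
    print(n, round(margin_of_error(share, n), 3))
```

Under these assumptions, a share of 20% with 1,000 interviews gives a margin of roughly 2.5 percentage points, and the margin widens quickly as the per-district sample shrinks, so a last-seat gap below 2.5% of the vote cannot be called on statistical grounds.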

In conclusion, the research method that the polling firms apply to forecast 
parliamentary seats is technically deficient. This deficiency is impossible to 
overcome and makes feasible a wide range of possible forecasts of the electoral 
result, depending on a series of polling firms' decisions unsubstantiated by 
statistical knowledge. This technical limitation prevents polling firms from 
learning by doing to improve their research process. It also precludes explaining 
the coincidence of forecasts across firms as arising from the firms' convergence 
with regard to any specific aspect of the firms' polling methods. Even if the polling 
firms used identical methods at each step, they would still face the uncertainty 
of deciding to what party to allocate the last parliamentary seat in each electoral
district. That technical limitation, which is common to all firms, also implies the 
impossibility of differentiating the firms' forecasts according to different quality 
levels. This circumstance prevents ranking of polling firms based on their success 
and differentiating between leader and follower firms. 


Coincidence across Failed Forecasts 


I analyze the failure of parliamentary seat forecasts and their coincidence 
patterns across different polling firms. Figures 1 through 8 present, for each 
parliamentary election since 1977, all published forecasts about the total number 
of parliamentary seats for the two main political parties at each election.¹² The
two straight horizontal lines in the graphs indicate the actual electoral results for 
both parties, which allow visual assessment of the difference between forecasts 
and results (note that these straight lines do not mean to imply that the actual 
vote intention remains constant over time). The poll estimates in each graph 
are arrayed from left to right by dates of publication, which are indicated on 
the horizontal axis for each individual forecast (or set of forecasts published 
by different firms on the same day; these sets are separated by dotted vertical 
lines in the graphs). The vertical axis indicates forecasted and actual number of 
parliamentary seats. The letters in the graphs label the polling firm making 
each pair of forecasts (i.e., one forecast for each of the two main political 
parties). Note that since the 1982 election most polling firms published a range 
indicating the minimum and maximum number of seats expected for each party. 
In these cases there are two symbols indicating the lower and upper limits of the 
ranges, respectively. For example, in the 1982 election two polls forecasted the 
parliamentary outcomes and were published on the same date, October 22. The 
polling firm F estimated that the two main political parties would get between 
193 and 217 and between 87 and 107 parliamentary seats, respectively. The other 
poll, made by collaboration between firms N and H, did not forecast ranges but a 
specific number of parliamentary seats for each party, 217 and 69, respectively. 

The graphs indicate that the polling firms failed to forecast correctly the 
number of parliamentary seats obtained by the two main political parties in 
seven of the eight elections. The extent of the errors varies by election and by 
party, and occasionally some forecasts come close to succeeding (especially if the 
forecasts consisted of large ranges). In the 1989 election, the forecasts were close 
to the electoral result.¹³ The graphs also indicate that the forecasts of the 1977,
1982, and 1986 elections did not coincide across different polls for one of the two 
main political parties. This circumstance changed in the 1989 election, when the 
forecasts began to coincide for both political parties. The coincidence of forecasts 
across polling firms continued in the elections of 1993, 1996, and 2000, when it 
reached its highest level. In these more recent elections, the set of forecasts jointly 
Figure 1. Forecasts of Parliamentary Seats, Election of June 15, 1977
Figure 2. Forecasts of Parliamentary Seats, Election of March 1, 1979
Figure 3. Forecasts of Parliamentary Seats, Election of October 28, 1982
Figure 4. Forecasts of Parliamentary Seats, Election of June 22, 1986
Figure 5. Forecasts of Parliamentary Seats, Election of October 29, 1989
Figure 6. Forecasts of Parliamentary Seats, Election of June 6, 1993
Figure 7. Forecasts of Parliamentary Seats, Election of March 3, 1996
Figure 8. Forecasts of Parliamentary Seats, Election of March 12, 2000
Table 1. Standard Deviations of Parliamentary Seat Forecasts, by Political
Parties (1977 to 2000 Elections)ᵃ

                1977    1979    1982    1986    1989    1993ᵇ    1996    2000

First party     ?.12    4.60    4.24    11.29   3.73    3.86     4.35    2.26
                                                        (5.11)   (5.42)

Second party    10.61   2.83    9.90    4.42    4.11    3.73     4.33    2.65
                                                        (4.72)   (7.65)

Third partyᶜ    -       1.06    3.01    5.??    3.20    3.09     2.42    1.52
                                                                 (2.88)

ᵃ The standard deviation indicates the variation or dispersion in a distribution of scores around the mean.
The formula used here for its calculation is

$$s = \sqrt{\frac{\sum_{i=1}^{N} (y_i - \bar{y})^2}{N}}$$

where $y_i$ refers to each score (forecast), $\bar{y}$ is the mean of the set of forecasts considered, and $N$ is the
total number of forecasts in one election. Forecasts that offered a range of values are considered in the
computations using the midpoint of the range. Note that the denominator of the formula is N instead of
N-1. This latter divisor is used to calculate a corrected estimator of population standard deviations. The
analysis of Table 1 does not involve inference; therefore, the use of N-1 as a divisor is unnecessary. Moreover,
a divisor with N-1 would yield inflated values of the standard deviations referring to the earlier elections,
whose numbers of polls were much smaller than in the later elections.

ᵇ In the 1993 and 1996 elections, the figures in parentheses indicate standard deviation including the outlier
polling firm, and the figures not in parentheses indicate standard deviation excluding the outlier.
The forecast for the third party of the outlier firm in the 1993 election did not differ from the forecasts
of the other firms; therefore, that firm is not considered an outlier with regard to that forecast.

ᶜ The analysis for the third party in the 1977 election is unfeasible because only one of the two polls published
in that election presented forecasts about the third party.


either overestimated or underestimated the electoral results. The forecasts were not 
distributed around the electoral results (1.e., with some polls above and others below 
it). Note that the 1993 and 1996 elections each present an outlier polling firm: 
polling firm A in 1993 (it made one forecast), and O in 1996 (it made three 
forecasts). These two firms presented forecasts divergent from forecasts of the 
other firms. Note also that the 1979 election was an exception to the historical 
trend of change from noncoincidence to coincidence of forecasts—in this early 
election, the two polls presented similar forecasts for the two main parties.

I measure the extent of coincidence across forecasts using the standard 
deviation of all the seat forecasts in each election.15 In this analysis, I consider
forecasts for the first three political parties in each election. Table 1 reports the
results and details of this analysis. Standard deviations close to zero indicate low 
dispersion—that is, coincidence across forecasts in this case—while values far 
from zero indicate high dispersion. (For the 1993 and 1996 elections, I present 
analyses both including and excluding the outlier firms.) As one would expect 
from observing the graphs, the standard deviation of the forecasts for at least
one party is considerably higher in the earlier elections through 1986 compared
with the four elections since 1989 (with the exception of the 1979 election mentioned
above). These results and the previous graphs indicate a history of failed forecasts
in the Spanish parliamentary elections and, paradoxically, a striking coincidence
across forecasts in the more recent elections when all forecasts jointly either
overestimated or underestimated the electoral results.

Figure 9. Diagram of the Confirmatory Factor Analyses Coefficients for the
2000 Election
Latent factors: Factor Vote (common reason in the mistakes of vote forecasts) and
Factor Seats (common reason in the mistakes of seats forecasts).
*p<.05 **p<.01 ***p<.001 (two-tailed tests)
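
The dispersion measure reported in Table 1 can be illustrated with a minimal sketch. It assumes only what the table note states: the divisor is N rather than N-1, and a forecast given as a range enters the computation through its midpoint. The function names and the seat figures are hypothetical and are not data from Table 1.

```python
from math import sqrt


def midpoint(forecast):
    """Return a point value; a (low, high) range is replaced by its midpoint."""
    if isinstance(forecast, tuple):
        low, high = forecast
        return (low + high) / 2
    return float(forecast)


def population_sd(forecasts):
    """Standard deviation of a set of forecasts with N (not N-1) as the divisor."""
    values = [midpoint(f) for f in forecasts]
    n = len(values)
    mean = sum(values) / n
    return sqrt(sum((y - mean) ** 2 for y in values) / n)


# Hypothetical seat forecasts for one party in one election;
# the second poll published a range of 155 to 165 seats.
example = [160, (155, 165), 170, 150]
print(round(population_sd(example), 2))
```

A value near zero corresponds to coincident forecasts; larger values correspond to dispersed forecasts.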

I further analyze the forecasts for the last five elections (1986 through 2000) 
by performing five confirmatory factor analyses (Bollen 1989; Bollen and Long 
1993) that consider the extent of coincidence across polling firms with regard to
the deviation of their forecasts from the electoral result. The results appear in 
Table 2. Figure 9 shows the model tested in diagrammatic form (displaying the 
coefficients for the 2000 election). These analyses can include only polling firms 
that published specific forecasts for each of the 52 electoral districts of Spain 
regarding vote intention and/or distribution of parliamentary seats, typically in 
the final poll published about one week before the election. In the models, each 
observed variable (labeled with a polling firm’s identification letter) consists of an
indicator referring to a firm's final poll. The 163 cases in these variables consist of
the difference between the firm's forecast and the electoral result for each electoral
district and each political party considered. Therefore, the observed data in these
analyses indicate the level of error by either overestimation or underestimation 
of the specific polls analyzed. 

Two latent constructs or factors are modeled, one for vote intention forecasts 
and the other for parliamentary seat forecasts (data limitations in the 1986 
election allow modeling only for the latter). These factors represent for vote and 
seats forecasts, respectively, a hypothetical common reason affecting all polling 
firms and accounting for the firms’ coincidence in terms of deviation between 
forecasts and the electoral result. Finally, each correlation between the error terms 
referring to indicators for vote and seats forecasts of each polling firm considers 
the existence of a “house effect” for the errors in the firms' forecasts; that is, the 
extent to which unique mistakes by a firm in forecasting vote intentions generate
mistaken parliamentary seat forecasts. 

As expected from the previous analysis, the results for the four most recent 
elections, 1989 to 2000, are similar to each other, and they differ from the 1986 
election. The goodness-of-fit statistics indicate that the hypothesized model closely 
fits the observed data of the four more recent elections. The factor loadings of 
the latent construct referring to forecasts of vote intention indicate moderate 
coincidence (values from .53 to .80) across polls in 1989 and 1993, and high 
coincidence (values from .81 to .85) in 1996 and 2000. Similarly, the factor 
loadings of the latent construct referring to forecasts of parliamentary seats indicate
moderate coincidence (values from .56 to .73) across polls in the election of 1989,
and higher coincidence (values from .70 to .94) in the elections of 1993, 1996, 
and 2000. (In 1996 the coincidence is only moderate [value .59] with regard to 
one firm [Z].) These results suggest the existence of a common reason affecting 
all polling firms and generating a high level of cross-firm coincidence of incorrect 
forecasts of both vote intention and parliamentary seats. The correlations between 
the two factors in the models of the 1989 to 2000 elections indicate a moderate 
overall coincidence (values from .53 to .65) between the erroneous forecasts of 
vote intention and parliamentary seats. Finally, the moderate-low—and in the 
expected direction—correlations between error terms modeling “house effects” 
indicate that these effects are not very substantial (especially in the 1996 and 2000 
elections), suggesting that a polling firm’s mistakes in forecasting parliamentary 
seats are largely unrelated to its mistakes in forecasting vote intention. 

The factor analysis for the 1986 election has only the latent construct about
forecasts of parliamentary seats because of the lack of data about forecasts of 
vote intention by electoral districts in this election. The result is consistent with 
the analysis shown in Table 1 about this election, namely, that the polling firms 
did not coincide in their forecasts of the parliamentary seats won by one of the 
two main political parties. The factor loadings indicate a moderate (.51 to .78) 
coincidence across polls, and the hypothesized model poorly fits the observed
data. In contrast to the 1989 to 2000 elections, these results do not suggest that
a common reason affected all the firms' 1986 electoral forecasts.


Polling Firms' Explanation of Their Failures 


After an election, the discrepancy between forecasts and the actual electoral result 
becomes evident, and the polling firms feel compelled to explain their failures. 
The media audiences expect the forecasts to predict the correct rank order of
the main political parties according to the electoral result and the approximate
difference in number of parliamentary seats between positions in the ranking.
Among the cultural understandings about electoral polling is that forecasts
may not be completely accurate but should be closer to the electoral result
than has been the case for the polls on the Spanish parliamentary elections.
Political reporters and experts publish newspaper articles and argue on radio and 
television broadcasts about the failure of the forecasts. They base their demands 
for accuracy on the polling experiences of other electoral systems that technically 
allow for more accurate forecasts, such as the majority system of the U.S. The 
pollsters themselves have publicly acknowledged the failure of their forecasts. For 
instance, the CEO and main researcher of one firm commented in a newspaper 
article about the 1996 and 2000 elections: 


Did the firms collude to avoid the risk of what happened in 1996? Sincerely, 
I think that the convergence in the estimations [of the 2000 election] is 
due to some extent to the punishing lesson of 1996, but basically it was 
a bet for precaution about the predictable mobilization of the left-wing 
electorate in the days before the end of the campaign . . . The fact that 
the asymmetry in the mobilization would remain until the day of the 
election was evidently a question of opinion. A riskier interpretation was 
perfectly possible (and it would have been, of course, more correct). But
it is not possible to consider technically deficient or, even less, dishonest 
the option taken by the majority [of pollsters]. Again we [the pollsters] 
have been corporatively damaged by an absurd norm, imposed by the 
article 69 of the Electoral Law, referring to an unjustified limit on the 
information provided to the readers, in other words, to the citizens. (Wert 
2000; translated from Spanish). 


As exemplified in this statement, the polling firms offer a post-electoral
explanation of their failure that is intended to limit public disappointment with
the polls. Most important, I argue that their explanation seeks to preserve the
social legitimacy of the polling activity by exculpating the forecasting technology
from any responsibility for the failure. Therefore, despite the pollsters' awareness
of the technical limitations in making the forecasts, they avoid any self-criticism
in that regard. Instead, when publicly facing criticism of their performance,
they present a different cause of their failures (Goffman 1959). The pollsters argue
that the differences between forecasts and electoral results, and the coincidence 
of these errors across different polls, are caused by the common effect on all 
the polls of systematic measurement errors due to contingent behaviors of the 
interviewees and the electorate (Wert 1994, 1996, 2000; Martí-Font 1996). More 
specifically, the polling firms explain their failures arguing that: (1) interviewees 
who intended to vote for unpopular political parties hid their real vote intentions 
by either reporting a different one or refusing to answer the survey; (2) events 
at the very end of the electoral campaign mobilized segments of the electorate 
that, in the survey, reported that they intended not to vote; and (3) electorate 
segments changed their vote intentions during the period between the last surveys 
and the election day (which assumes an extremely volatile vote intention of the 
electorate). 

The explanation of the polling firms reconciles the interrelated phenomena 
of forecasts and electoral result, captures the interdependence of the different 
poll results (which supposedly reflect the same voting reality), and presents their 
failure as logical and inevitable. To be sure, the explanation is that those three 
kinds of behaviors cause systematic measurement errors not controlled by any 
polling firm in their research, identically affecting all surveys, and thus yielding 
the convergence across the failing forecasts. The logic of this explanation depends
on the convergence of failed forecasts among polls, and thus the coincidence of 
forecasts becomes a critical issue for the polling firms. Such coincidence allows 
for a retrospective interpretation of the electoral process—in terms of behaviors 
and results—in a way consistent with the expected interdependence of forecasts 
across firms. 

Although those three electorate or interviewee behaviors surely occur to 
some extent, the polling firms' explanation assigns to such behaviors the sole 
responsibility for the failed forecasts. The polling firms' explanation does not 
consider the technical deficiencies in the forecasting process, and it assumes that 
each individual polling firm is a closed system whose forecasts are unaffected 
by the other firms' forecasts. However, the crucial technical limitation posed by 
the characteristics of the Spanish electoral system implies that, even if the polling 
firms could perfectly forecast vote intention or if all firms were affected in the same 
way by systematic errors, the uncertainty that each pollster faces in deciding how 
to allocate the last parliamentary seat in each electoral district would still most 
likely yield divergent rather than coincident forecasts of parliamentary seats across 
polling firms. 

The following analysis tests the validity of the polling firms’ explanation. In 
contrast to the preceding analyses, it considers vote intention forecasts rather than 
parliamentary seat forecasts. The analysis includes all published polls from the 
1989 to 2000 elections that forecasted vote intentions for each of the 52 electoral 
districts. I investigate the differences between forecasts and electoral results across 
electoral districts with comparable sociopolitical characteristics (i.e., districts
belonging to the same political section). The polling firms’ proposition that
behaviors of the interviewees and the electorate create systematic measurement 
errors in the polls should be reflected in the forecasts of a specific poll for the 
electoral districts within a political section showing a similar pattern of systematic 
measurement error. 

To test this expectation, I apply separately by poll, political party, and section 
an index of internal poll consistency designed specifically for this analysis. The
index is

$$\frac{\sum_{i=1}^{N-1} \sum_{j=i+1}^{N} \left| (F_i - R_i) - (F_j - R_j) \right|}{N(N-1)/2}$$

where $F_i$ is the forecast in electoral district i, $R_i$ is the electoral result in electoral
district i, and N is the number of electoral districts compared within the political
section. Note that the difference between $F_i$ and $R_i$ indicates the extent of survey
measurement error concerning the forecast of electoral district i. This index 
indicates the level of similarity (or dissimilarity) in the differences between 
forecast and electoral results across the electoral districts of a political section. 
Index levels equal or close to zero indicate similarity of these differences regardless 
of the magnitude of the differences. For example, a section with overestimations 
of vote intention of 4% (i.e., the forecast is 4% higher than the actual electoral
result) in each electoral district would yield an index of internal poll consistency
of 0; a section with overestimations of 6% in all districts would also yield the same
index value. Index levels departing from zero indicate dissimilarity. For example,
a section with two electoral districts, one with an overestimation of +1% and
the other with an underestimation of -1%, yields an index of 2.
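
The index can be illustrated with a minimal sketch. It assumes that the index is the mean absolute difference between the forecast errors (forecast minus result) of every pair of districts in a political section, a reading that reproduces the two worked examples in the text. The function name and the vote shares are hypothetical.

```python
from itertools import combinations


def consistency_index(forecasts, results):
    """Mean absolute pairwise difference between district forecast errors.

    forecasts[i] and results[i] are the forecast and the electoral result
    (in percent of the vote) for district i of one political section.
    """
    errors = [f - r for f, r in zip(forecasts, results)]
    pairs = list(combinations(errors, 2))
    if not pairs:
        raise ValueError("at least two districts are required")
    return sum(abs(a - b) for a, b in pairs) / len(pairs)


# Every district overestimated by 4 points: the errors are identical, index 0.
print(consistency_index([34, 44, 29], [30, 40, 25]))

# Two districts, one overestimated by 1 point and one underestimated by 1: index 2.
print(consistency_index([31, 39], [30, 40]))
```

Because the index depends only on differences between errors, a uniform error of any size across a section leaves it at zero, which is the property the argument relies on.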

Similarity indicates consistency among the district forecasts of a specific poll 
because it suggests that systematic measurement errors affect all electoral district 
forecasts within a political section equally. Such a result is consistent with the 
polling firms' explanation that systematic measurement errors cause the failures 
of their electoral forecasts. Conversely, dissimilarity indicates inconsistency among 
a specific poll’s forecasts for districts within a section and implies that these 
forecasts were affected in different ways by electorate and interviewee behaviors, 
hence presenting different systematic measurement errors and contradicting 
the pollsters’ explanation. Specifically, dissimilarity would suggest the illogical 
circumstance that interviewees in one electoral district hid their vote intentions for 
one political party (generating underestimation of vote intention regarding that 
party) but that such duplicitous behavior did not occur in another comparable 
electoral district. Likewise, it would suggest that some event at the very end of
the electoral campaign either mobilized segments of the electorate or made the
electorate change its vote intention in only some of the political section’s districts.
These hypothetical circumstances are unlikely to occur because they do not reflect
the national scope of the electoral process in the Spanish parliamentary elections.

Table 3 presents the results of the analysis. For each poll, it reports the
summary indexes by political section (i.e., weighted averages of indexes by parties
in a section) and the total index (i.e., weighted average of the summary indexes by 
section). The total indexes are: 2.0, 2.1, and 5.9 in the polls of 1989; 2.2, 2.6, and 
3.0 in 1993; 1.9 and 2.2 in 1996; and 1.9 and 2.2 in 2000. These results indicate 
that the average difference between the systematic measurement errors in forecasts 
for the same political party in two comparable electoral districts ranges from 
1.9% to 5.9% of vote intention, depending on the poll analyzed. These results
suggest substantial dissimilarity within polls between the forecasting errors of 
comparable electoral districts and thus internal inconsistency of the polls with 
regard to how the forecasts for districts were affected by systematic measurement 
errors. The analysis shows that the polling firms’ behavioral explanation about 
the origin of their failure is not reflected in their own poll results. The analysis 
therefore refutes the polling firms’ explanation of both the origin of the forecast
failure and the coincidence of their forecasts in the more recent elections.


Institutionalization of a Generic Forecast 


Given the deficiency of survey research technology for producing accurate forecasts,
the consequent uncertainty in the allocation of the last parliamentary seat in
each electoral district, and hence the likely variation of possible forecasts across
polling firms, how does one explain the coincident forecasts in all the elections
since 1989? This paradox can be explained if the polling firms are considered
open organizational systems affected by two key elements of their cultural and 
relational environments: an institutionalized forecast that the firms collectively 
construct and the interdependence among firms that results from sharing a 
common source of social legitimacy. 

The observed coincidence of forecasts across polling firms is the firms’ social 
construction of an institutionalized prediction of the electoral result. Specifically, 
a generic forecast consists of the convergence of the ongoing individual firms’ 
forecasts such that the trend depicted by these results indicates a common 
prediction of the rank order of the political parties in the elections, as well as the 
approximate number of parliamentary seats obtained by the parties. 

From a cognitive approach, institutions constructed by social interaction are 
used as a common frame of reference to justify the past and guide new actions 
(Berger and Luckmann 1967; Weick 1979, 1993). Accordingly, a generic forecast
is a referent that guides and constrains the firms’ subsequent forecasts within 
a forecasting context characterized by high uncertainty. The generic forecast 
produces a consistent order across polling firms’ forecasts and allows them to 
articulate a coherent post-electoral explanation of their failures that prevents 
their social delegitimation. 

The emergence and maintenance of an institutionalized generic forecast 
result from two complementary processes that I consider in turn: individual
decisions that create and sustain an institution and the stability and automatic
maintenance (i.e., inertia) resulting from the taken-for-granted character of
social institutions.


MIMESIS AMONG POLLING FIRMS 


A generic forecast emerges from the convergence of consecutive individual forecasts 
by the polling firms. These organizations, facing high uncertainty in deciding 
their forecasts, individually decide their forecasts according to the previously 
published forecasts of other firms (i.e., the generic forecast). A firm examines 
previously published forecasts for cues about an appropriate conventional forecast 
that will allow that firm to remain properly aligned with other firms' forecasts 
(Goffman 1959). The decision to mimic the generic forecast is based on a firm's 
individual interest in avoiding standing out and thus preserving its individual 
prestige relative to other firms' prestige in the likely event of forecasting failure 
(Meyer and Rowan 1977; DiMaggio and Powell 1983). Such adjustment of its 
forecast implies that a firm decouples its decision from the technical activity on 
which the production of forecasts should be based (Meyer and Rowan 1977). A 
firm allocates parliamentary seats to political parties according to the forecasts 
already published by other firms rather than by relying on the uncertain cues 
from its own deficient survey estimates of the vote intention. Because all polling 
firms—aware that they face the same uncertain situation—behave similarly, over 
time they collectively construct a generic forecast. 

A firm deciding not to conform to the generic forecast could gain individual 
prestige if its forecast turned out to be closer to the electoral result than the generic 
forecast. Conversely, its deviation from the generic forecast would harm its prestige 
if its forecast were farther from the electoral result than was the generic forecast 
socially constructed by other firms (especially if it were the only firm deviating 
erroneously from the generic forecast). When a firm conforms to the generic
forecast, it remains unnoticed in individual terms no matter what the electoral 
result, and its level of individual prestige relative to other firms' prestige remains 
unaffected. In the more recent elections, the Spanish polling firms have opted to 
adjust their forecasts to the generic forecast. Apparently, firms consider that the 
harm to a firm's individual prestige that would result from offering a forecast 
more erroneous than a generic forecast exceeds any prestige gain that the firm 
might obtain if its deviant forecast is closer to the electoral result. 

The pollsters cannot admit that they mimic the forecasts of other polling firms. 
Any public acknowledgement of this behavior would risk the firms' legitimacy 
(similar delegitimation would occur if they were to acknowledge publicly the 
technical limitations to producing accurate forecasts). The presumed scientific 
foundation of their forecasts requires that the firms maintain the public's belief 
that each polling firm bases its forecast on its own survey research and not on 
the forecasts published by other polling firms.

Nevertheless, relying on a methodological logic of falsifiability (Popper 1959;
Stinchcombe 1968), my analyses corroborate the existence of mimesis among 
polling firms by excluding the other plausible alternative explanation of how 
the convergence of failed forecasts across firms occurred in the more recent 
elections. Once I have shown such convergence, then disproving the polling 
firms’ alternative explanation about the cause of the convergence reasonably 
implies that mimesis of forecasts among the polling firms—the only plausible 
remaining mechanism—occurs and causes their convergence. Hence, I find 
support for neoinstitutional propositions on isomorphism by mimesis and the 
social construction of institutions. 

The conformity of the polling firms to the generic forecast allows for the 
crucial collective benefit of safeguarding the social legitimacy of both the survey 
technology and the production of forecasts. The coincidence of failed forecasts 
across firms is essential to protect that legitimacy. If the firms were to publish 
divergent forecasts, their incongruity would damage their social legitimacy. It is 
not logical that the interviewees would respond differently to surveys of different 
firms (conducted within the same period) and that these surveys would thus 
yield divergent forecasts. Interviewees should logically respond identically to the 
survey questions of every polling firm, and thus similar systematic measurement 
errors that may affect all firms’ surveys would produce similar deviation of all 
the firms’ forecasts with regard to the electoral result (i.e., the polling firms’ 
explanation of their failure). 

The importance of the convergence of forecasts across polling firms for their 
social legitimacy suggests an additional motivation for polling firms to mimic the 
forecasts published by other firms. Thus, the construction of a generic forecast 
would also imply an implicit or explicit agreement among polling firms to offer 
coincident forecasts (i.e., collusion). Some journalists and opinion leaders
finally considered this possibility after the 2000 election (in this regard, see Wert 
[2000] quoted above). The polling firms (either all or most of them, with a few 
outliers and/or followers whose motivation would only be to avoid standing 
out) would either explicitly communicate their intention not to deviate from a 
consensus forecast or, more subtly, signal to one another through their forecasts 
their continuing conformity to the previously published forecasts.


TAKEN-FOR-GRANTED EFFECT OF INSTITUTIONS 


The second process involved in the emergence and maintenance of a generic 
forecast refers to the taken-for-granted effect of institutions (Scott 2001). 
Consecutive conformity of the forecasts of additional individual firms to a generic 
forecast generates a socially self-reinforced bandwagon effect and enhances 
the institutionalization of the generic forecast, in the sense that this forecast 
becomes more taken for granted as the correct prediction of the electoral result 
(Meyer and Rowan 1977; Knoke 1982; Tolbert and Zucker 1983; Abrahamson
and Rosenkopf 1997). Thus, the polling firms become more constrained over
time in their ability both to conceive and publish subsequent forecasts. With a 
higher level of institutionalization, a deviant forecast would be more noticed, 
thus increasing the risk for a firm to stand out and harm its individual prestige 
if its deviant forecast turns out to be farther from the electoral result than the 
generic forecast. 

The earliest form of generic forecast of a future parliamentary election that 
the polling firms can observe is the actual result of the previous parliamentary 
election. In the absence of relevant changes in the system of political parties, that 
antecedent is a certain starting point for the next electoral forecasts. The results of 
other elections held between two parliamentary elections, such as those to elect 
city and Autonomous Community governments or the European Parliament, 
also provide cues about the evolution of the vote. Finally, any published political 
forecast affects the institutionalization of the generic forecast, although the most 
relevant ones are the electoral forecasts published close to the parliamentary 
election, which predict distribution of parliamentary seats across parties. Because
of the inefficacy of the forecasting technology in successive elections, the generic 
forecasts socially constructed by the polling firms have differed from the actual 
electoral results, though, by chance, forecasts and electoral results might have been 
close, as happened in the 1989 election. The true vote intention in an election 
and the consequent distribution of parliamentary seats among political parties 
have followed their own trend apart from the firms' generic forecast. 

The convergence of forecasts occurring in the elections since 1989 supports 
the preceding arguments. In these elections, the number of electoral forecasts 
published was considerably higher than in earlier elections, therefore allowing 
for the institutionalization of a generic forecast. Table 4 presents the results of 
an analysis for the 1986 to 2000 elections that considers specifically the existence 
of the institutionalization process I have described. This analysis investigates 
whether, within each election, the forecasts about the first three political parties
became more homogeneous as additional forecasts were published. At each
election, I calculate the standard deviation of three consecutive sets of forecasts
with coincident date of publication (the 1986 and 2000 elections allow only two
chronological sets).
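
The check for convergence over time can be sketched as follows, assuming that forecasts are grouped into chronological sets and that dispersion within each set is measured with the N-divisor standard deviation (Python's `statistics.pstdev`). The function names and the seat figures are hypothetical and are not data from Table 4.

```python
from collections import defaultdict
from statistics import pstdev


def sd_by_chronological_set(forecasts):
    """Map each chronological set to the population SD of its seat forecasts.

    `forecasts` is an iterable of (set_index, seats) pairs, where set_index
    orders the sets by publication date (1 = earliest).
    """
    grouped = defaultdict(list)
    for set_index, seats in forecasts:
        grouped[set_index].append(seats)
    return {k: pstdev(v) for k, v in sorted(grouped.items())}


def converges(sds):
    """True if dispersion falls from each chronological set to the next."""
    values = list(sds.values())
    return all(later < earlier for earlier, later in zip(values, values[1:]))


# Hypothetical seat forecasts for one party across three chronological sets.
polls = [
    (1, 150), (1, 170), (1, 160), (1, 180),
    (2, 160), (2, 168), (2, 164),
    (3, 163), (3, 165), (3, 164),
]
sds = sd_by_chronological_set(polls)
print({k: round(v, 2) for k, v in sds.items()})
print(converges(sds))
```

A falling sequence of standard deviations is the pattern that the institutionalization argument predicts; an increase between two sets, as in the first two sets of 1989, would make `converges` return `False`.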

The overall results of the analysis indicate the institutionalization of a generic 
forecast in those five elections. In the 1986, 1993, and 1996 elections, the standard 
deviation diminishes over time for the three political parties, indicating the 
increasing convergence among forecasts in each chronological set. In the 1989 
election, the standard deviations for each of the three parties increase from 
the first to the second chronological set, contradicting the institutionalization 
argument, but decrease from the second to the third, supporting that argument. 
Finally, in the 2000 election the standard deviation of the two chronological sets 
analyzed diminishes for two parties but increases (by .39) for the other party. 
The mixed result in this election suggests that the level of institutionalization
of a generic forecast may vary for each political party. In addition, the fact that
the number of forecasts in the 2000 election was lower than in previous recent 
elections supports the argument suggesting that the institutionalization of a 
generic forecast increases with the number of forecasts published. 


INSTITUTIONALIZATION OF THE POPULATION OF POLLING FIRMS 


The analysis presented in Table 1 indicates the absence in the earlier elections 
of coincidence among the wrong forecasts of different polling firms. This 
circumstance put at risk the collective social legitimacy of the firms. In the more
recent elections since 1989, the analyses presented show the convergence among 
erroneous forecasts. This trend has allowed the polling firms to attribute their 
failure in each election to specific behaviors of the electorate and the interviewees, 
thus preserving the firms’ legitimacy and allowing their forecasting activity in 
subsequent elections despite the repeated failures. A final aspect that requires 
consideration is the reason for the historical change from divergent forecasts in 
earlier elections to convergent forecasts in the more recent elections. 

This historical change is related to the institutionalization of the population 
of Spanish polling firms and to arguments previously presented. The main 
indication of the institutionalization of the population of polling firms is the 
growing number of organizations making electoral forecasts purchased by the 
media. Other indications of the institutionalization of that organizational population
are the continuing presence of the same firms election after election since 1986,
their membership in professional and industrial associations (that on occasion
have taken on the responsibility of publicly communicating the firms’ explanation 
of the forecasting failures), and the firms’ growing awareness of their involvement 
in a common enterprise (DiMaggio and Powell 1983). The last is evident in the 
CEO quote above about common circumstances faced by all firms. 

In the earlier elections, the polling firms individually experienced the 
insurmountable technical limitations of forecasting accurately the distribution 
of parliamentary seats. Moreover, in both 1977 and 1982, the fact that the two 
firms making the two forecasts for each election published them the same day 
prevented each firm from learning the other firm’s forecast through the media 
and thus their deliberate convergence. The forecast failures, disclosed election 
after election, made each firm aware that relying exclusively on its own survey 
research could result in a deviation from other forecasts published, thereby risking 
its individual prestige if its forecasts were to deviate more from the electoral 
result than the other firms’ forecasts. Moreover, the amount of dispersion
among forecasts in the earlier elections (with the noted exception of the 1979
election) threatened the legitimacy of the electoral polls because the public could
easily observe the incongruence of contradictory erroneous forecasts across the 
polls. Initial public concern about the reliability of electoral forecasts appeared 
in newspapers and other media after the 1982 election. The criticisms increased 
in virulence with the subsequent failure of 1986 and have remained in the public 
electoral debate ever since. The polling firms over time would have become 
increasingly attentive to their interdependence and learned that isomorphism, 
by offering coincident forecasts, helps to dampen, although not eliminate, these 
criticisms and thus to preserve the firms' individual and collective legitimacy 
despite repeated failures. 


Conclusion 


The analysis of key elements in the cultural and relational environments of the 
Spanish polling firms shows how these elements are intertwined and mutually 
affect their relevance in shaping the interests and decisions of these organizations 
(i.e., their social constitution as actors). I have explained how the convergence 
among the firms’ forecasts in the more recent elections yielded a generic forecast 
that imposes order in the firms’ production so that cultural expectations about 
forecasting are met. Such order is a necessary condition for the logic of the 
firms’ explanation of their failure to predict correctly the electoral results. The 
convergence of firms’ forecasts requires that the firms decouple their decision of 
what forecast to publish from the technological forecasting process. The firms’ 
explanation attempts to buffer the technology that confers legitimacy on their 
activity from any responsibility in the failures, even though, as I have explained, 
that technology is ineffective at correctly forecasting the party distribution of 
parliamentary seats in the Spanish parliamentary elections. 

Neoinstitutional propositions about isomorphism by mimesis, as well as about 
the social construction of institutions, are relevant in explaining the convergence 
of forecasts across polling firms. My study indicates the importance of collective 
aspects in processes of institutional isomorphism: the social construction of 
the institution adopted as a convention, and the dependence of a collectivity's 
social legitimacy on the isomorphism of its members. Some collectivities may 
be especially dependent on members' isomorphism if preserving the source 
of their legitimacy (or reputation or status) requires consensus among them. 
This circumstance may delimit the extent and the form of competition among 
the members in the collectivity. In such a context of weakened or conditioned 
competition, explicit or tacit forms of cooperative relationships may emerge that 
benefit the collectivity, because competition-related uncertainties are reduced or 
the common social legitimacy is consolidated. That circumstance is likely to be 
the case for organizations whose activity is socially legitimated by expert and 
professional knowledge, such as hospitals, research organizations, consulting and 
auditing firms, and organizations that make all kinds of prospective analyses, 
from elections to stock markets. 

Therefore, professional knowledge would have two effects on processes of 
institutional isomorphism. The first consists of normative pressures impinging 
on an organization from its environment (DiMaggio and Powell 1983). In 
such an instance, common forms of expert knowledge are internalized within 
organizations through the incorporation of professionals, and by other less direct 
mechanisms. A second effect, exemplified in my study, consists of that same 
knowledge constraining organizational decisions—provided that the legitimacy 
of the organizations’ activity is based on that knowledge—and resulting in the 
convergence of the products placed in society by similar organizations. My 
investigation considered the extreme circumstance in which production outcomes 
are subordinated to the goal of preserving the legitimacy of the scientific 
knowledge on which the forecasts should normatively be based. In this regard, 
I indicate how the interests underlying organizational decisions are affected 
by both cultural understandings attached to that knowledge and the resulting 
interdependent relationship among polling firms. 


Epilog 


The election following the series of elections investigated here was held on March 
14, 2004, three days after a terrorist attack in Madrid killed 191 and injured more 
than 1,500 people who were traveling in four trains in which bombs had been 
placed. This terrorist attack, by far the deadliest ever in Spain, caused 
great agitation in the country. Political parties decided to conclude the electoral 
campaign that day, and millions of people demonstrated showing their sorrow 
for and solidarity with the victims. Initially, the government firmly pinned 
responsibility for the attack on the Basque terrorist organization ETA, despite the 
fact that this organization announced that it was not responsible for the attack. 
Some political parties and media accused the government of concealing the 
possibility that a terrorist group linked to Al Qaeda could actually be responsible. 
In the evening of the day of the attack, the government rectified its initial position 
and acknowledged the possibility that Al Qaeda could be responsible, because 
during the morning police had found an automobile with explosives and an 
audiotape with Koran verses. 

The controversy over how the government was informing the public about 
responsibility for the terrorist attack arose during the two days before the 
election. Thousands of people demonstrated at the headquarters of the 
government’s party, Partido Popular (PP), in several cities, accusing the government of lying, 
manipulation, and responsibility for the deaths of the murdered people because 
of its decision (without parliamentary support) to send troops to the Iraq war, 
which would have made Al Qaeda target Spain. The war had been a major political 
issue in the country for more than a year, and one of the electoral promises of 
the second party in the Parliament, Partido Socialista Obrero Español (PSOE), 
had been the return of the Spanish troops from Iraq within a short period. 
Eventually, the terrorists were identified and responsibility for the attacks placed 
on a group linked to Al Qaeda. 

How well did electoral polls predict the 2004 election? Both pre-election 
and exit polls failed. Pre-election polls forecasted PP to win over PSOE; and 
the exit polls, made after the terrorist attack, forecasted a tie between these two 
main parties. Conversely, PSOE turned out to be the clear winner of the election by 
a difference of 18 parliamentary seats over PP. Did the terrorist attack affect the 
results of this abnormal election? Probably it did, in which case this event would 
be one of the causes of the differences between pre-election forecasts and the 
electoral results.³³ Would the forecasts have been correct in the counterfactual case 
that the terrorist attack had not occurred? We will never know, but the technical 
deficiencies of the forecasts explained above suggest that the 2004 polls most 
likely would have failed again anyway. 


Notes 


1. Electoral forecasts have been criticized in the Spanish media for hindering the democratic 
electoral process because they facilitate strategic voting of the electorate to the detriment 
of ideological voting, affect the mobilization of the electorate, and may provide incorrect 
predictions of the electoral results. Political parties usually express ad hoc skepticism and 
criticisms about the electoral polls depending on how the forecasts affect their immediate 
electoral interests. 


2. The role of abstract knowledge systems and their scientific character in the social legitimacy 
of professional activities is considered by the literature on professions (e.g., Freidson 1986; 
Abbott 1988; Latour 1987). 


3. In a similar vein to my argument on collective benefit, Meyer and Rowan's (1978) study 
of educational organizations suggests that ritual adoption of institutions decoupled from 
actual activities may benefit the system formed by state, community, administrators, and 
schools because it results in little evidence of ineffectiveness, conflict, or inconsistencies. The 
legitimacy of those institutions is maintained in order to sustain the system itself. 


4. Emirbayer and Goodwin (1994) suggest the importance of considering both relational 
and cultural aspects in sociological research. A few organizational and economic sociology 
studies have considered the reciprocal effect between cultural aspects and social relationships. 
Fligstein (1990, 1996) explores the effect of cultural elements on market actors' definition 
and interpretation of their relationships. Podolny (1993) proposes a model of market 
competition based on perceived producers’ status as a signal of product quality; status depends 
on the ties of a producer with other actors in the market (customers, other producers, 
and third parties). Baker, Faulkner, and Fisher (1998) analyze the effect of prevailing 
institutionalized norms about rules of exchange between advertising agencies and their clients 
on the duration of agency-client ties. 


5. Organizational technologies may be considered analogous to the schemes and scripts 
of human cognition. For individuals and organizations, respectively, both cognitive and 
technical systems function as guidelines in the assimilation and processing of information, 
giving coherence to the relations of those using the same system. Similarly, neoinstitutional 
theory has indicated that isomorphic organizational structures facilitate communication and 
coordinated action among organizations (DiMaggio and Powell 1983). 


6. Weber ([1922] 1947) defined substantive rationality in market transactions as characterized 
by individual "utilities" that require that actors subordinate individual opportunistic behavior 
to anticipated advantages emerging from good standing in a collectivity. 


7. These two problems undermine the accuracy in predicting the electoral result. The firms 
never publish the number of people that do not answer the questions of the electoral polls, 
even though that is a requirement of the Spanish electoral law (Régimen Electoral General, 
1985, Section VIII, article 69, 1, c.). The firms publish the percentage of undecided interviewees 
on very few occasions. Some examples of the percentage of undecided vote in polls published 
one week before the elections are: 11% in a 1977 poll; 41% in 1979; 45% in 1982; and 32% 
in 2000. 


8. These projections of the undecided interviewees usually consider the respondent's vote in 
previous elections, political preference (left-wing/right-wing scale), sympathy for political 
parties and candidates, and opinion about agenda issues and specific cleavages of the political 
context. 


9. An exception to this is Gallup Institute. This polling firm has rejected requests to forecast 
the distribution of parliamentary seats after it misforecasted the 1986 election (Martí-Font 
1996). 


10. This level of sampling error is based on the usual assumption of simple random 
sampling and binomial probability, although the actual research scenario is characterized by 
multinomial probability because each constituency usually has from 5 to 20 voting options 
(Díez-Nicolás 1996). Sampling error occurs in all research that uses samples and equals plus 
and minus the critical value for a specific significance level times the standard error of the 
sampling distribution. 
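
The definition in this note can be illustrated with a minimal sketch, assuming simple random sampling and a binomial proportion, as the note itself does. The function name, the 95 percent confidence level, and the example values are hypothetical and are not taken from the polls analyzed in the article.

```python
from math import sqrt
from statistics import NormalDist


def margin_of_error(p, n, confidence=0.95):
    """Sampling error (plus and minus) for a proportion p from a sample of size n.

    Assumes simple random sampling and binomial probability.
    """
    # Two-sided critical value for the chosen confidence level (about 1.96 for 95%).
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    # Standard error of the sampling distribution of a proportion.
    standard_error = sqrt(p * (1 - p) / n)
    return z * standard_error


# Hypothetical example: a party at 50% vote intention in samples of two sizes.
small_sample_error = margin_of_error(0.5, 1000)
large_sample_error = margin_of_error(0.5, 10000)
```

Under these assumptions the error shrinks with the square root of the sample size, so multiplying the sample by ten reduces the error by a factor of only about 3.2.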


11. Paradoxically, a larger sample size has not produced more accurate forecasts. For example, 
the polling firm outlier in 1996 (O), using samples of less than 2,500 cases, published 
forecasts closer to the electoral result than the firms that used samples ranging from 10,000 
to 15,000 cases. 


12. The political party Partido Socialista Obrero Español (PSOE) has been one of the two 
main parties in all eight parliamentary elections. The party Unión de Centro Democrático 
(UCD) was the other main party in 1977 and 1979. The party presently called Partido 
Popular (PP) (also with its previous denominations, Alianza Popular [AP] and Coalición 
Popular [CP]) was the other main party from 1982 through 2000. I do not include more than 
two parties in these graphs for the sake of simplifying the presentation. The graphs include 
all polls with forecasts of parliamentary seats published starting about 75 days before the 
election (the moment that the government indicates the election day, which coincides with 
the time newspapers begin publishing forecasts of the distribution of parliamentary seats 
among parties) until the fifth day before the election day (the legal deadline for publishing 
electoral forecasts, established in 1985). The graphs also include the exit polls made the day of 
the election and broadcast by radio and television. Analyses not presented here considering 
the deviations in the vote intention forecasts from the electoral result reveal high coincidence 
across polling firms, but this kind of forecast is not as critical for my arguments as are the 
forecasts for parliamentary seats. 


13. The graph of the 1993 election omits some dates to avoid clutter. The vertical axis in 
five out of the eight graphs has a range of 100, but the ranges for the 1982, 1986, and 1989 
elections are wider to accommodate larger differences in the results and forecasts between 
the two main political parties. 


14. Nevertheless, analyses not presented here of the failure of forecasts at electoral district 
level suggest that the relative polling success in 1989 is doubtful, since it may have resulted 
from compensations between over-estimated and under-estimated forecasts at the district 
level, from which the national electoral result is computed. 


15. This exception does not necessarily reflect the same convergence processes suggested for 
the 1989 to 2000 elections. The coincidence of the two polls published in the 1979 election 
could be the result of chance. 


16. See the note in Table 1 for technical details about this analysis. The same analysis using 
another dispersion statistic (the average absolute deviation instead of the standard deviation) 
yields the same results that I present here. 
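
The two dispersion statistics named in this note can be compared with a short sketch. The seat forecasts below are hypothetical, and the use of the sample (n - 1) standard deviation is an assumption, since the exact variant is specified only in the note to Table 1.

```python
from statistics import mean, stdev


def average_absolute_deviation(values):
    """Mean of the absolute distances of each value from the mean."""
    center = mean(values)
    return mean([abs(v - center) for v in values])


# Hypothetical seat forecasts for one party from three polls in one election.
seat_forecasts = [150, 160, 170]

dispersion_sd = stdev(seat_forecasts)  # sample standard deviation (assumed variant)
dispersion_aad = average_absolute_deviation(seat_forecasts)
```

Both statistics fall as the forecasts converge, which is why either can serve as a measure of coincidence across polls.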


17. The cases refer to the forecasts in the 52 electoral districts for the three first political 
parties in each election, plus the seven forecasts about the main Catalan and Basque nationalist 
parties (these two parties usually are the fourth and fifth parties in the elections, running 
only in four and three electoral districts, respectively). 


18. The simpler factor model tested for the 1986 election is not fully comparable to the 
more complex models of the 1989 to 2000 elections. I tested the simpler model for these 
later elections, and the results confirm the different conclusions for the earlier election of 
1986 and the 1989 to 2000 elections. 


19. Another expectation is that different polls should not offer identical results, but very 
similar ones. The observed differences among individual polls might partly reflect firms' 
attempts to present a forecast closer to the political preferences of the particular media 
ordering the poll. The failure of the forecasts, though, cannot indicate manipulation of the 
electorate since there is no partition into two or more sets of forecasts hypothetically competing 
for alternative manipulation attempts of the public opinion. Moreover, the difference between 
forecasts and electoral results remains in the exit polls, which are published after the election 
and thus cannot influence the electorate's vote. 


20. The CEO indicated that the coincidence of the forecasts in the 2000 election resulted 
from identical decisions (that he qualifies as a "question of opinion" or "interpretation") 
made by all the pollsters. The coincidence of forecasts allowed the pollsters to attribute the 
polling failure to their miscalculation of the electorate's behavior during the closing days 
of the electoral campaign. Furthermore, this CEO argued that the firms' miscalculation, 
and consequently the failure of the polls, would not have occurred without the limitation 
to their research posed by the legal deadline that forbids the publication of forecasts later 
than five days before the election. His argument is incorrect considering that in the 2000 
election—the one considered by this pollster, but also in previous elections—the three exit 
polls also coincided with the erroneous generic forecast (see Figure 8), and these polls were 
obviously unaffected by that legal deadline. 


21. These political sections are the 17 Autonomous Communities mentioned above. Because 
the analysis requires the comparative examination of districts within a section, it is only 
feasible for the 10 Autonomous Communities with at least two electoral districts. This 
requirement reduces the analysis to 43 of the 52 Spanish electoral districts. 


22. In eight of the ten political sections analyzed, one index is calculated for each of the three 
leading political parties. In the other two sections, a fourth index is calculated because the 
main nationalist party is also considered. 


23. For example, for a section with three electoral districts, the formula yields the following 
computation: 

$$\left[\,\lvert (S_1 - R_1) - (S_2 - R_2) \rvert + \lvert (S_1 - R_1) - (S_3 - R_3) \rvert + \lvert (S_2 - R_2) - (S_3 - R_3) \rvert\,\right] / 3$$
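
A minimal sketch of this computation follows, reading the formula as the average absolute difference between the forecast deviations (forecast minus result) of every pair of districts in a section. That reading, the function name, and the example numbers are assumptions. Dividing by the number of pairs equals dividing by 3 in the three-district case shown in this note, and it may differ from the author's formula for sections of other sizes.

```python
from itertools import combinations


def section_index(forecasts, results):
    """Average absolute pairwise difference between district-level deviations.

    forecasts and results hold one party's seats per district, in the same order.
    """
    if len(forecasts) != len(results) or len(forecasts) < 2:
        raise ValueError("need matching lists covering at least two districts")
    # Deviation of the forecast from the electoral result in each district.
    deviations = [f - r for f, r in zip(forecasts, results)]
    # Every unordered pair of districts within the section.
    pairs = list(combinations(deviations, 2))
    return sum(abs(a - b) for a, b in pairs) / len(pairs)


# Hypothetical section with three districts: deviations are 1, 0, and -2.
example_index = section_index([5, 3, 4], [4, 3, 6])
```

An index of zero would mean that the forecast erred by the same number of seats in every district of the section.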


24. The Spanish parliamentary election campaign focuses on political issues, leaders, and 
parties at the national level rather than on local issues. Therefore, I assume that events 
occurring during the electoral campaign affect the electorate similarly across electoral districts 
with similar sociopolitical characteristics, that is, those within the same political section. The 
restriction of the comparisons in my analysis to electoral districts of the same Autonomous 
Community controls for possible sociopolitical differences across these political sections. 


25. The outlier in the 1993 election (polling firm A) presented a forecast that deviated from 
the generic forecast and was much farther from the electoral result. That firm did not publish 
electoral polls in the following elections. The outlier of the 1996 election (polling firm O) 
presented several forecasts that were closer to the electoral result than the generic forecast. This 
firm made electoral polls in the next election, as did almost all other firms that conformed 
their individual forecasts to the generic forecast in the 1996 election. 


The following argument suggests the extent to which the firms' common source of legitimacy 
makes the firms interdependent: a polling firm that publishes a wrong forecast different from 
the generic forecast is better off if the forecasts of the rest of the firms conform to a generic 
forecast (even a wrong one). This generic forecast allows firms to preserve the legitimacy of 
the forecasting technology, and that deviant firm will still be able to use the firms’ common 
source of legitimacy in the future and thus have a chance to restore its individual prestige. 


26. Replicating a survey with pooled independent samples is a technically appropriate statistical 
practice to improve the estimates. Nevertheless, if the polling firms in the case investigated 
tried that research strategy, they would still be considering a set of parliamentary seat 
forecasts that are all estimated with the technical limitations discussed in a previous section 
of this article. Therefore, that research strategy would not improve the firms’ forecast. 


27. In this regard, Fligstein (1996, 2001) indicates that a key aspect of markets is firms’ 
actions to stabilize their competitive relationships to avoid fierce competition that could 
threaten their survival. 


28. The arguably low concern among polling firms about losing competitive advantage over one another 
would facilitate collusive practices. Firms probably prefer not to stand out in case of failure 
rather than to try differentiating their forecasts. Firms’ competition to obtain media orders 
for forecasts is decreased by the fact that firms can only offer one poll at a time, and they 
usually work for the same media client election after election, which suggests the existence 
of stable commercial relationships between specific polling firms and media. 


29. Polls forecasting the distribution of parliamentary seats of a parliamentary election are 
made during the two or three months before the election date. The polling firms also publish 
surveys during the four years between parliamentary elections addressing diverse issues of 
the political agenda and the vote intention regarding political parties. 


30. See technical details of the analysis in the notes of Table 4. The analysis is feasible only 
for the elections 1986 through 2000 because the few polls published in the 1977, 1979, and 
1982 elections do not allow calculation of standard deviations for different chronological 
sets of polls. 


31. A total of 26 polling firms have forecasted parliamentary seats during the eight 
parliamentary elections investigated. The number of firms varied from election to election: 
two firms in the 1977 election; two firms in the 1979 election; three firms (two of them collaborating 
on one poll) in the 1982 election; seven firms in the 1986 election (two collaborating); seven firms in 
the 1989 election; ten firms in the 1993 election (two collaborating); eight firms in the 1996 election 
(two collaborating); and ten firms in the 2000 election (four firms formed two collaboration 
sets). These firms were the more prominent polling firms in Spain in a given period. Most 
of them are Spanish, founded during the democratic period analyzed, and a few belong to 
multinational groups. 


32. An analysis of the survival capability of the polling firms over time is beyond the scope 
of this study since that phenomenon is affected by many elements of the industry and the 
individual firms apart from forecasting the Spanish parliamentary elections, which is only 
one (albeit the most publicly relevant) among their many survey research activities. Nevertheless, 
the wrong forecasts of the parliamentary elections can greatly influence the overall prestige 
and legitimacy of the polling firms. Importantly, the electoral polls are one of few polling 
activities whose accuracy can be measured, as I have done, by comparing the forecasts to 
the electoral results. 


33. Although the exceptional circumstances resulting from the terrorist attack could explain 
the differences between the electoral forecasts and results for PP and PSOE, it is unlikely 
that those same circumstances could explain the substantial overestimation (by a factor of 
two) in all electoral polls of the parliamentary seats gained by IU, a political party firmly 
opposed to the Iraq war. 


The Grass Widows of Mexico: Migration and 


Abstract 


This article provides an empirical test of the widely accepted assumption that 
migration contributes to union instability. The data come from the Mexican 
Migration Project (MMP) data base MMP93. We use multilevel discrete time event 
history analysis to specify the odds of union dissolution for male household heads 
by individual- and community-level U.S. migration experience. In the context of 
the U.S.-Mexico migration flow, we find that U.S. migration significantly increases 
the odds of union dissolution for individuals with extensive migration experience as 
well as for residents in communities with medium international migration levels. 
We conclude that changes in normative values and social control levels, for both 
individuals and communities, are partial contributors to this relationship. 


More than one hundred years of continuous movement across the U.S.-Mexico 
border has led to a unique arrangement of family life for many Mexicans. Unlike 
the majority of other U.S.-bound immigration flows, a stable feature of Mexican 
migration has been the circular migration of men, while spouses and children 
remain in the origin community (Durand et al. 2001; Kana'iaupuni 2000b). 
This arrangement provides migrants with access to scarce capital while at the 
same time allowing them to avoid the major lifestyle changes and restrictive 


* The authors are listed in alphabetical order denoting equal contributions. This work was supported 
by a National Institute of Child Health and Human Development (NICHD) postdoctoral fellowship 
at the University of Chicago and a NICHD predoctoral fellowship at the University of Texas at Austin. 
The authors would like to thank Nadia Flores, the Mexican Migration Project data manager, for 
her generous assistance and Kelly Raley, Robert Hummer, Rebecca Clark, Larry Bumpass, and Sam 
Field for their helpful comments. Direct correspondence to Reanne Frank, Department of Sociology, 
300 Bricker Hall, 190 N. Oval Mall, Columbus, OH 43210. E-mail: frank.219@sociology.osu.edu. 


© The University of North Carolina Press Social Forces, March 2005, 83(3):919–948 




legal barriers that make familial migration more difficult. The result has been 
an extraordinarily high rate of divided households, with husbands and wives 
separated across national boundaries for significant portions of their married life 
(Buznego 2001; Marroni 2000). For some families, the divided household is a one- 
time event, with the husband returning after a trip to the U.S. and permanently 
settling in Mexico. For a much smaller minority, the family permanently reunites 
when the wife and children join the husband in the U.S. But for the rest, the 
divided household becomes a stable feature of their family life, characterized by a 
repetitive pattern of departures, remittances, and return visits. In many Mexican 
communities with long traditions of migration to the U.S., this pattern of family 
life has become the norm (Gupta 2002). 

Yet while the divided household arrangement is often accepted as a necessary 
way of life among the majority of Mexican migrants and their families, it is not 
without its recognized disadvantages. For many, the embodiment of the negative 
costs associated with this situation is desertion. A preoccupation with familial 
abandonment by the migrant husband is omnipresent in the Mexican migrant 
narrative and has been documented in nearly every case study of Mexican 
migration (Buznego 2001; Davis 1992; Gupta 2002; Kana'iaupuni 2000b; Marroni 
2000; Salgado de Snyder 1993; Stephen 1991; Trigueros 1992; Viadro 1997). 
Results from a survey administered to 202 Mexican women living in Mexico with 
migrant spouses residing in the U.S. demonstrate the persistent fear of financial 
or complete abandonment exhibited among nonmigrant women who remain 
in Mexico. Almost half of the sample reported that they were worried that, “My 
husband may forget his customs”; “[he] may never come back home”; and “[he] 
may start a new family in the U.S.” (Salgado de Snyder 1993). 

The U.S.-Mexico migration flow is not the first to have been linked to the 
phenomenon of spousal desertion. In fact, the issue of abandonment was so 
visible in the earlier waves of the mass European migrations that the Jewish 
immigrant community established the National Desertion Board in 1911 and was 
immediately inundated with requests from grass widows, or agunot, an ancient 
Hebrew term resurrected to refer to the scores of abandoned women searching 
for their missing immigrant husbands (Fridkis 1981; Friedman 1982). However, 
in the European case, migration was overwhelmingly family-based, and the vast 
majority of desertions that did occur happened once the entire family was on 
U.S. soil (Baker 1995; Diner 1983). Eventually, families arrived and migration- 
related marital problems evolved into U.S. social problems. What is unique to the 
Mexican migration flow to the U.S. is that the issue of spousal abandonment is 
occurring within the context of a migration system that is built almost entirely 
on the premise of divided households. The Mexican migration flow has remained 
overwhelmingly circular, spanning national boundaries in a very gendered way 
for decades.¹ The result has been the creation of an enduring pattern of familial 
division that has spawned persistent concern over the prevalence and causes of 
spousal desertion. 

Yet surprisingly, given the centrality of desertion to the Mexican migrant 
narrative and the systematic features of the temporary migration flow that would 
support its occurrence, the nature and strength of the relationship between 
Mexican migration and spousal desertion is largely unknown. It follows that the 
aim of the present study is to undertake an empirical test of this often expressed 
anxiety in the case of the U.S.-Mexico migration flow. Our use of life history data 
that span both sides of the border allows us to quantify the association between 
migration experience and desertion, as measured by union dissolution, and to 
identify the features of the temporary migration system that may or may not 
contribute to its occurrence. It has been noted that many Mexican communities 
with long traditions of migration to the U.S. have experienced significant 
changes to their internal social structure. Because our data measure individual- 
level and community-level characteristics, we are able to test the possibility 
that community-wide migration-related factors exert an independent effect on 
individual-level union stability. We focus the analysis on union dissolution, as 
opposed to desertion, given the facility of accurately measuring the former and 
the difficulties implied in capturing the true incidence of the latter. 

These questions are important for a number of reasons. First, the divided 
household arrangement that has grown out of the U.S.-Mexico migration 
system involves a very high level of dependency. The survival of the majority of 
households in Mexico with migrant heads is pegged to the continued commitment 
of the migrant to that household while he is in the U.S. (Gupta 2002). Dependence 
on the migrant spouse is exacerbated by social norms in many smaller Mexican 
communities that discourage women from working outside of the home and the 
limited opportunities open to them even if they decide to do so (Kana'iaupuni 
2000b). The threat of desertion is made all the more acute by recent evidence 
suggesting that Mexico presents a particularly harsh environment for female- 
headed households, who have been defined as the country's "poorest of the 
poor" (Finley 2003). The precarious position of women with migrant spouses 
makes understanding the prevalence of desertion and identifying the factors that 
contribute to its incidence of clear consequence for the well-being of millions of 
Mexican women and their children. 

A multilevel analysis of the relationship between migration and union 
stability also provides valuable insights into how migration engenders change in 
the social structure and normative climate of many Mexican communities. An 
increasing amount of scholarly attention has shifted towards examining the effects 
of migration on origin communities, as opposed to an exclusive focus on the 
effects of migration on migrant-receiving destinations (Frank and Hummer 2002; 
Giorguli-Saucedo 2004; Kana'iaupuni and Donato 1999; Kandel and Kao 2000; 
Massey 1998). An examination of the relationship between the two demographic 
processes of migration and union stability will help us understand the ways in 
which migration affects the social structure of Mexican communities at various 
stages in the migration process. 

Migration-related union dissolution is not a phenomenon that affects Mexico 
overall. In general, union dissolution rates throughout the country are low as 
compared to those found in developed countries (Gómez de León 2001). Likewise, 
although U.S.-bound migration is an impressive demographic phenomenon in 
Mexico, only a minority of the population participate in it first hand. What makes 
migration-related union dissolution important then is not the sheer number 
of people that it affects but rather the force of its impact. The sectors of the 
population susceptible to migration-related union dissolution are precisely those 
that are least able to weather the negative economic and social effects associated 
with it. In addition, the uneven pattern of international migration in Mexico, 
which differentially impacts certain communities and segments of the population, 
is likely to concentrate the effects of migration-related familial break-ups and 
compound their influence. While the phenomenon of migration-related union 
dissolution may not be widespread, the critical importance of international 
migration to Mexico and the high number of dependent women and children 
that it creates make it a social problem worthy of our attention. 


Migration and Union Dissolution: The Case of Mexico 


It has already been noted that in the case of Mexico, migration often translates 
into long periods of spousal separation. In an analysis of married couples in 52 
migrant-sending communities in Mexico, Gupta (2002) found that in any given 
survey year, around 85% of all migrants were separated from their partners. The 
probability of experiencing spousal separation was found to increase during 
the first 12 years of the union, precisely the time period when intense family 
formation activity takes place and the household is trying to establish itself 
economically. While the majority of migrant households that participate in a 
binational living arrangement accept it as a normal part of their lives together, 
migration-induced spousal separation is likely to increase family instability. 
Using the case of union dissolution, the following section discusses the principal 
pathways through which migration has the potential to alter family stability. In 
the subsequent discussion and analysis, unions are defined as either a marriage 
sanctioned by the church and/or the state or an informal union. Union dissolution 
is thus defined as the ending point of either state. In contrast to much of the 
developed world, informal/consensual unions in Mexico and in Latin America 
are best described as surrogate marriages with full social recognition (Castro 
Martin 2002). Far from representing a new cultural arrangement, as they have 
been characterized in the U.S. and much of Europe, consensual unions in Mexico 
have been around for centuries and represent a traditional analog to formal 
marriages (Goode 1993; Solis 2004). 

In the broader literature on social integration and mobility, increased mobility 
is understood to be positively related to union dissolution to the extent that it 
implies decreased embeddedness in a framework of collective ties that stand as 
barriers to union fragmentation (Booth et al. 1991). In this sense, migration 
may lead to an increased risk of union dissolution to the degree that it decreases 
social integration and disrupts social ties. In an analysis of 48 in-depth interviews 
conducted with women living in three different villages in Western Mexico, 
Kana'iaupuni (2000) found that the social ties linking migrant men living in 
the U.S. to their friends and families in Mexico were compromised by their 
physical absence. As a method of compensation, male migrants were found 
to rely increasingly on their nonmigrant spouses back in Mexico to maintain 
their social networks and facilitate their social integration back into the home 
community upon their return. The degree to which migration decreases social 
cohesion in the case of male migrants and subsequently increases their risk of 
union dissolution has not yet been tested in the case of Mexican migration. There 
is existing evidence, however, in support of this possibility in the case of Puerto 
Rican women. Using data from the Puerto Rican Fertility and Family Planning 
Assessment (PRFFPA) and New York Fertility, Employment and Migration 
Survey (NYFEM), Landale and Ogena (1995) find that migration increases the 
individual chances of union disruption among women in Puerto Rico and on 
the U.S. mainland. The odds of union disruption were found to be particularly 
high for recent migrants in informal unions, leading the authors to suggest 
that a reduction in social support coupled with a reduction in barriers to legal 
disruption are two ways in which union instability is increased in the case of 
migrant women. 

By virtue of their physical absence, migrants are often free from the vigilant 
observation of kin and community that characterizes small-town Mexico. Although 
migrants usually use their social networks to migrate to locations in the U.S. 
where they have friends and family, the density of ties in their U.S. communities 
is considerably less than in their home communities (Gupta 2002; Kana'iaupuni 
2000b). Interviews with migrant couples split between Atlanta, Georgia, and 
Mexico confirm this point. Hirsch (1999) observed a feeling of anonymity 
among her respondents in the U.S., free from an audience that would normally 
monitor gendered behavior. Accompanying this new freedom was also a concern 
that, once exempt from codes of normative behavior, migrants may engage in 
experimentation and transgression. Evidence in support of this possibility comes 
from a survey of the sexual behavior of 39 male Mexican migrants in North 
Carolina. The respondents frequently expressed the ambiguities inherent in their 
long-distance marital status noting that they were “married there, single here” 
(Viadro 1997). As one of the respondents notes, “Men have more opportunities 
here. In Mexico . . . they live in the same city or town, everyone knows them... 
Here, it’s easier for them, because where is their wife? Who is going to tell her? 
How is she going to find out? So they feel free, free to do whatever they want” 
(quoted in Viadro 1997:48). 

A migrant's physical absence and frequent mobility may limit his susceptibility 
to social control, subsequently increasing the risk of transgressions and threatening 
union stability. Conversely, nonmigrants or migrants who remain embedded in 
family and community life in Mexico are likely to remain subject to a higher 
degree of social control. High population turnover is hypothesized to foster 
greater individualism and weaken social control over the actions of individuals, 
resulting in a weaker normative consensus regarding community life. Several 
studies have used aggregate data to demonstrate that population mobility is linked 
to increases in divorce (Breault and Kposowa 1987). Yet as Landale and Ogena 
(1995) have pointed out, these aggregate-level studies are unable to distinguish 
between migrants and nonmigrants, making it unclear whether population 
change is related to high divorce rates through changes in community-level 
social cohesion or whether migrants themselves are more susceptible to union 
disruption. The data used in the present analysis address this problem by 
including both individual- and community-level effects, allowing us to test the 
role of social cohesion in contributing to barriers to union disruption. 

Another pathway through which migration may influence the risk of union 
dissolution in the case of Mexican-U.S. migration is by altering normative values 
governing union behavior. The U.S. and Mexico are characterized by strong 
differences in the social acceptability of separation and divorce. In the U.S., 50 of 
every 100 marriages end in divorce. In Mexico, only 5.7 of every 100 marriages do 
so (Suarez-Lopez 2004). In an analysis of a nationally representative longitudinal 
sample of married persons, Booth et al. (1991) find that having a friend or sibling 
who is divorced is strongly related to the probability of experiencing a divorce 
oneself. Migration has the potential to influence this relationship by exposing 
migrants to a social climate where divorce is more prevalent and more socially 
acceptable than in the case of the origin community. By virtue of contact with 
U.S. culture, migrants may be more inclined to consider divorce a viable option 
even once they have returned back to Mexico. Migration may also engender 
change in norms regarding union formation and union expectations and/or 
ideals. In her work among Mexican couples in Atlanta, Georgia, and Mexico, 
Hirsch (2003) has found that U.S. migration experience is associated with a shift 
in marital ideals towards those that are characterized by cooperative decision 
making, heterosociality, and a less gendered division of labor. The extent to 
which partners' marital ideals change at different rates could affect the extent to 
which the couple experiences marital friction and is inclined to dissolve 
the union (Foner 1999; Grasmuck and Pessar 1991). Additionally, Hirsch (1999) 
notes that these new marital ideals are often based on a bond of desire rather than 
obligation, which may form a less solid basis for enduring marriages, resulting 
in the paradoxical effect of increasing the likelihood of union disruption. The 
degree to which migrants experience change in their ideals governing divorce and 
marriage is likely to be highly dependent on the extent of their contact with 
U.S. culture. Migrants who are more socially integrated into U.S. social life are 
more susceptible to the influence of U.S. norms and are more likely to alter 
their union behavior as compared to Mexican migrants who are not as socially 
integrated into U.S. life. 

Changes in norms regarding dissolution and/or union expectations may be 
transferred to Mexican communities via return migration. Levitt (1998) has 
called these transfers of ideas and norms that flow from receiving to sending 
country communities “social remittances.” Whereas economic remittances 
refer to the transfer of money and goods, social remittances involve transfers of 
ideas and information that have the potential to transform social life. Change 
in social norms governing the propensity to dissolve a union could operate 
individually on the migrants themselves, or communally, via return migration 
to the sending communities. In the case of fertility behavior, Lindstrom and 
Giorguli-Saucedo (2002) find that women residing in Mexican communities 
with higher rates of female migration have a lower risk of experiencing a birth 
during a given year. The authors conclude that, through nonmigrant women’s 
interpersonal communication about preferences and contraceptive methods with 
return female migrants, U.S. attitudes and ideas about childbearing and family 
size work to lower fertility rates among women who have never gone to the U.S. 
In communities marked by longstanding and widespread migration, the ideas 
and norms brought back by return migrants may gradually set the standard for 
migrants and nonmigrants alike. In the case of union instability, a reverse flow 
of social remittances influencing family formation behavior in Mexico may 
involve changes in familial roles and in the social expectations governing union 
formation/dissolution. 

A final consideration involves the role of selection—that is, a migrant’s 
selecting to migrate—in contributing to the relationship between migration 
experience and union stability. Although it is widely recognized that migration is 
a selective process, it is with considerably less frequency that the issue of selection 
is directly addressed in modeling the behavior and outcomes of migrants. In 
large part due to data constraints, the role of selection in contributing to the 
observed process is often left unknown. In the case of union stability, selection 
may contribute to the role of migration if migrants migrate to select themselves 
out of unstable unions. A proper test of this possibility would necessitate a 
binational data set that includes detailed information on union quality. Short of 
this, however, multilevel data permit us to evaluate the possibility that selectivity 
may increase the prevalence of union dissolution among migrants by determining 
if individual-level effects are accompanied by community-level effects. The extent 
to which community-level migration levels exert an independent effect on union 
dissolution indicates whether mechanisms other than selection are operating to 
influence the relationship between migration and union dissolution. 

The data for this analysis come from the Mexican Migration Project (MMP) data 
base MMP93, a binational data base consisting of information on both Mexican 
migrants and nonmigrants and their families. Respondents living in Mexico were 
interviewed as well as respondents living in the U.S. The inclusion of respondents 
from both countries is a necessary precondition for the present analysis, given that 
the relationship between migration and union instability can only be properly 
specified with information on nonmigrants in Mexico, U.S. migrants currently 
residing in Mexico, and U.S. migrants currently residing in the U.S. Ethnosurveys 
that collected basic social and demographic data were administered in 93 different 
communities within Mexico. Two hundred households from each community 
were selected using representative survey sampling. The communities were 
chosen to provide a range of different sizes, ethnic compositions, and economic 
bases. For each community, the surveys were administered during the two months 
in which the highest number of seasonal migrants had returned home (usually 
December and January) in order to maximize the number of household heads 
that were the principal respondents to the survey. In the event that the household 
head was absent (e.g., on a trip to the U.S.), a proxy informant was interviewed. 
In this way, if a married male household head was currently in the U.S. during 
the time of the survey, he was still included in the sample via the information 
provided by another household member. 

These surveys were then matched with nonrandom samples of 10-20 out-migrant 
households in each corresponding U.S. community. From each Mexican 
community, it was determined where the majority of migrants went; interviewers 
were then sent to those areas to survey those who had settled abroad. The U.S. 
surveys include the crucial subsample of migrants who had established their 
households within the U.S. This sample includes household heads who had 
permanently settled in the U.S. with their families, those who had permanently 
settled in the U.S. and had left their families in Mexico and/or created new families 
in the U.S. (i.e., union dissolvers), and single household heads who did not have 
any family in the U.S. or in Mexico. To ensure that the U.S. and Mexican samples 
represent an accurate binational sample, a series of weights were developed that 
represent the inverse of the sampling fraction employed at each Mexican and 
U.S. site. Once weighted and merged, the U.S. and Mexican surveys form a 
binational sample, representative of both migrant and nonmigrant communities 
in Mexico. 

Most important for the present analysis is the ability to distinguish between 
the relative timing of life-course events. Longitudinal data that specify pre- and 
postmigration behavior allow for the proper modeling of the aforementioned 
processes. The retrospective life histories collected by the MMP include 
information on the beginning and end dates of up to four marital/cohabiting 
unions of the household head as well as dates on all U.S. migratory trips, 
allowing us to use discrete time event history models to evaluate the risk of union 
dissolution by U.S. migration experience. Particularly important for the present 
analysis is the ability to distinguish the cause behind each union dissolution, that 
is, those unions that ended as a result of divorce or separation from those that 
ended as a result of widowhood. Prior to 1997, the MMP did not collect data 
that identified the cause behind union dissolution. For this reason, we restrict 
the sample to those communities surveyed after 1997, resulting in a sample of 41 
Mexican communities and their U.S. counterparts. Complete union and migration 
histories were collected only for the household head. Accordingly, this analysis is 
restricted to the union behavior of migrant and nonmigrant Mexican men. Given 
the male predominance of the Mexico-U.S. migration flow, the restriction of the 
analysis to male household heads parallels the composition of the flow itself. 


Measurement 


This article evaluates the risk of first union dissolution by U.S. migration 
experience. As mentioned earlier, union dissolution is chosen as the outcome 
variable instead of spousal desertion, which is more difficult to estimate accurately. 
Additionally, unions are defined as either a marriage sanctioned by the church 
and/or the state or an informal union. Currently, informal unions account for 
approximately 17% of all unions in Mexico and 15% of the unions in our sample 
(Gómez de León 2001). We restrict the analysis to first unions for two reasons: 
(1) factors predicting union dissolution differ for first and subsequent unions, 
and (2) a very small proportion of the sample had a second union. Only those 
unions that began after 1960 are included in the analysis, in keeping with changes 
in union patterns that resulted in an increase in divorce rates beginning around 
1960 (Bumpass 1990; Cherlin 1992; López et al. 2001). Union histories for each 
male head of household were constructed using information on the beginning 
and end date of the first union. A person-year file was created in which each 
individual contributes one year of information for each year they are in a union. 
The dichotomous dependent variable indicates whether a union ended within 
each year due to separation or divorce. Observations are additionally censored 
at the date of the interview or when unions end due to widowhood. The final 
sample comprises 83,405 person-years contributed by 4,582 male heads of 
household (4,401 interviewed in Mexico, 181 in the U.S.). 
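
The person-year structure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the field names, the `person_years` helper, and the decision to keep the year of widowhood as a final censored record are hypothetical and are not taken from the MMP files.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class FirstUnion:
    """Hypothetical record for one household head's first union."""
    start_year: int
    end_year: Optional[int]    # None if the union was intact at interview
    end_reason: Optional[str]  # "divorce_separation", "widowhood", or None
    interview_year: int


def person_years(union: FirstUnion) -> List[dict]:
    """Expand a union into one record per year the man is in the union.

    The outcome is 1 only in the year the union ends through divorce or
    separation. Unions ending in widowhood, or still intact at the
    interview, contribute only zeros (they are censored).
    """
    last_year = union.end_year if union.end_year is not None else union.interview_year
    rows = []
    for year in range(union.start_year, last_year + 1):
        dissolved = int(
            year == union.end_year and union.end_reason == "divorce_separation"
        )
        rows.append({
            "year": year,
            "duration": year - union.start_year,  # years since union began
            "dissolved": dissolved,
        })
    return rows


# Example: a union formed in 1990 that ended in separation in 1993.
example = person_years(FirstUnion(1990, 1993, "divorce_separation", 2000))
```

Each element of `example` corresponds to one row of the person-year file, and the dichotomous outcome is carried in the `dissolved` field.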

The primary independent variable of interest is the migration status of the 
male head of household. A categorical time-varying variable measuring the 
migration experience of the household head is constructed. This is a three-category 
variable indicating within each year whether or not the household head 
had ever migrated to the U.S. and, for those who had, whether they had experienced 
high or low cumulative levels of migration by that year. By distinguishing between 
extensiveness of U.S. migration experience, we hope to gain some purchase on 
which components of the U.S. migration experience affect union stability. For all 
male heads of household who had migrated, the average cumulative amount of 
time spent in the U.S. is roughly 63 months. Those with more than 63 months of 
experience are identified as having a high cumulative amount of time in the U.S., 
while those below 63 months are identified as experiencing a lower amount of 
time in the U.S. This variable is lagged one year to ensure that any migration 
experience is measured prior to the union dissolution. In this way, the model 
measures the impact of the cumulative migration experience of the household 
head on the likelihood of union dissolution in the current year. 
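
A minimal sketch of the lagged three-category migration variable follows. The function name, the dictionary input, and the treatment of exactly 63 months as "low" are assumptions made for illustration; the text specifies only "more than 63 months" and "below 63 months."

```python
from typing import Dict


def migration_category(cum_months_by_year: Dict[int, int], year: int,
                       threshold: int = 63) -> str:
    """Classify U.S. migration experience for a given person-year.

    cum_months_by_year maps a calendar year to the cumulative months
    spent in the U.S. by the end of that year. The value is read from
    the previous year (a one-year lag), so experience is always measured
    before any dissolution recorded in `year`.
    """
    months = cum_months_by_year.get(year - 1, 0)
    if months == 0:
        return "never"
    # Assumption: exactly `threshold` months is grouped with "low".
    return "high" if months > threshold else "low"


# Example: no U.S. time through 1990, 12 months by 1991, 70 months by 1992.
history = {1990: 0, 1991: 12, 1992: 70}
categories = {y: migration_category(history, y) for y in (1991, 1992, 1993)}
```

Because of the lag, the category attached to 1992 reflects experience accumulated through 1991, not through 1992.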

We include several variables to test the possibility that different levels of social 
control and sets of norms may mediate the relationship between migration and 
union stability. While the data do not allow us to distinguish directly between 
these two hypotheses, we are able to operationalize several mechanisms that are 
likely involved in both explanations. In addition, our ability to measure these 
processes at both the individual and community levels sheds light on the ways 
in which migration influences union stability. In order to test the possibility 
that the extent of integration in each community affects union stability, we use 
a detailed set of time-varying questions regarding purchases of homes, property, 
businesses, or land back in Mexico or in the U.S. to create a proxy for social ties to 
each community. The resulting measure is a time-varying categorical variable that 
distinguishes whether respondents owned a house, business, or land in Mexico; 
a house, business, or land in the U.S.; or none at all. While this variable clearly 
captures individual-level socioeconomic status, the statistical models additionally 
control for educational attainment, which lends validity to this measure as a proxy 
of ties to the home and host communities. 

The relationship between migration and union stability may also operate at the 
community level. This possibility is particularly relevant in the case of the 
Mexico-U.S. flow, which is characterized by an institutionalization of the international 
migration process in many of its sending communities (Kana'iaupuni and 
Donato 1999). On the one hand, it is possible that overall community migration 
levels may mitigate the effect of migration on union instability if migration has 
become a standard feature of the social structure. On the other hand, community 
migration levels may also enhance the relationship between migration and 
union dissolution if high levels of community migration are accompanied by 
changes in social norms and/or lower levels of social control that make union 
dissolution more socially acceptable and/or easier to obtain. A time-varying 
tri-level community migration measure is constructed that differentiates between 
low, medium, and high levels of migration to the U.S. for each community in 
1960, 1970, 1980, 1990, and 2000. In each of these years, the bottom quartile of 
community-level migration to the U.S. is identified as low, the upper quartile 
as high, and the middle quartiles as medium. These measures are appended to 
the person-year file such that 1960 migration levels are attached to all 
person-years lived between 1960 and 1964, 1970 migration levels to person-years lived 
between 1965 and 1974, 1980 migration levels to person-years between 1975 
and 1984, and so on. 
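
The two steps of this community measure, quartile classification and attachment to person-years, might be sketched as follows. The function names are hypothetical, the quartile cut points use Python's default "exclusive" method (the article does not say how ties or cut points were handled), and capping the mapping at 2000 is an assumption about what "and so on" implies for the most recent years.

```python
import statistics
from typing import Dict


def reference_year(year: int) -> int:
    """Map a person-year to the measurement year attached to it.

    1960-1964 -> 1960, 1965-1974 -> 1970, 1975-1984 -> 1980, and so on.
    Years after 2004 are assigned to 2000 (an assumption).
    """
    if year < 1960:
        raise ValueError("person-years before 1960 are not in the sample")
    return min(2000, ((year + 5) // 10) * 10)


def migration_levels(prevalence: Dict[str, float]) -> Dict[str, str]:
    """Classify communities as low, medium, or high for one measurement year.

    The bottom quartile is labeled low, the top quartile high, and the
    two middle quartiles medium.
    """
    q1, _, q3 = statistics.quantiles(list(prevalence.values()), n=4)
    levels = {}
    for community, value in prevalence.items():
        if value <= q1:
            levels[community] = "low"
        elif value >= q3:
            levels[community] = "high"
        else:
            levels[community] = "medium"
    return levels


# Example with eight hypothetical communities measured in one year.
prevalence_1980 = {name: i / 10 for i, name in enumerate("abcdefgh")}
levels_1980 = migration_levels(prevalence_1980)
level_for_1978 = levels_1980["d"] if reference_year(1978) == 1980 else None
```

A person-year lived in 1978 is thus matched to the community's 1980 classification.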

Additional explanatory variables included in the analysis are those that have 
been associated with the likelihood of union dissolution. First are controls for 
union duration and age at union formation. Research has documented that 
the risk of union dissolution increases slightly very early in the union and then 
generally declines as the union ages (Bramlett and Mosher 2001; CONAPO 1999a; 
White 1990). In order to model this process, we include a continuous measure 
of marriage duration, measured in years, as well as a squared term of marriage 
duration. Age at marriage is strongly associated with union dissolution, such that 
people who marry at younger ages have a much higher risk of dissolution than 
those who marry at older ages (CONAPO 1999a; Martin and Bumpass 1989; 
Morgan and Rindfuss 1985). Age at union formation is included as a continuous 
variable. In part, to make the most of the sample size, we include all unions 
that began since 1960. However, the risk of dissolution has increased over the 
last forty years. In order to account for period effects, we include a continuous 
measure of calendar year which is allowed to vary over time. 

A number of other sociodemographic characteristics are associated with the 
risk of union dissolution. Past studies have shown that the presence of one’s own 
children tends to discourage union dissolution (Morgan and Rindfuss 1985; South 
and Lloyd 1995; White 1990). We include a time-varying dichotomous variable 
that indicates the presence or absence of any children less than eighteen years of 
age in the household in each year of the union. With regard to socioeconomic 
status, U.S. studies show that increased educational attainment is generally 
associated with a decreased risk of union dissolution (Bramlett and Mosher 2001; 
Martin and Bumpass 1989; South and Lloyd 1995). However, studies in Mexico 
have shown that increased socioeconomic status, as measured by a professional 
career, is associated with an increased risk of union dissolution, an effect that 
is particularly pronounced for women (López et al. 2001). In this analysis we 
include a time-varying measure of the household head's educational status, 
differentiating between whether or not the head completed primary school. Lastly, 
we include a continuous measure of community size, which serves as a proxy for 
the metropolitan status of the community. The population of each community is 
measured every decade and appended to the appropriate person-year record, in 
the same manner as the community migration variables. This variable is logged 
in the final model. 


Methods 


The analysis proceeds in two parts. First, we present a series of descriptive statistics 
that describe the sociodemographic characteristics of the sample of Mexican 
migrant and nonmigrant male heads of household. In addition, because the 
descriptive analysis is not subject to the same time-varying restrictions as are 
the event history models, we are able to include measures of the level of social 
integration into U.S. society for all migrants according to the extensiveness 
of their U.S. migration experience. This exercise demonstrates the degree of 
differential exposure to U.S. society. We repeat this exercise at the community 
level to determine if communities characterized by varying migration prevalence 
levels also vary across other sociodemographic characteristics. 

Second, we estimate discrete-time multilevel event history models to 
evaluate the effect of migration on the risk of union dissolution. In this sample, 
individual person-years are nested within communities. However, there are 
repeated measures of community characteristics across decades. As a result, there 
are potentially three levels of variation in the risk of union dissolution: variation 
at the individual level due to a respondent's own characteristics, variation within 
a community across decades, and variation at the community level due to the 
static conditions of the community. To the extent that unmeasured community 
characteristics (or community decade effects) exert an influence on individual 
behavior, individuals within a community (or within a community decade) 
may be more similar to one another than to individuals in another community 
(or community decade). If this is the case, then the error terms are no longer 
independent, and standard errors may be underestimated, while the significance 
of parameter estimates may be overstated. Multilevel models are designed to deal 
with this nonindependence of error terms, resulting in more accurate estimates 
of model error and statistical significance. 

Multilevel models can be thought of as a system of equations, one for each 
level of analysis (Mosher et al. 2003; Raudenbush and Bryk 2002; Teachman and 
Crowder 2002). In this case there are three equations. At the individual level (level 
1), the log odds of a union dissolution is expressed as: 

$$\log\left(\frac{\varphi_{itdc}}{1-\varphi_{itdc}}\right) = \pi_{0dc} + \pi_{1} X_{idc} + \pi_{2} Z_{itdc}$$

where $\varphi_{itdc}$ is the probability of observing a union dissolution for man i in year t 
within community decade d in community c. $X_{idc}$ are the time-constant 
individual-level attributes, while $Z_{itdc}$ are the time-varying individual-level attributes. 
Community decade specific effects are included in the level-2 equation: 

$$\pi_{0dc} = \beta_{00c} + \beta_{01} V_{dc} + r_{0dc}$$

where $V_{dc}$ are characteristics of each community-specific decade (migration 
prevalence and community population) and $r_{0dc}$ is the level-2 error term. Finally, 
community mean probabilities of union dissolution are captured in the level-3 
equation: 

$$\beta_{00c} = \gamma_{000} + u_{00c}$$

where $u_{00c}$ is the level-3 error term. 
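
To make the nesting of the three levels concrete, the following sketch combines a community term, a community-decade term, and individual covariates into one log-odds value and converts it to a probability. All argument names and all numeric values are hypothetical and chosen only for illustration; nothing here reproduces the estimated coefficients or the HLM software.

```python
import math


def dissolution_probability(gamma_000: float, u_c: float,
                            beta_01: float, v_dc: float, r_dc: float,
                            pi_1: float, x: float,
                            pi_2: float, z: float) -> float:
    """Yearly probability of union dissolution under a random-intercept logit.

    Level 3 sets the community mean, level 2 shifts it for a given
    community decade, and level 1 adds the individual covariates.
    """
    beta_00c = gamma_000 + u_c                 # level 3: community mean
    pi_0dc = beta_00c + beta_01 * v_dc + r_dc  # level 2: community-decade intercept
    log_odds = pi_0dc + pi_1 * x + pi_2 * z    # level 1: person-year log odds
    return 1.0 / (1.0 + math.exp(-log_odds))   # inverse logit


# Hypothetical values: a low baseline risk, a high-migration community decade
# (v_dc = 1), and a man with one time-constant and one time-varying attribute.
p = dissolution_probability(-5.0, 0.1, 0.3, 1.0, -0.05, 0.2, 1.0, 0.4, 1.0)
```

Setting `u_c` to zero for every community corresponds to dropping the third level and keeping only the individual and community-decade variation.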

Preliminary analyses (not shown) demonstrate that in this case there is no 
significant community-level (level-3) variation in the risk of union dissolution. 
This suggests that communities have experienced increases in the risk of union 
dissolution over time and that the bulk of the variation in the risk of union 
dissolution above that at the individual level occurs at the intersection of decade 
and community. As a result, the models in this article incorporate only the first 
two levels of variation. For this analysis, the intercept, $\pi_{0dc}$, is allowed to vary 
across community decade, while the other effects are fixed at the overall mean. 
The event history models are estimated using HLM 5.61d software (Raudenbush, 
Bryk, and Congdon 2004). The weights used in the analysis are normalized 
versions of the inverse of the sampling fraction used in each Mexican and U.S. 
community. 
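
One common way to normalize inverse-probability weights is to rescale them so that they average one and therefore sum to the number of observations. The sketch below assumes that convention; the article does not state which normalization was used, and the function name and sampling fractions are hypothetical.

```python
from typing import List


def normalized_weights(sampling_fractions: List[float]) -> List[float]:
    """Inverse sampling-fraction weights rescaled to have a mean of one.

    Assumption: "normalized" means dividing each raw weight by the mean
    raw weight, so the weights sum to the sample size.
    """
    raw = [1.0 / f for f in sampling_fractions]
    mean_raw = sum(raw) / len(raw)
    return [w / mean_raw for w in raw]


# Example: three sites sampled at 10%, 20%, and 50% of households.
weights = normalized_weights([0.1, 0.2, 0.5])
```

Sites sampled at a lower fraction receive larger weights, while the total weighted sample size stays equal to the number of records.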

The event history analysis involves specifying the effect of migrant status on 
union dissolution in a series of multivariate models that build upon each other. 
The first model measures the baseline relationship between migration experience 
and union instability. The second model adds controls for known correlates of 
union dissolution, including controls for union duration, age at union initiation, 
calendar year, education, and presence of children in the household. The third 
model includes a variable measuring the existence of social ties in either Mexico 
or the U.S. While owning a home, land, or property in the U.S. serves as a measure 
of the extent of the respondent’s social integration into U.S. life and exposure to 
U.S. norms, productive investment in Mexico serves as a measure of the strength 
of ties to the home community. This model determines the extent to which social 
ties mediate the relationship between union dissolution and migration. The 
fourth model adds a measure of community-level migration prevalence in order 
to determine if the individual relationship between union stability and migration 
experience is accompanied and/or mediated by one at the community level. This 
model provides a test of the possibility that levels of social control and/or social 
norms vary by community migration prevalence. It also helps to address the issue 
of selection by determining if all male residents in communities characterized 
by varying migration levels face different rates of union dissolution. The final 
model includes a control for community population in order to ensure that the 
community-level effects we document in the previous model are not due solely 
to differences in community size, which is highly correlated with 
community-level socioeconomic standing. 

Table 1 provides a general description of the MMP sample of male household 
heads that have been in a union at some point post-1960. Around 4% of the 
sample reported that they had ever experienced a divorce or separation. This 
number is only slightly lower than estimates from national surveys (CONAPO 
1999b; Gómez de León 2001). Almost 40% of the household heads in the sample 
have migrated to the U.S. at some point in their lives. The mean age of the 
sample of male household heads is 41 years old, and the mean age at marriage 
is roughly 24 years old. Levels of education are fairly low—a full quarter of the 
sample has not completed primary school. 

The lower half of Table 1 presents the weighted percentage distribution of 
the time-varying variables during the thirtieth year of life for the respondents. 
Looking at the heads of households when they are age 30, we see that roughly 30% 
have ever been in the U.S. prior to age 30, with a fairly even split between those 
with extensive U.S. cumulative experience and those with low U.S. cumulative 
experience. At age 30, 93% have children under the age of 18 in the household. 
Roughly 48% of the sample have purchased a home, land, or property in Mexico, 
but less than 1% of the sample report owning land or property in the U.S. 

Table 2 presents select characteristics of male household heads by their 
migration status at the time of the survey. The three migration status categories 
are: never migrated, migrated with below average cumulative time spent in 
the U.S., and migrated with above average cumulative amount of time in the 
U.S. With regard to the demographic variables, migrants are slightly younger 
on average than nonmigrants, although within the migrant population those 
respondents with more U.S. cumulative experience are slightly older. U.S. migrants 
with low levels of cumulative experience are the most likely to come from small 
communities and are the least educated, while those with longer U.S. migratory 
careers are the most educated of the entire sample. The distribution of education 
level by migration status may be the result of selection, whereby the earliest 
migrants with the most cumulative migration experience are subject to more 
stringent selection criteria. With time, as migration becomes a more standard 
feature of the community structure, the attendant costs are decreased and the 
forces of selection are slackened (Massey et al. 1994). 

In order to examine differences in social integration levels, the next five 
variables are all common measures of migrant integration and exposure to U.S. 
society norms (Lindstrom and Giorguli Saucedo 2002). While these variables 
cannot be included in the multivariate models, given that these variables are 
measured at the time of the interview and thus after the event of interest, we 
include them here to assess the pattern of U.S. social integration by migration 
experience. Within the migrant sample, those migrants with higher cumulative 
U.S. migration experience are more likely to report having friends in the U.S. and 
having participated in a U.S. social organization as compared to migrants with 

Table 1. Summary Sample Statistics Across 41 Communities 

Variable                                        Weighted Percent 

Individual characteristics (men ever married since 1960) 
Ever divorced or separated 
  Yes                                                  3.7 
  No                                                  96.3 
Size of locality 
  < 2,500                                             19 
  2,500-14,999                                        25 
  15,000-19,000                                        2 
  20,000-99,999                                       15.3 
  100,000+                                            51.6 
Education 
  < 6 years                                           25.1 
  6+ years                                            74.9 
Age 
  Mean                                                41.2 
U.S. migrant 
  Yes                                                 37.1 
  No                                                  62.9 
Age at union initiation 
  Mean                                                23.9 
Average duration of union 
  Mean (years)                                        16.5 
(Unweighted N)                                       4,582 

Person-year characteristics (age 30) 
U.S. migration status 
  U.S. migrant                                        30.0 
    High U.S. experience                              14.4 
    Low U.S. experience                               15.6 
  Nonmigrant                                          70.0 
Children < 18 in household                            92.7 
Property/land in the U.S.                             <1 
Property/land in Mexico                               47.6 
(Unweighted N)                                       3,259 

Source: Mexican Migration Project (MMP) 

low levels of U.S. migration experience. Migrants with more U.S. experience also 
had higher English proficiency ratings and reported sending a higher amount of 
remittances back home on the last migratory trip. These distributions suggest 
that more extensive U.S. migration experience is associated with more extensive 
social integration into U.S. life, raising the possibility that more time in the U.S. 
leads to greater exposure to U.S. norms, information, and behaviors. 

As a first step in documenting how inhabitants living in communities 
characterized by varying levels of migration differ from one another, Table 3 
presents individual and household characteristics for ever-married male heads of 
household, differentiating the sample by 1990 community migration levels—low, 
medium, and high. Most dramatically, we see that over half of ever-married 
male household heads living in communities with high levels of migration are 
characterized by divided family living situations. The proportion of divided 
families is defined as the percent of male heads who were unaccompanied by their 
wives during their last migration to the U.S. The high levels of divided families 
among household heads in communities with both moderate and high levels of 
migration point to significant differences in the social patterning of family life 
in these communities as compared to communities with lower migration levels. 
With regard to education level, there is a monotonic increase in the proportion 
of male heads of household with less than six years of education according to 
community migration prevalence. One possibility for this pattern is that once 
migration becomes a standard feature of the social structure, as represented by 
those communities with the highest migration levels, it competes with education 
as a career choice (Giorguli-Saucedo 2004; Kandel and Kao 2000; Kandel and 
Massey 2002). As the costs of a migrant trip decrease, the benefit of staying in 
school as opposed to making a migratory trip may experience a concomitant 
decline. 

The second part of Table 3 examines differences in community characteristics 
by community migration level. Although none of the differences are significant, 
undoubtedly due to the limited sample size (n=41), the descriptive statistics do 
provide an illustrative snapshot of the differences created by varying migration 
prevalence levels. Corroborating past work, the descriptive statistics show that 
communities characterized by high levels of migration are not the poorest 
communities nor are they the most socioeconomically advantaged (Massey and 
Espinosa 1997). Origin communities with high migration levels have the highest 
proportion of residents who reported earning less than the minimum wage and 
the lowest male and female labor participation rates. At the same time, origin 
communities with high migration levels are characterized by positive infrastructure 
profiles, in spite of the fact that they are more likely to be rural and have the 
smallest average population size. These patterns support the possibility that the 
high receipt of remittances found within origin communities with high migration 
levels translates into improvements in household and community infrastructure 
and economic well-being. We also include two indicators that reflect the degree 
of institutionalization of the migration process in each community. They include 
whether or not the community performs a mass for its international migrants on 
its patron saint day and whether or not the community's international migrants 
have helped to finance the community church. The distributions demonstrate 
monotonic increases across levels of migration prevalence, with 60% of high 
prevalence communities reporting that they perform mass for their migrants 
and that the migrants have helped to finance their churches. 

Taken together, these distributions suggest that migrants with extensive 
U.S. migration experience and those living in communities characterized by 
medium to high levels of migration present different familial, socioeconomic, 
and acculturation profiles as compared to respondents with little or no U.S. 
migration experience. 

Table 4 presents the odds ratios from the event history analysis modeling 
the relationship between migration and union disruption in the sample of 41 
communities. We see in the baseline model that respondents with a high level 
of U.S. migration experience are over two times as likely to experience a union 
dissolution relative to those who have never migrated to the U.S. In sharp contrast 
to this effect, men who have migrated to the U.S. at some point in their lives 
but who exhibit a below average amount of U.S. migration experience are not 
significantly more likely to experience a union dissolution than those who have 
never migrated to the U.S. 

The second model includes controls for duration of union, age at union 
initiation, calendar year, educational level, and presence of children in the 
household. The addition of these variables slightly increases the differential in 
union dissolution between migrants and nonmigrants, such that migrants with 
extensive U.S. migration experience face a risk of union dissolution 2.24 times 
that of their nonmigrant counterparts. The control 
variables operate in the expected direction. The risk of union dissolution increases 
up to a point with increased union duration and then, as evidenced by the 
significance of the squared term, begins to decrease beyond that point. An older 
age at union initiation decreases the odds of experiencing a union dissolution. 
The significance of calendar year confirms that the risk of union dissolution 
has increased over time. The presence of children in the household is inversely 
related to union dissolution, so that men living in households with children 
experience a significantly decreased risk of experiencing a divorce/separation. 
Interestingly, men with six or more years of education do not have a significantly 
higher risk of union dissolution than men with less than six years of 
education. 
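
As a reading aid for the odds ratios reported here, the following Python sketch converts an odds ratio into a percent change in the odds and locates the peak of a quadratic duration effect. It is a sketch under stated assumptions rather than the authors' procedure: the function names are invented for illustration, and the two duration coefficients passed in the example are hypothetical placeholders, since the fitted log-odds coefficients are not reported in the text.

```python
import math


def percent_change_in_odds(odds_ratio):
    """Percent difference in odds relative to the reference group."""
    if odds_ratio <= 0:
        raise ValueError("an odds ratio must be positive")
    return (odds_ratio - 1.0) * 100.0


def log_odds_coefficient(odds_ratio):
    """Logit coefficient implied by an odds ratio."""
    if odds_ratio <= 0:
        raise ValueError("an odds ratio must be positive")
    return math.log(odds_ratio)


def duration_turning_point(b_linear, b_squared):
    """Duration at which b_linear*t + b_squared*t**2 (log-odds scale) peaks.

    A peak exists only when the squared-term coefficient is negative,
    i.e. its odds ratio is below 1.
    """
    if b_squared >= 0:
        raise ValueError("a peak requires a negative squared-term coefficient")
    return -b_linear / (2.0 * b_squared)


# Odds ratio reported in the text for migrants with extensive U.S. experience.
print(round(percent_change_in_odds(2.24), 1))

# Hypothetical duration coefficients (assumed values, not taken from Table 4).
print(duration_turning_point(0.14, -0.004))
```

Under this reading, an odds ratio of 2.24 corresponds to odds roughly 124 percent higher than those of the reference group, and a positive linear term combined with a negative squared term implies a rise-then-fall pattern in risk over union duration.
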

The third model includes controls for the extent of ties to the U.S. and/or 
Mexico. This set of variables serves as a proxy for the extent of social and 
economic integration of each respondent in the respective countries. We should 
note that, while the receipt of remittances would be a desired proxy for social 
ties back to Mexico, the MMP only measures remittances sent to Mexico by 

Table 4. Odds of Union Dissolution for Mexican Male Heads of Household 

                                       Model 1  Model 2  Model 3  Model 4  Model 5 
Fixed effects 
Migration to the U.S. [Never] 
  Above average U.S. experience        2.15     2.24*    [?]      2.21     [?] 
  Below average U.S. experience        1.01     .94      .94      .92      [?] 
Duration of union 
  [Continuous]                                  [?]      [?]      [?]      [?] 
  [Squared]                                     [?]      [?]      [?]      [?] 
Age at union 
  [Continuous]                                  [?]      [?]      [?]      [?] 
Calendar year 
  [Continuous]                                  [?]      [?]      [?]      [?] 
Education [<6 years] 
  6+ years                                      1.10     1.14     1.14     1.11 
Children under 18 in household [No] 
  Yes                                           [?]      [?]      [?]      [?] 
Ties [None] 
  Property in Mexico                                     [?]      .48***   [?] 
  Property in U.S.                                       1.61     1.54     1.64 
Community migration prevalence [Low] 
  Medium                                                          [?]      1.76* 
  High                                                            1.25     1.33 
Community population size 
  [Logged]                                                                 1.05† 
Intercept 
Random effects 
  Variance                             [?]      .83      .80      .78      .78 
  P-value                              .00      .00      .00      .00      .00 

Source: Mexican Migration Project (MMP) 
Note: Results are presented as odds ratios. Brackets [] indicate reference group. 
[?] marks a cell that is illegible in the source; significance markers are shown only where legible. 

†p<.1  *p<.05  **p<.01  ***p<.001 


the head of household during their last trip to the U.S. This information is not 
available throughout their history of migration to the U.S., and for this reason 
we are unable to include a measure of remittances in this model. Owning a 
house, land, property, or a business in Mexico significantly reduces the odds of 
union dissolution, while having land or property holdings in the U.S. increases 
the likelihood of union dissolution, though this latter effect is not significant. 
The inclusion of this variable, however, does little to reduce the individual-level 
effect of extensive migration experience on union dissolution. 

The fourth model adds community-level migration effects. Most importantly, 
we see that individuals who come from communities with higher levels of 
migration experience greater odds of union dissolution. Community-level 
migration prevalence exhibits a curvilinear effect such that men who live in 
communities with medium levels of migration have 40% increased odds of 
union dissolution, while those in high migration communities do not experience 
significantly different odds of union dissolution as compared to those in 
communities with low levels of migration. The inclusion of community-level 
migration prevalence does not affect the risk of union dissolution among international 
migrants. Even with the community-level migration controls, the relationship 
between individual-level U.S. migration experience and union dissolution 
remains strong and statistically significant, suggesting that factors other than 
those included here are likely contributors to the relationship between union 
dissolution and migration experience. 

In the final model, we include a control for community population in order 
to address the possibility that respondents in communities with high migration 
levels may demonstrate a higher propensity to dissolve a union as a result of 
other unmeasured community-level characteristics, such as size and/or rural 
status. As expected, residents in communities with higher population levels 
exhibit increased odds of experiencing a union dissolution, yet controlling for this 
does not reduce the effects of individual- or community-level migration status. 
Net of all factors, individuals who have extensive U.S. migratory experience are 
still 2.2 times as likely to experience dissolution compared to men who never 
migrated to the U.S. 


Discussion 


This study explores the widespread assumption, initially forged out of the 
experience of early nineteenth-century immigrant groups and reinforced in more 
recent ethnographic research conducted among Mexican migrants, that migration 
is associated with elevated rates of union disruption. In the context of the U.S.- 
Mexico migration flow, we find evidence in support of this hypothesis. In spite 
of important differences in the size, scope, and nature of this contemporary 
migration as compared to those of earlier years, its effect on union instability 
corroborates earlier accounts. 

Male household heads with extensive U.S. migration experience present higher 
odds for experiencing the dissolution of a union compared to nonmigrants. 
In addition, residents of communities with medium migration levels were 
also found to experience increased risk of union dissolution as compared to 
residents of communities characterized by low levels of U.S. migration. These 
findings temper the positive view of the beneficial effects of migration on origin 
communities (Cohen 2001; Durand et al. 1996; Massey 1998; Massey and Parrado 
1994). Our results suggest that the long-standing labor migration system that has 
evolved between the U.S. and Mexico has not translated into simple unilateral 
economic benefits for origin households and communities. Instead, they suggest 
a more complex story, whereby improvements in economic standing are often 
accompanied by divided households, familial tension, and, perhaps, union 
dissolution. The scope of these findings is likely more extensive than what has 
been documented here, given that our analysis is limited to measuring union 
dissolution as opposed to desertion. In many instances, cases of desertion may 
not be reported and therefore do not qualify as a dissolved union. To the extent 
that this is the case, our findings are likely underestimations of the true effects 
of migration on union dissolution. 

The relationship between migration and union stability was found to be 
dependent on the extensiveness of U.S. migration experience. International 
migrants with a high level of cumulative U.S. experience present over two-fold 
odds of experiencing a union dissolution as compared to their nonmigrant 
counterparts. Respondents with a low level of cumulative U.S. migration 
experience were not significantly different from those respondents who had never 
migrated to the U.S. The discrepancy between these two groups sheds light on the 
ways in which migration impinges on union stability and suggests that a migratory 
trip on its own is not a sufficient causal factor to provoke the dissolution of a 
union. Instead, it is extensive time away from the home community spent in the 
host community that increases the risk of union dissolution. 

This article demonstrates support for both the normative and social control 
hypotheses. According to the latter, more mobile community members are less 
socially embedded in family and community life and as a result are less bound 
to social norms dictating their behavior (Booth et al. 1991). To the extent that 
absence from the home community translates into decreased observation by kin 
and community and a relaxing of codes of normative behavior, migrants with 
less time in the home community are at an increased risk of transgression and 
experimentation. Our findings illustrate this point, as more time in the U.S. is 
associated with higher odds of union dissolution. This is also consistent with what 
Landale and Ogena (1995) have suggested in the case of Puerto Rican migrants, 
namely that migration has the potential to increase the risk of union dissolution 
if it occurs in the context of weak social and economic ties. 

Conversely, male migrants with low levels of U.S. migratory experience did 
not exhibit increased odds of union dissolution. Furthermore, those respondents 
with economic investments in Mexico experienced significantly decreased odds 
of union dissolution. To the extent that productive investment in Mexico reflects 
social and economic ties to origin community households, this finding suggests 
that such ties of commitment lower the risk of union disruption by increasing the 
barriers that impede its dissolution. The elevated risk of union dissolution among 
residents of communities with medium levels of migration also lends support to 
the possibility, mainly observed in U.S. studies on internal migration, that high 
population turnover leads to less effective social control over individuals and 
weakens normative consensus regarding social life (Landale and Ogena 1995). The 
risk of union dissolution among residents of origin communities with high levels 
of migration is lower than the risk found among residents of origin communities 
with medium levels of migration, suggesting that the relationship between 
migration and union dissolution is somewhat dampened in communities in which 
migration has become a standard feature of the social structure. To the extent 
that the patterns documented in the descriptive statistics reflect the degree of 
institutionalization of the migration process (i.e., high migration communities had 
the highest percent performing mass for migrants and receiving financial aid from 
migrants), they provide one explanation for the curvilinear pattern in the odds 
of union dissolution by community migration level. It follows that the disruptive 
aspects of migration may become less pronounced in communities in which the 
social processes associated with migration have become institutionalized. An 
additional consideration not addressed here is the prevalence of female migration 
in a community. Our analysis was primarily concerned with overall community 
migration rates. Past research has demonstrated that higher male migration 
levels are often accompanied by higher female migration levels (Kana‘iaupuni 
2000a). To the extent that migration becomes an increasingly available option 
for the female partners of male migrants, the ability of male migrants to escape 
the social control of their kin and community may be decreased. This possibility 
does not help explain the curvilinear pattern of community-level migration, as 
female migration rates are likely to increase in tandem with male migration rates. 
However, it does leave open the possibility that there may be a tipping point, 
whereby female migration becomes a prevalent enough option in communities 
characterized by high migration levels to lower the risk of union dissolution. 

Extensive U.S. migration experience may not only indicate a lack of social 
integration in the home community, but also increased social integration in the 
host community. The normative hypothesis predicts that exposure to social norms 
sanctioning separation and divorce will increase the probability of experiencing 
a divorce oneself (Booth et al. 1991). As such, Mexican migrants who are more 
socially integrated into U.S. society, where divorce is a socially acceptable option, 
are more susceptible to experiencing a divorce themselves. Our findings support 
this hypothesis by demonstrating that migrants with extensive U.S. experience and 
migrants who own U.S. land or property experience an increased risk of union 
dissolution. In addition, the descriptive statistics demonstrate that migrants with 
more extensive migratory careers are also more acculturated into U.S. society, as 
measured by friendship groups, social activities, and English language ability. 

At the community level, the analysis provides some support for the normative 
hypothesis. The continued relevance of community migration levels, in the 
presence of controls for individual-level migration experience, supports the 
possibility that the Mexico-U.S. migration flow of people is accompanied by 
a reverse flow of norms, perceptions, information, and social capital (Kandel 
and Massey 2002). These social remittances that flow from receiving to sending 
communities have the potential to restructure community life, including changes 
in family formation behavior. However, this interpretation is somewhat tempered 
by the finding that inhabitants of communities with high migration levels do 
not experience risks of union dissolution any higher than their counterparts in 
communities with low levels of migration. This suggests that there may be limits 
to the extent of the diffusion of ideas, behaviors, and norms, particularly once 
migration becomes a standard feature of the community's social structure. 

The strong community-level effects also speak to the relevance of the 
selection hypothesis in contributing to the relationship between migration and 
union dissolution. The finding that, in addition to the migrants themselves, all 
male residents of communities with medium levels of migration experience an 
increased risk of union dissolution, net of migration status, suggests that the 
effect of migration on divorce/separation is not due to a selection effect alone. 
Instead, the findings presented here support the possibility that migration 
engenders change in social control and normative values at both the individual 
and community level that work to increase the risk of union dissolution. That said, 
however, the strong individual-level migration effects that persist in the full model, 
even with controls for social ties and community migration prevalence, suggest 
that some degree of selection may be operating at the level of the individual 
migrant. While past research has demonstrated that the vast majority of Mexican 
migrants who migrate do so to work and support their families who remain in 
Mexico (Massey and Espinosa 1997), the possibility of a selection effect cannot 
be eliminated with the data here. 

An additional explanation is that the relationship between extensive U.S. 
migration experience and union dissolution may, in fact, be a reflection of our 
binational sample. Migrants with high rates of cumulative U.S. experience may 
be more likely to reside in the U.S., so that the strong effect of cumulative U.S. 
experience is actually capturing the higher divorce/separation rates characterizing 
Mexican families currently residing in the U.S. as compared to those families 
residing in Mexico. However, there are several reasons that this is unlikely to 
be the case. First, while we were unable to ascertain the physical location of the 
respondent's spouse in each respective person-year, our migrant sample is largely 
made up of divided family households, making it less likely that the spouse was 
residing in the U.S. with the migrant in the year prior to the union dissolution. 
Additionally, we were able to determine whether the migrant's children were 
residing with him in the U.S. in each person-year as a proxy for measuring if 
the entire family was residing in the U.S. We found that only 8% of those who 
had ever migrated had one or more of their children also living with them in 
the U.S. These distributions suggest that the effect of U.S. residence on union 
dissolution is largely driven by male household heads who experience high levels 
of U.S. migration while their families remain back in Mexico. 

Conclusion 


Taken together, this research contributes to past work that has stressed the 
importance of recognizing the role of migration in engendering change in origin 
communities. A key part of broadening migration research to include origin 
communities involves extending the scope of research to include nonmigrants. 
This is particularly important in the case of Mexican migration, given the 
extremely gendered nature of the flow. While great strides have been made in 
incorporating a gender perspective in the study of international migration, all 
too often this approach has been limited to including the experience of female 
migrants alongside their male counterparts (Pedraza 1991). The large number 
of nonmigrant women living in divided households, coupled with evidence of 
community-level migration effects, points to the importance of broadening 
this perspective to include nonmigrant as well as migrant women in future 
research efforts. Our own analysis is limited in this respect. The MMP only 
includes complete union and migratory trajectories for household heads, which 
precludes the majority of women. As a result, our analysis and conclusions are 
skewed towards the perspective of the male migrant and involve several inevitable 
assumptions regarding the nonmigrant female spouse. One such assumption is 
that the male migrant, and not his nonmigrant spouse, makes the decision to 
divorce/separate. Support for this assumption comes from existing qualitative 
evidence that documents women’s struggles to ensure that their partners return, 
as well as from suggestions of women’s relative powerlessness in the migrant labor 
system (Kana‘iaupuni 2000b). Nevertheless, the assumption that the decision to 
dissolve a union is solely the result of the male partner is an oversimplification 
that fails to recognize the nonmigrant woman as a decision maker and contributor 
to the union. Part and parcel of future efforts to insert the female voice into migration 
research will involve collecting data on the millions of nonmigrant women who 
may not migrate themselves but who are profoundly affected by the migration 
process nonetheless. 

The lack of attention given to nonmigrant women in research on Mexican 
migration is particularly salient in the context of the findings presented here. 
While our unit of analysis is restricted to male household heads, the findings have 
very clear implications for their female counterparts. By directly demonstrating 
that men with extensive migration experience exhibit an increased risk for union 
dissolution, we have implicitly demonstrated that women with migrant spouses 
who migrate frequently to the U.S. are at an increased risk of having their unions 
dissolved. This finding is the first to give an empirical basis to the widespread 
fears voiced by many of the nonmigrant spouses of Mexican migrants. It also 
raises the more troublesome possibility that the Mexican migration process may 
be creating a new generation of grass widows: women who, like their forebears 
over 100 years ago, remain tied for their survival and livelihood to unions that 
in reality no longer exist. 

Notes 


1. This is not to say that Mexican migration has not led to a considerable amount of 
settlement, as evidenced by the generational depth of the Mexican-origin population in the 
U.S. But the dominant pattern of Mexican migration has remained a temporary system of 
migration (Durand et al. 2001). 


2. A smaller number of households were interviewed if the community was under 500 
residents. 


3. For further discussion on the creation of the weights, see Massey and Parrado (1994). 


4. We feel confident that in Mexico the nature of formal and informal unions is sufficiently 
similar that they can be collapsed into a single category. In supplementary analyses, we looked 
at the effect of migrants’ being in an informal union as well as at interactions between informal 
status and each of the explanatory variables in the model. Informal unions have a higher risk 
of dissolution compared to formal unions. This is not surprising considering that there are 
fewer legal barriers preventing the dissolution of informal unions. Importantly, while there 
were small differences in the effect size of some explanatory variables by formal union status 
(e.g., the effect of having children on the risk of union dissolution was slightly weaker for 
those in informal unions as compared to those in formal unions), the substantive findings 
did not change. Out of concern for the stability of the estimates, we chose to include both 
formal and informal unions in the final analysis. 


5. Several categorizations of age at union formation were explored, and a continuous measure 
provided the best model fit. 


6. The model specification implies an independence in the risk of union dissolution within 
communities across decades, which is unlikely to be the case. For example, we might expect 
the rate of union dissolution in community x in 1970 to be correlated with the rate of union 
dissolution in community x in 1960. While ideally we would account for autocorrelation in 
the community decade random effects, constraints in our data as well as in existing software 
packages limit the degree of model complexity. 


7. Event history models calculate rates based on the assumption that recent cohorts will 
experience the same rate of dissolution at long union durations as did older cohorts, even 
though these data are not yet observed for recent cohorts. This has the potential to be a 
problematic assumption if the relationship between union duration and union dissolution 
is not proportional across cohorts (or over periods of time). Given this concern, we explored 
how problematic this assumption was in our data by determining whether the effect of 
union duration on the risk of union dissolution varied over time. There were no significant 
interactions, suggesting that the assumption of proportionality is unlikely to bias our estimates 
of the risk of union dissolution. 


References 
Baker, M. 1995. “The Voice of the Deserted Jewish Woman, 1867—1870.” Jewish Social Studies 
2(1):98-123. 


Booth, Alan, John N. Edwards, and David R. Johnson. 1991. “Social Integration and Divorce.” 
Social Forces 70(1):207—24. 




Bramlett, M.D., and W.D. Mosher. 2001. “First Marriage Dissolution, Divorce, and Remarriage: 
United States.” Center for Disease Control: National Center for Health Statistics. 


Breault, K.D., and Augustine J. Kposowa. 1987. “Explaining Divorce in the United States: A 
Study of 3,111 Counties, 1980.” Journal of Marriage and the Family 36(1):549-58. 


Bumpass, Larry L. 1990. “What’s Happening to the Family? Interactions between Demographic 
and Institutional Change.” Demography 27(4):483-98. 


Buznego, Maria Eugenia D’Aubeterre. 2001. “¿Todos estamos bien? Género y parentesco en 
familias de transmigrantes poblanos.” Paper presented at the Latin American Studies 
Association Annual Meeting, September 6-8, Washington, D.C. 


Castro Martin, Teresa. 2002. “Consensual Unions in Latin America: Persistence of a Dual 
Nuptiality System.” Journal of Comparative Family Studies 33(1):35—55. 


Cherlin, Andrew J. 1992. Marriage, Divorce, Remarriage. Harvard University Press. 


Cohen, Jeffrey H. 2001. “Transnational Migration in Rural Oaxaca, Mexico: Dependency, 
Development, and the Household.” American Anthropologist 103(4):954—67. 


Consejo Nacional de Población (CONAPO). 1999a. “La nupcialidad en México: Patrones 
de continuidad y cambio en el último cuarto de siglo.” Pp. 191-202 in La situación 
demográfica de México, 1999, edited by CONAPO. 


Consejo Nacional de Población (CONAPO). 1999b. “La transición demográfica y el curso de vida 
de las mujeres mexicanas: Los desafíos presentes y futuros.” Pp. 213-31 in La situación 
demográfica de México, 1999, edited by CONAPO. 


Davis, M. 1992. Mexican Voices American Dreams: An Oral History of Mexican Migration to 
the United States. Henry Holt and Company. 





Diner, Hasia R. 1983. Erin's Daughters in America: Irish Immigrant Women in the Nineteenth 
Century. Johns Hopkins University Press. 


Durand, Jorge, William Kandel, Emilio A. Parrado, and Douglas S. Massey. 1996. “International 
Migration and Development in Mexican Communities.” Demography 33(2):249-64. 


Durand, Jorge, Douglas S. Massey, and Rene M. Zenteno. 2001. “Mexican Immigration 
to the United States: Continuities and Change.” Latin American Research Review 
36(1):107-27. 

Finley, Ashley P. 2003. Are They the Poorest of the Poor?: Examining Capital Accumulation 
among Female-Headed Households in Mexico. Ph.D. dissertation. Department of 
Sociology, University of Iowa. 


Foner, Nancy. 1999. “Immigrant Women and Work in New York City, Then and Now.” Journal 
of American Ethnic History 18(3):95-113. 


Frank, Reanne, and Robert A. Hummer. 2002. “The Other Side of the Paradox: The Risk 
of Low Birth Weight among Infants of Migrant and Nonmigrant Households within 
Mexico.” International Migration Review 36(3):746-65. 


Fridkis, A.L. 1981. “Desertion in the American Jewish Immigrant Family: The Work of the 
National Desertion Bureau in Cooperation with the Industrial Removal Office.” American 
Jewish History 71(2):285-99. 

Friedman, R.S. 1982. “Send Me My Husband Who Is In New York City’: Husband Desertion 
in the American Jewish Immigrant Community 1900-1926.” Jewish Social Studies 
44(1):1-18. 


946 / Social Forces 83:3, March 2005 


Giorguli-Saucedo, Silvia. 2004. “To Study or Not to Study: The Influence of Family Migration 
on School Enrollment among Mexican Adolescents.” Paper presented at the Population 
Association of America Annual Meetings, Boston. 


Gómez de León, J.C. 2001. “Los cambios en la nupcialidad y la formación de familias: Algunos 
factores explicativos.” Pp. 207-42 in La población de México: Tendencias y perspectivas 
sociodemográficas hacia el siglo XXI, edited by José Gómez de León Cruces and Cecilia 
Rabell Romero. CONAPO. 


Goode, William Josiah. 1993. World Changes in Divorce Patterns. Yale University Press. 


Grasmuck, Sherri, and Patricia R. Pessar. 1991. Between Two Islands: Dominican International 
Migration. University of California Press. 


Gupta, Payal. 2002. Marriage at a Distance: Spouse Separation and the Migrant Family. Ph.D. 
dissertation. Department of Sociology and Demography, University of Pennsylvania. 


Hirsch, Jennifer S. 1999. “En el Norte la Mujer Manda: Gender, Generation, and Geography in 
a Mexican Transnational Community.” American Behavioral Scientist 42(9):1332-49. 





Hirsch, Jennifer S. 2003. A Courtship after Marriage: Sexuality and Love in Mexican Transnational 
Families. University of California Press. 


Kana‘iaupuni, Shawn Malia. 2000a. “Reframing the Migration Question: An Analysis of Men, 
Women, and Gender in Mexico.” Social Forces 78(4):1311—47. 





Kana‘iaupuni, Shawn Malia. 2000b. “Sustaining Families and Communities: Nonmigrant Women and Mexico- 
U.S. Migration Processes.” University of Wisconsin. Center for Demography and Ecology 
Working Paper. 


Kana‘iaupuni, Shawn Malia, and Katharine M. Donato. 1999. “Migradollars and Mortality: 
The Effects of Migration on Infant Survival in Mexico.” Demography 36(3):339—53. 


Kandel, William, and Grace Kao. 2000. “Shifting Orientations: How U.S. Labor Migration 
Affects Children’s Aspirations in Mexican Migrant Communities.” Social Science Quarterly 
81(1):16-32. 


Kandel, William, and Douglas S. Massey. 2002. “The Culture of Mexican Migration: A 
Theoretical and Empirical Analysis." Social Forces 80(3):981—1004. 


Landale, Nancy S., and Nimfa Ogena. 1995. “Migration and Union Dissolution among Puerto 
Rican Women.” International Migration Review 29(3):671-92. 


Levitt, Peggy. 1998. “Social Remittances: Migration Driven, Local-Level Forms of Cultural 
Diffusion.” International Migration Review 32(4):926-48. 


Lindstrom, David P., and Silvia Giorguli Saucedo. 2002. “The Short- and Long-Term Effects 
of U.S. Migration Experience on Mexican Women’s Fertility.” Social Forces 80(4):1348-68. 


López, María de la Paz, Vania Salles, and Rodolfo Tuiran. 2001. “Familias y hogares: 
Pervivencias y transformaciones en un horizonte de largo plazo.” Pp. 635-691 in La 
población de México: Tendencias y perspectivas sociodemográficas hacia el siglo XXI, edited 
by José Gómez de León Cruces and Cecilia Rabell Romero. CONAPO. 


Marroni, Maria da Gloria. 2000. “‘El siempre me ha dejado con los chiquitos y se ha llevado 
a los grandes...’ Ajustes y desbarajustes familiares de la migración.” Pp. 87-118 in 
Migración y relaciones de género en México, edited by Dalia Barrera-Bassols and Cristina 
Oehmichen-Bazan. UNAM. 


Martin, T.C., and L.L. Bumpass. 1989. “Recent Trends in Marital Disruption.” Demography 
26(1):37—51. 




Massey, Douglas S. 1998. “International Migration and Business Formation in Mexico.” Social 
Science Quarterly 79(1):1-20. 


Massey, Douglas S., and Kristin E. Espinosa. 1997. “What’s Driving Mexico-U.S. Migration? A 
Theoretical, Empirical, and Policy Analysis.” American Journal of Sociology 
102(4):939-997. 


Massey, Douglas S., Luin Goldring, and Jorge Durand. 1994. “Continuities in Transnational 
Migration: An Analysis of Nineteen Mexican Communities.” American Journal of 
Sociology 99(6):1492-1533. 


Massey, Douglas S., and Emilio A. Parrado. 1994. “Migradollars: The Remittances and 
Savings of Mexican Migrants to the United States.” Population Research and Policy 
Review 13:3—30. 


MMP93, Mexican Migration Project. 2004. http://mmp.opr.princeton.edu/. 


Morgan, S. Philip, and Ronald R. Rindfuss. 1985. “Marital Disruption: Structural and 
Temporal Dimensions.” American Journal of Sociology 33(4):1055-77. 


Mosher, W.D., L.P. Deang, and M.D. Bramlett. 2003. “Community Environment and Women’s 
Health Outcomes: Contextual Data.” Vital and Health Statistics, Series 23, No. 23. National 
Center For Health Statistics. 


Pedraza, Silvia. 1991. “Women and Migration: The Social Consequences of Gender.” Annual 
Review of Sociology 39(5):303—25. 


Raudenbush, Stephen W., and Anthony S. Bryk. 2002. Hierarchical Linear Models: Applications 
and Data Analysis Methods, 2nd ed. Sage Publications. 


Raudenbush, Stephen W., Anthony S. Bryk, and Richard Congdon. 2004. Hierarchical Linear 
and Nonlinear Modeling Software Version 5.61d. 


Salgado de Snyder, V. Nelly. 1993. “Family Life across the Border: Mexican Wives Left Behind.” 
Hispanic Journal of Behavioral Sciences 42(2):391—401. 


Solís, Patricio. 2004. “Cambios recientes en la formación de uniones consensuales en México.” 
Pp. 351-70 in El Amanecer del siglo y la población mexicana, edited by F. Lozano. 
UNAM. 


South, Scott J., and Kim M. Lloyd. 1995. “Spousal Alternatives and Marital Dissolution.” 
American Sociological Review 60(1):21—35. 


Stephen, Lynn. 1991. Zapotec Women. University of Texas Press. 


Suarez-Lopez, Leticia. 2004. “Revisión demográfica del divorcio en México.” Pp. 371-87 in 
El Amanecer del Siglo y la Población Mexicana, edited by F. Lozano. UNAM. 


Teachman, Jay, and Kyle Crowder. 2002. “Multilevel Models in Family Research: Some 
Conceptual and Methodological Issues.” Journal of Marriage and Family 64(2):280—94. 


Trigueros, Paz. 1992. “Unidades domésticas y papel de la mujer en un poblado rural en el 
que se practica la migración a Estados Unidos.” Pp. 78-104 in La Investigación sobre la 
Mujer: Informes en sus primeras versiones, edited by Vania Salles and Elsie McPhail. El 
Colegio de Mexico. 


Viadro, Claire I. 1997. “Sexually Transmitted Diseases and Married Hispanic Immigrants 
in North Carolina: An Exploration of Social Context, Within-Group Differences, and 
Methodology.” Ph.D. dissertation. Department of Health Behavior and Health Education, 
University of North Carolina. 


White, Lynn K. 1990. “Determinants of Divorce: A Review of Research in the Eighties.” Journal 
of Marriage and the Family 45(1):904—12. 




Ethnic Classification in Southeastern 
Puerto Rico: The Cultural Model of “Color”* 


CLARENCE C. GRAVLEE, Florida State University 





Abstract 


This article presents a systematic ethnographic study of emic ethnic classification 
in Puerto Rico, including a replication and extension of Marvin Harris's (1970) 
seminal study in Brazil. I address three questions: (1) what are the core emic 
categories of color? (2) what dimensions of semantic structure organize this cultural 
domain? and (3) is the assumption of a shared cultural model justified? Data are 
from two sets of ethnographic interviews in southeastern Puerto Rico, including 
23 free listing interviews and 42 structured interviews using Harris's standardized 
facial portraits. Results indicate a small core of salient emic categories with well- 
defined semantic structure and high interinformant agreement, reflecting shared 
cultural understandings of color. I discuss how systematic ethnographic methods 
can contribute to comparative research on ethnic classification. 


The prevailing view of ethnicity in Puerto Rico emphasizes ambiguity as a 
defining feature of emic ethnic classification. According to this view, the primacy 
of phenotype over descent leads to the proliferation of categories with uncertain 
boundaries and fluid meaning. In contrast, in the mainland United States the 
rule of hypodescent sustains a simple classification scheme founded on a well- 
defined, binary opposition between black and white. Similar contrasts are drawn 
between other Latin American societies and the United States. Although more 
and more researchers question such contrasts (Rodríguez 2000; Skidmore 1993; 
Winant 1994), one basic question remains neglected: To what extent is there a 
shared cultural model of emic ethnic classification in the United States, in Puerto 
Rico, or in other societies? 

This article takes up that question with an exploratory ethnographic study 
of ethnic classification in southeastern Puerto Rico. Its unique contribution is to 
apply methods recently developed by cognitive anthropologists to elicit cultural 
knowledge and to measure the extent to which such knowledge is shared. In that 
regard, it revives a program of research using systematic ethnographic methods 
that Marvin Harris and his students launched 40 years ago in Brazil (Harris 1970; 
Harris and Kottak 1963; Kottak 1967; Sanjek 1971). 

The best-known result from Harris’s studies is his seminal article, 
“Referential Ambiguity in the Calculus of Brazilian Racial Identity” (Harris 1970). 
In that article, Harris argued that the Brazilian system of ethnic classification is 
characterized by the “maximization of noise and ambiguity,” a view to which 
many still adhere. Yet just a year after Harris's article appeared, Sanjek (1971) 
argued that Brazilian racial classification was more coherent and consensual 
than his mentor had suggested. Some researchers agree with Sanjek (e.g., Telles 
2002:435; Wade 1993:4; Whitten 1985:42), while others accept Harris’s position 
(e.g., Bailey 2002:428; Loveman 1999:893; Yelvington 2001:243). Despite these 
differing conclusions, little systematic ethnographic research has been done on 
emic ethnic classification in Brazil or in other parts of Latin America since the 
early 1970s. 

This gap is unfortunate because it coincides with the development of methods 
for answering the questions Harris, Sanjek, and others were asking (D’Andrade 
1995). Indeed, Harris and Sanjek saw this development coming. Harris (1970:2) 
cautioned that “clarification of the nature of the ambiguity in the Brazilian ‘racial’ 
calculus awaits the development of cross-culturally valid methods of cognitive 
analysis.” Sanjek (1971:1127) saw “the domain of Brazilian racial vocabulary” as 
“an arena for the testing of quantitative procedures in cognitive anthropology.” In 
particular, he challenged cognitive anthropologists to investigate the distribution 
of shared knowledge and to test their assumptions about the existence and 
location of cultural boundaries. 

Three decades later, we have well-defined procedures for answering this 
challenge (Handwerker 2002; Ross 2004). Methodological developments in four 
areas are relevant. First, ethnoscientists in the 1960s commonly relied on “a single 
informant to verify the psychological reality of a componential analysis” and 
seldom asked “how many informants told the ethnographer something and how 
thoroughly responses were cross-checked among different informants” (Sanjek 
1971:1127). Today, there are established procedures for selecting ethnographic 
informants to sample a broad range of life experiences and cultural knowledge 
(Handwerker and Wozniak 1997; Johnson 1990). Second, many pioneering 
studies in cognitive anthropology lacked “ethnographic discovery procedures” 
for eliciting the terms in a cultural domain (Sanjek 1971:1127). We now have 
systematic methods for defining the content and boundaries of a domain 
without imposing the ethnographer's conceptual framework (Ross 2004; Weller 
and Romney 1988). Third, at the time of Harris's studies, methods for detecting 
semantic structure were in their infancy (e.g., Metzger and Williams 1966). 
There have been important recent advances in such methods (e.g., Romney et 
al. 2000). Finally, a fundamental problem, until recently, was the inability to 
estimate precisely the degree of intracultural variation or to test the assumption 
that a group of informants share a single culture. Significant progress has been 
made, resulting in formal methods to measure the amount and distribution of 
cultural knowledge (Handwerker 2002; Romney et al. 1986). 

These developments make it a good time to revive and expand Harris's 
program of research. There is likely to be interest in applying these methods in 
Brazil for direct comparison with Harris's conclusions, but the comparative study 
of emic ethnic classification will benefit from systematic ethnographic research in 
other societies too. In this article, I report findings from an ethnographic study 
during 2000-2001 in a coastal town of southeastern Puerto Rico. In addition to 
participant observation and semistructured interviews, I replicated and extended 
Harris’s (1970) method, using the original standardized facial portraits that 
both he and Sanjek used. The results provide evidence of a coherent and highly 
structured cultural model of color (ko-lór) that appears to be shared across 
divisions of age, sex, class, and color in this part of Puerto Rico. This preliminary 
finding warrants further research on emic ethnic classification in Puerto Rico, and 
it illustrates how systematic ethnographic methods complement more standard 
approaches to the study of ethnicity in Latin America and elsewhere. 


Background 


In the mid-1900s, Puerto Rico came to be seen by North American and Puerto 
Rican scholars alike as a so-called racial democracy (Arana Soto 1976; Blanco 
1948; Petrullo 1947; Rogler 1940; for an exception, see Gordon 1949). More 
recently, this benign view has been discredited (Munoz Vazquez and Alegria 
Ortega 1999; Rivera Ortiz 2001), and there is renewed interest in understanding 
the causes and consequences of racism as it exists in Puerto Rico (Davila 1997; 
Duany 2002; Godreau 1999, 2000; Torres 1995). 

The notion of racial democracy is a relative one, and the reference in classic 
North American scholarship is the pre-civil rights era United States. Consequently, 
a key issue in both the construction and critique of Puerto Rico as a racial 
democracy is the contrast between the cultural model of color in Puerto Rico 
and that of race in the United States. Typically, this contrast emphasizes three 
themes. 

First, whereas the model of racial classification in the United States is regarded 
as a relatively simple system with few emic categories, the Puerto Rican model 
of color is marked by the proliferation of terms along a continuum from blanco 
(white) to negro (black). For example, Duany (2002:238) lists 19 “major folk racial 
terms,” and Godreau (2000) mentions at least a dozen, including indio, moreno, 
mulato, prieto, jabao, and the most common term, trigueño (literally, “wheat-
colored”). This abundance of terms is a persistent theme in the ethnographic 
record (Hoetink 1967; Mintz 1956; Rogler 1944). However, to my knowledge, no 
study has ever systematically elicited emic ethnic categories in Puerto Rico. 

Second, previous work emphasizes the distinct organizing principles of ethnic 
classification in Puerto Rico and the mainland U.S. Traditionally, the rule of 
hypodescent ensures that anyone with a perceptible trace of African ancestry is 
defined as black in the U.S. By contrast, the ascription of color in Puerto Rico is 
primarily a matter of physical appearance—especially as defined by skin color, 
hair texture, and facial features—such that not even siblings need be assigned 
to the same emic category (Duany 2002; Hoetink 1967; Seda Bonilla 1991). 
Many researchers agree that skin color and hair texture are especially important 
criteria of color and that the plethora of terms can be grouped into three basic 
categories: white, brown, and black. Yet there is relatively little systematic evidence 
to support these assumptions. Seda Bonilla's (1991) classic work, first published 
in 1963, remains the only systematic study of how the semantic structure of color 
in Puerto Rico is organized. 

Third, ethnographers have long been fascinated with the "elasticity and 
ambiguity of Puerto Rican racial terms" (Duany 2002:241). For example, Rogler 
notes the “double meanings and ambiguities” of such terms (1944:448) and 
suggests that “the good investigator, who is seeking to understand race distance 
in Puerto Rico, would not have his contribution seriously impaired were he to 
ignore semantics entirely" (1944:453). Others reinforced this view by noting the 
dependence of color on class. Mintz (1956:411) remarks that "an individual's 
‘color’ may ‘vary’ in accord with changes in his socioeconomic status.” However, 
some scholars point out that, despite ambiguity in the referential meaning of color 
categories, there is “attributed to each a corresponding social status” (Gordon 
1949:298). “Nor indeed,” Lewis (1963:228-29) contends, “does the use, however 
charming, of characteristic euphemisms to refer to racial admixture—pardo, 
moreno, trigueño—disguise the fact that social acceptance goes hand in hand 
with the degree of whiteness in skin texture.” 

These themes form three empirical questions this study addresses. First, 
what are the core emic categories that constitute the cultural model of color in 
southeastern Puerto Rico? Second, what are the dimensions of semantic structure 
that organize this model? Third, how shared is the cultural model of color across 
divisions of age, sex, class, and color in this region of Puerto Rico? 


RESEARCH SETTING 


I addressed these questions during fieldwork in the southeastern coastal municipio 
(municipality) of Guayama, home to 44,301 people according to the 2000 Census 
(United States Bureau of the Census 2001). Guayama's history and economic 
development are closely linked to its fertile soils, which made it one of the 
most important centers of the Puerto Rican sugar economy for more than 150 
years (Scarano 1984). Although the flow of African slaves to Puerto Rico never 
developed on a large scale by Caribbean standards, sugar-producing areas of the 
island had concentrated slave populations (Díaz Soler 1965). In Guayama, one of 
the three highest sugar-producing municipios in Puerto Rico, the slave population 
grew 623% from 1812 to 1828, as sugar production boomed. By that time, slaves 
formed nearly 30% of the local population (Scarano 1984:78). 

One legacy of sugar is that the proportion of people who claim African 
ancestry is concentrated in coastal towns like Guayama. In 2000, for the first time 
in 50 years, the census asked Puerto Ricans on the island to identify their “race.” 
The results are difficult to interpret, given the emic inappropriateness of U.S. 
racial categories and strategies of blanqueamiento, or whitening, that lead many 
to downplay their African ancestry (Duany 2002). Nevertheless, the percentage 
of people who reported their race as "Black or African American"—alone or in 
combination with some other race—was greatest in the coastal municipios. In 
Guayama, 13.2% self-identified as black (alone or in combination), as compared 
to the island-wide rate of 9.2%. This pattern has drawn other ethnographers 
to the southern coast of Puerto Rico (Godreau 1999, 2000; Mintz 1956, 1974; 
Torres 1995). 


SELECTION OF INFORMANTS 


The data reported here are based on ethnographic interviews with two 
independent samples of informants. The first sample (n=23) participated in 
free-listing interviews, while the second (n=42) completed two structured tasks 
with Harris’s (1970) standardized facial portraits. The sampling strategy is based 
on the insight that the socially constructed nature of cultural phenomena violates 
the assumption of case independence in classical statistical theory (Handwerker 
and Wozniak 1997). Because people acquire and transmit cultural meaning 
through social interaction, efficient ethnographic sampling designs should select 
informants who represent a range of variability in life experiences and social 
contexts. Handwerker and Wozniak (1997) validated this strategy experimentally 
by showing that probability and convenience samples yield identical conclusions 
about cultural data. Further, Weller (1987) demonstrated that the Spearman-
Brown prophecy formula can be applied to informants, rather than items, to 
establish the validity and reliability of cultural data. With modest levels of 
interinformant agreement (.50), conclusions based on samples of as few as nine 
informants are estimated to have excellent validity (.95) and reliability (.90) 
(Handwerker and Wozniak 1997:874; Romney et al. 1986:326). 
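
The figures cited above can be checked with a short calculation. The sketch below is illustrative only: it assumes the standard Spearman-Brown form, in which the reliability of an aggregate of k informants with mean interinformant agreement r is kr / (1 + (k - 1)r), and it assumes that validity is taken as the square root of reliability. The function name is hypothetical and is not drawn from the cited sources.

```python
import math


def spearman_brown(mean_agreement: float, n_informants: int) -> float:
    """Reliability of the pooled responses of n_informants.

    Assumes the standard Spearman-Brown form, applied to informants
    rather than to test items.
    """
    k, r = n_informants, mean_agreement
    return (k * r) / (1 + (k - 1) * r)


# Values taken from the text: agreement of .50 and nine informants.
reliability = spearman_brown(0.50, 9)

# Assumption: validity is the square root of reliability.
validity = math.sqrt(reliability)

print(round(reliability, 2), round(validity, 2))
```

Under these assumptions the calculation gives a reliability of .90 and a validity of about .95, matching the values reported above.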

On these grounds, I selected informants to maximize heterogeneity in age, 
sex, social class, and color. One strategy was to recruit informants from caseríos 
(public housing), barrios (lower and lower-middle class neighborhoods), and 
urbanizaciones (middle and upper class subdivisions). Torres (1995:34) describes 
how these neighborhood types are associated with divisions—real and alleged—of 
class and color. Caseríos are commonly associated with negros (blacks) and with 
drugs, alcoholism, prostitution, and violent crime. Barrios are likewise associated 
with blackness and low social status, though many residents of barrios are 
homeowners. The expression “del barrio” (from the barrio) is often used as a 
derogatory term roughly meaning uneducated and uncultured. It may also be 
used as a euphemism for negro (Godreau 2000). Urbanizaciones, by contrast, are 
associated with higher social status, and their residents are generally assumed to 
be blancos (whites). 

The sampling strategy also took advantage of Guayama's expanse from the 
central mountain chain to the Caribbean coast, with a developed urban center. 
The contrast between interior and coastal areas captures what Torres (1995:35) 
describes as a “racialized landscape,” in which the mountainous interior is 
associated with whiteness, and the coast is associated with blackness. This legacy of 
sugar and slavery is evident in the 2000 Census. The highest percentage of people 
self-identifying as “Black or African American” occurred in the coastal barrios of 
Guayama (27.1%), with the lowest percentage in interior regions (3.6%). 


INTERVIEW PROCEDURES 


The first set of interviews elicited 23 free lists (Weller and Romney 1988) of terms 
that refer to color. Free listing is a simple interviewing technique that involves 
asking informants to list freely all the color categories they know. Free listing is an 
effective method for defining the contents and boundaries of a cultural domain 
using the language, concepts, and categories that are meaningful to informants. 
For coherent domains, samples of 20-30 informants are generally adequate; 
additional informants add few new items (Borgatti 1998; Ross 2004). 

The second set of interviews replicated and extended Harris’s (1970) technique 
for eliciting emic color categorizations of standardized facial drawings. Based 
on previous ethnography (Harris 1952; Harris and Kottak 1963), Harris and 
colleagues developed 36 male and 36 female drawings to represent all possible 
combinations of three skin tones, three hair forms, two nose widths, and two 
lip sizes for each sex; all other features are held constant. Sample drawings were 
published in Harris’s (1970) report.¹ 
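
The factorial design of the portraits can be enumerated directly. The following is a minimal sketch, not a reconstruction of Harris's materials: the skin-tone and hair-form labels follow the terms used in this article, while the labels for nose width and lip size are hypothetical placeholders.

```python
from itertools import product

# Feature levels. Nose and lip labels are hypothetical placeholders.
skin_tones = ["light", "intermediate", "dark"]
hair_forms = ["straight", "wavy", "kinky"]
nose_widths = ["narrow", "wide"]
lip_sizes = ["thin", "thick"]
sexes = ["male", "female"]

# Every combination of the four features, for one sex.
portraits_per_sex = list(product(skin_tones, hair_forms, nose_widths, lip_sizes))

# The full deck crosses those combinations with sex.
deck = list(product(sexes, portraits_per_sex))

n_respondents = 42  # size of the structured-interview sample
total_categorizations = n_respondents * len(deck)

print(len(portraits_per_sex), len(deck), total_categorizations)
```

The counts agree with the design as described: 36 drawings per sex, 72 drawings in all, and 3,024 categorizations when each of 42 respondents identifies every drawing.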




Following Harris, I presented the drawings in a unique randomized order and 
allowed each informant to glance through the deck before identifying the first 
portrait. I then asked respondents to tell me how a person like the one depicted 
in each portrait would be classified in terms of color in Puerto Rico. Next, I 
extended Harris's technique by asking respondents to sort the 36 male drawings 
into piles of faces they thought were similar (Weller and Romney 1988:20). 
The identification and pile sort data are analyzed to assess patterns of semantic 
structure and interinformant agreement. Identification data also provide a validity 
check of the free list results. 


Results 


1. CONTENTS AND BOUNDARIES OF CULTURAL DOMAIN 


Table 1 presents descriptive results for the most commonly listed items in the free 
list interviews. Frequency indicates the number of informants who listed each 
term; average rank reflects how soon informants mentioned each item. Smith's 
salience index (S) incorporates both how often and how early items occur in 
informants' lists by computing each item's average percentile rank across all lists 
(Smith 1993). Higher values of Smith's S indicate greater cultural salience. 
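
To make the index concrete, the sketch below computes Smith's S under one common formulation, assumed here: an item at rank r in a list of length L scores (L - r + 1) / L, an item absent from a list scores 0, and scores are averaged over all lists. The free lists are invented for illustration and are not the study's data.

```python
from collections import defaultdict


def smiths_s(free_lists):
    """Return Smith's salience index for every item in a set of free lists.

    Assumes each list is ordered by mention and contains no repeated items.
    """
    n_lists = len(free_lists)
    totals = defaultdict(float)
    for items in free_lists:
        length = len(items)
        for rank, item in enumerate(items, start=1):
            # Earlier mentions earn scores closer to 1.
            totals[item] += (length - rank + 1) / length
    # Dividing by the number of lists gives absent items an implicit 0.
    return {item: total / n_lists for item, total in totals.items()}


# Hypothetical free lists from three informants (illustration only).
example_lists = [
    ["negro", "blanco", "trigueño"],
    ["blanco", "negro"],
    ["trigueño", "negro", "blanco", "jabao"],
]

salience = smiths_s(example_lists)
for term, score in sorted(salience.items(), key=lambda kv: -kv[1]):
    print(f"{term}: {score:.3f}")
```

In this toy example negro ranks highest because every informant lists it and lists it early, while jabao ranks lowest because it appears once, at the end of a single list.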

The distributions of frequency and of Smith's S help to define the core and 
peripheral items in a cultural domain (Borgatti 1998). Table 1 suggests that 
the cultural model of color includes relatively few core emic categories. Overall, 
informants listed 51 unique terms, but nearly two-thirds of these items were listed 
by a single informant. Only the first six items were mentioned by at least two- 
thirds of the informants, with the frequency dropping off rapidly for subsequent 
terms. Likewise, the highest salience scores are for the first four terms (negro, 
trigueño, jabao, and blanco), with somewhat lower scores for indio and prieto 
and substantially lower scores for the remaining terms. 

The identification task provides independent confirmation of this result (Table 
2). Most responses to this open-ended task were idiosyncratic variations of core 
terms plus a modifier. Examples include “blanco con facciones de negro,” “blanco 
con rasgos negros,” and “blanco con descendencia de negro,” all of which modify 
blanco to indicate features associated with negro. Such responses were recoded as 
blanco+, trigueño+, indio+, and negro+. I retained the categories blanco, trigueño, 
negro, jabao, indio, prieto, mestizo, and moreno and recoded a small number of 
unusual responses as other.² 

Table 2 gives two sets of frequency statistics: (1) the number of respondents 
who used each category to identify the facial portraits and (2) the total number of 
times each category was used across the 3,024 categorizations (42 respondents by 
72 faces). Both sets of results point to the primacy of a small set of emic categories. 
Blanco, trigueño, and negro were used by at least 90% of respondents, while jabao 
and indio were used by more than 80%. None of the other categories was used 
Table 2. Frequency Statistics for Unique Categorizations in 
Identification of Standardized Faces, by Respondents (N = 42) 
and Categorizations (N = 3,024) 

                 Respondents             Categorizations 
Category         Frequency   Percent     Frequency   Percent 
Blanco               41        97.6         553        13.2 
Trigueño             40        95.2         808        19.2 
Negro                38        90.5         555        13.2 
Jabao                34        81.0         269         6.4 
Indio                34        81.0         417         9.9 
Blanco +             20        47.6          64     [illegible] 
Trigueño +           20        47.6          49         1.3 
Negro +              13        31.0          47         1.1 
Prieto               13        31.0         108         2.6 
Indio +              10        23.8          24          .6 
Mestizo               9        21.4          85         2.0 
Other                 8        19.1          14          .3 
Moreno                8        19.1          31     [illegible] 




by half of the respondents, and even the most frequently used of these are either 
synonymous with (e.g., prieto for negro) or modifications of the core categories. 
That these core categories were the most frequently modified terms underscores 
their salience as the basic emic categories of color (cf. Sanjek 1971). 


2. SEMANTIC STRUCTURE 


Because Harris's facial portraits vary systematically by skin tone, hair texture, 
nose shape, lip form, and sex, the identification data provide information about 
which physical features are associated with each emic color categorization. Figure 
1 shows a correspondence analysis (Greenacre 1984) of these associations. 

This graph suggests that distinctions among color categories depend primarily 
on contrasts in skin color and hair form. For example, of all categorizations made 
as jabao, 88% had light skin, and 95.5% had kinky hair. Of those identified as 
indio, 97.5% had dark or intermediate skin, and all had straight or wavy hair. 
Similar contrasts are evident for blanco and negro, while trigueño is strongly
associated with intermediate skin tone and hair form. The relative insignificance
of sex, nose shape, and lip form is evident from their position in the middle of
Figure 1 and from their not being associated with one term more than another.
Figure 1 also suggests that light skin and kinky hair are particularly distinctive
features. The difference between blanco and either trigueño or indio is light skin;
the difference between negro and either trigueño or indio is kinky hair. Likewise,
the difference between jabao and negro is light skin, while the difference between
jabao and blanco is kinky hair.

Figure 1. Correspondence Analysis of Features by Color, from
Identification Task (N = 42)
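
The percentages reported alongside Figure 1 (for example, the share of jabao categorizations made for light-skinned faces) are row profiles of a category-by-feature contingency table, which is also the starting point for correspondence analysis. The sketch below computes only those row profiles and does not perform the correspondence analysis itself. The function name and the counts are invented for illustration and are not the study's data.

```python
def row_profiles(table):
    """Convert a category-by-feature count table into row percentages.

    table: dict mapping a color category to a dict of
    feature level -> number of categorizations (hypothetical layout).
    """
    profiles = {}
    for category, counts in table.items():
        total = sum(counts.values())
        profiles[category] = {
            level: 100.0 * n / total for level, n in counts.items()
        }
    return profiles


# Invented counts, used only to show the calculation.
toy_counts = {
    "jabao": {"light": 8, "intermediate": 1, "dark": 1},
    "indio": {"light": 1, "intermediate": 19, "dark": 20},
}
profiles = row_profiles(toy_counts)
```

Categories whose profiles differ sharply on a feature, as jabao and indio do on skin tone in this toy table, are the ones a correspondence analysis would place far apart along that feature.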


The pile sort data provide a more direct test of the relationships among emic
color categories and allow us to estimate more precisely the relative salience of
skin color, hair type, and facial features as criteria of color. Pile sorts produce direct
measures of emic similarity among items based on the number of times any two
items occur in the same pile. These similarity data can be represented graphically
with multidimensional scaling (MDS) to illustrate the cognitive relationships
among items across all respondents (Kruskal and Wish 1978). Figure 2 displays
a two-dimensional MDS graph for the pile sort data.³
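
The similarity measure described above, the number of times two items fall in the same pile, can be sketched as follows. This is an illustrative reconstruction under stated assumptions: the data layout (each respondent's sort as a list of sets), the function name, and the toy items are hypothetical, and the counts are expressed as proportions of respondents so that one minus the similarity could serve as a dissimilarity input to an MDS routine.

```python
from itertools import combinations


def pile_sort_similarity(sorts, items):
    """Aggregate pile sorts into item-by-item similarities.

    sorts: one entry per respondent; each entry is a list of piles,
    and each pile is a set of item ids (hypothetical layout).
    Returns a dict mapping each unordered pair of items to the
    proportion of respondents who placed both in the same pile.
    """
    counts = {frozenset(pair): 0 for pair in combinations(items, 2)}
    for piles in sorts:
        for pile in piles:
            for pair in combinations(sorted(pile), 2):
                counts[frozenset(pair)] += 1
    n = len(sorts)
    return {pair: c / n for pair, c in counts.items()}


# Invented example: two respondents sorting three portraits.
toy_items = ["A", "B", "C"]
toy_sorts = [
    [{"A", "B"}, {"C"}],   # respondent 1 separates C
    [{"A", "B", "C"}],     # respondent 2 makes a single pile
]
similarity = pile_sort_similarity(toy_sorts, toy_items)
```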

Figure 2 can be read in terms of both clusters and dimensions. The clustering 
of items suggests that the domain consists of five major groupings corresponding 
to the core categories elicited from free lists and the identification task. Trigueño
and indio are more similar to one another than are any of the other groupings, 
but the overlap is not complete. Hierarchical cluster analysis of the pile sort data 
(not shown) confirmed five major groupings. 

Figure 2 also corroborates that skin color and hair form are the primary
organizing dimensions of the domain. The graph shows lines produced by
PROFIT (PROperty FITting) analysis, a regression-based technique for testing
hypotheses about the attributes that influence judged similarity among items
(Kruskal and Wish 1978:35). It treats map coordinates from an MDS plot as
independent variables and attributes that are hypothesized to influence perceived
similarity as dependent variables. PROFIT analysis estimates that skin color (R² =
0.87, p = .001) and hair type (R² = 0.70, p = .001) are the most salient dimensions,
but it provides no evidence that either nose or lip form influences perceived
similarity (R² = .00, p = .948, and R² = .01, p = .789, respectively). PROFIT lines are
interpreted as dimensions organizing the graph, not as boundaries separating it.
The skin color dimension runs from dark at the top to light at the bottom; hair
form runs from straight and wavy hair on the left to kinky hair on the right. The
location of items on either dimension is determined by drawing a perpendicular
line from each item to the PROFIT line. On the skin color dimension, for example,
the faces identified as blanco are lightest, followed by jabao and trigueño, with
negro and indio falling at roughly the same end of the spectrum.

Figure 2. Multidimensional Scaling (MDS) with PROperty FITting
(PROFIT) Analysis of Pile Sorts (N = 42)
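
The regression at the core of PROFIT, with MDS coordinates as independent variables and an item attribute as the dependent variable, can be sketched with ordinary least squares. This is a simplified illustration and not the software routine used in the study: the function name and toy coordinates are assumptions, and the significance testing reported in the text is omitted.

```python
def profit_fit(coords, attribute):
    """Regress an attribute on two MDS coordinates by least squares.

    coords: list of (x, y) map coordinates, one per item.
    attribute: list of numeric attribute values, one per item.
    Returns (b1, b2, r_squared); (b1, b2) gives the direction of the
    fitted property line in the MDS plot.
    """
    n = len(coords)
    mx = sum(c[0] for c in coords) / n
    my = sum(c[1] for c in coords) / n
    ma = sum(attribute) / n
    # Centered sums of squares and cross-products.
    sxx = sum((c[0] - mx) ** 2 for c in coords)
    syy = sum((c[1] - my) ** 2 for c in coords)
    sxy = sum((c[0] - mx) * (c[1] - my) for c in coords)
    sxa = sum((c[0] - mx) * (a - ma) for c, a in zip(coords, attribute))
    sya = sum((c[1] - my) * (a - ma) for c, a in zip(coords, attribute))
    saa = sum((a - ma) ** 2 for a in attribute)
    # Solve the 2x2 normal equations for the two slopes.
    det = sxx * syy - sxy ** 2
    b1 = (sxa * syy - sya * sxy) / det
    b2 = (sya * sxx - sxa * sxy) / det
    r_squared = (b1 * sxa + b2 * sya) / saa
    return b1, b2, r_squared


# Invented example: the attribute varies only along the vertical axis.
toy_coords = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
toy_attribute = [0.0, 0.0, 1.0, 1.0]
b1, b2, r2 = profit_fit(toy_coords, toy_attribute)
```

In this toy case the fitted line runs vertically and the fit is perfect, the analogue of an attribute that fully organizes one dimension of the map. An attribute unrelated to item positions would yield an R² near zero.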

These bivariate results are confirmed by a multivariate model, using the
multiple regression quadratic assignment procedure, or MRQAP (Hubert
and Schultz 1976). MRQAP treats whole matrices as variables in a regression
analysis and generates its own probability distribution by randomly permuting
rows and columns of a data matrix. This analysis treats the portrait-by-portrait
aggregate similarity matrix from the pile sorts as the dependent matrix. The four
independent matrices are portrait-by-portrait similarity matrices for each of the
four attributes: skin, hair, lips, and nose. Overall, these attributes explain 55%
of the variance in perceived similarity (p < .001). Hair type makes the largest
contribution to judged similarity among the portraits (β = .54, p < .001), while skin
color plays nearly as large a role (β = .50, p < .001). Nose form is less important
(β = .26, p < .001), and lip shape does not appear significantly to influence emic
similarity, independent of other attributes (β = .07, p = .06).
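
The permutation logic behind the quadratic assignment procedure can be illustrated with a deliberately reduced case. The sketch below uses a single predictor matrix and a correlation rather than the four-predictor regression reported above, so it is a simplified QAP test and not MRQAP itself. The function names, the number of permutations, the seed, and the toy matrices are all assumptions.

```python
import random


def offdiag_corr(a, b):
    """Pearson correlation over the off-diagonal cells of two square matrices."""
    n = len(a)
    xs = [a[i][j] for i in range(n) for j in range(n) if i != j]
    ys = [b[i][j] for i in range(n) for j in range(n) if i != j]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def qap_test(dep, pred, n_perm=1000, seed=0):
    """Single-predictor QAP: permute rows and columns of dep together."""
    rng = random.Random(seed)
    n = len(dep)
    observed = offdiag_corr(dep, pred)
    hits = 0
    for _ in range(n_perm):
        order = list(range(n))
        rng.shuffle(order)
        # The same reordering is applied to rows and columns, which
        # preserves the dependence structure within the matrix.
        permuted = [[dep[order[i]][order[j]] for j in range(n)] for i in range(n)]
        if abs(offdiag_corr(permuted, pred)) >= abs(observed) - 1e-12:
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)


# Invented matrices: four portraits, two per skin-tone group.
toy_pred = [[1, 1, 0, 0], [1, 1, 0, 0], [0, 0, 1, 1], [0, 0, 1, 1]]
toy_dep = [
    [1.0, 0.9, 0.2, 0.1],
    [0.9, 1.0, 0.1, 0.2],
    [0.2, 0.1, 1.0, 0.8],
    [0.1, 0.2, 0.8, 1.0],
]
observed, p_value = qap_test(toy_dep, toy_pred, n_perm=200)
```

Permuting rows and columns jointly, rather than shuffling cells independently, is what allows the procedure to generate a reference distribution for matrix data whose cells are not independent observations.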


3. INTERINFORMANT AGREEMENT AND CULTURAL CONSENSUS 


Romney, Weller, and Batchelder’s (1986) cultural consensus model provides a 
formal mathematical test of the assumption that informants’ responses reflect 
a shared cultural model of color. The cultural consensus model conducts a 
minimum residual factor analysis of an informant-by-informant similarity matrix 
(corrected for guessing) to determine whether a single underlying factor explains 
the pattern of interinformant agreement. If the assumption of a single culture 
holds, then consensus analysis should yield a first factor that explains most of 
the variance. This factor represents the underlying cultural model that shapes 
informants’ responses. The model fits well if the eigenvalue ratio of the first factor 
to the second is at least 3:1 and if the average knowledge across informants is 
high, as estimated by first factor loadings. 
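
The 3:1 eigenvalue criterion can be illustrated with a small sketch. It rests on several assumptions that should be read as simplifications: informants are assumed to guess uniformly among a fixed number of response options, the agreement matrix is analyzed with an ordinary eigendecomposition (by power iteration) that keeps the diagonal at 1, whereas the consensus model uses minimum residual factor analysis, and all function names and toy answers are hypothetical.

```python
def agreement_matrix(answers, n_options):
    """Informant-by-informant match proportions, corrected for guessing.

    answers: list of response lists, one per informant, same item order.
    n_options: number of response options assumed equally likely when guessing.
    """
    n, k = len(answers), len(answers[0])
    m = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            match = sum(a == b for a, b in zip(answers[i], answers[j])) / k
            m[i][j] = (n_options * match - 1) / (n_options - 1)
    return m


def top_eigen(matrix, iters=500):
    """Largest-magnitude eigenvalue and eigenvector by power iteration."""
    n = len(matrix)
    v = [1.0 + 0.01 * i for i in range(n)]
    for _ in range(iters):
        w = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    lam = sum(
        v[i] * sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)
    )
    return lam, v


def eigen_ratio(matrix):
    """First and second eigenvalues and their ratio, using deflation."""
    lam1, v1 = top_eigen(matrix)
    n = len(matrix)
    deflated = [
        [matrix[i][j] - lam1 * v1[i] * v1[j] for j in range(n)] for i in range(n)
    ]
    lam2, _ = top_eigen(deflated)
    return lam1, lam2, lam1 / lam2


# Invented answers: four informants, six items, three response options.
toy_answers = [
    [0, 1, 2, 0, 1, 2],
    [0, 1, 2, 0, 1, 2],
    [0, 1, 2, 0, 1, 1],
    [0, 1, 2, 0, 2, 2],
]
lam1, lam2, ratio = eigen_ratio(agreement_matrix(toy_answers, n_options=3))
```

For these toy answers the first eigenvalue is more than six times the second, which would satisfy the 3:1 rule of thumb. With real data, the first-factor loadings would additionally be inspected as estimates of each informant's knowledge.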

Cultural consensus analysis of the identification task data was implemented 
in ANTHROPAC software (Borgatti 1996). The analysis indicates that the 
assumption of shared culture holds. The first factor explains roughly 76% of the 
variance, and its eigenvalue is more than four times larger than that of the second 
factor. The respondents’ moderately high average of estimated knowledge (.62
± .15) suggests that knowledge about the model of color classification is shared
across the sampled range of variation in age, sex, class, and color. The model 
also provides an idealized estimate of the culturally appropriate responses to the 
identification task. The resulting consensus classification of Harris’s standardized 
faces requires five terms: blanco, negro, trigueño, indio, and jabao. Estimated
reliability of the model is high (.96). 

Figure 3 illustrates the pattern of agreement for the pile sort data, following
Handwerker's (2002) method. Handwerker extends the logic of cultural consensus
analysis by using a principal components analysis (PCA) of informants (rather
than variables) to determine whether interinformant agreement reflects a
common, underlying culture. He argues that high informant loadings on the
first factor, combined with low loadings on the second, constitute evidence of a
single culture. Figure 3 plots first and second factor loadings from a PCA of the
42 pile sort informants. The tight clustering of informants along the right-hand
edge of the graph reflects consistently high loadings on the first factor (mean =
.84, S.D. = .04) and low loadings on the second (< ±.35). The first factor explains
more than 71% of variance, and its eigenvalue is nearly 28 times larger than that
of the second. By Handwerker’s (2002) criteria, these results are strong evidence
of high interinformant agreement and a single cultural model of color.

Figure 3. Scatterplot of Loadings on Factor 2 by Loadings on Factor 1;
Construct Validity Analysis for Pile Sort Data


Discussion and Conclusion 


This exploratory study is the first to apply new systematic ethnographic methods 
to describe the cultural model of color in Puerto Rico. It addresses three empirical
questions based on previous work: (1) what are the core emic categories that
constitute this model? (2) what is the semantic structure that organizes it? and 
(3) does the pattern of interinformant agreement justify the assumption that 
there is a single, coherent cultural model of color? The findings confirm key 
elements of previous research. But they also provide new insights that inform 
ongoing discussions about ethnicity and racism in Puerto Rico and point to ways 
that cognitive anthropologists can contribute to the comparative study of ethnic 
classification across societies. 

Free listing and the replication of Harris's identification task independently 
confirm previous observations regarding the proliferation of terms for designating 
color. However, both elicitation techniques identify a small core of culturally 
salient categories: blanco, trigueño, indio, negro, and jabao. The salience of these
categories is also evident in the pile sort data, which reveal five major groupings 
in informants’ aggregate perception of Harris’s standardized faces. Results from 
the free lists, identification task, and pile sorts differ from other descriptions of 
Puerto Rican terms for color (e.g., Duany 2002; Godreau 2000) because they 
were elicited with transparent methods designed to discover the concepts and 
categories that are meaningful to informants. The results thus provide new 
information about the contents and boundaries of the cultural model of color, 
without imposing a prior conceptual framework. 

This study also confirms previous suggestions that skin color and hair form are 
the primary dimensions of semantic structure (Duany 2002; Seda Bonilla 1991). 
Harris’s standardized facial portraits are useful in this regard because they vary 
systematically in five attributes hypothesized to influence color categorization in 
Puerto Rico: skin tone, hair texture, nose shape, lip form, and sex. We can make 
greater use of this design now than Harris could in his original study. Relatively 
new analytic methods like multidimensional scaling, correspondence analysis, 
PROFIT analysis, and multiple regression quadratic assignment procedure allow 
us to visualize dimensions of semantic structure and to measure the relative 
importance of various criteria for the ascription of color. Using these methods, 
this study provides more direct evidence of how the cultural model of color is 
organized than was possible in previous studies. Future research could extend this 
work by using photographs rather than drawings and by including nonphenotypic 
markers of social status such as occupation, dress, and residence to test the idea 
that “money whitens” in Puerto Rico (Kay 1978:89). 

An important difference between this and previous studies is that it formally 
tests whether there is a coherent cultural model of color. Culture, as most social 
scientists understand it, is a multidimensional construct that refers to patterns 
of shared, socially transmitted cognition and behavior (Brumann 1999). Until 
recently, there was no formal way to assess the validity of this construct, but new 
developments make it possible to verify the existence and location of cultural 
boundaries and to justify the assumption of shared culture (Romney et al. 
1986; Weller 1987; Handwerker 2002). This study demonstrates a high level of
agreement among informants, which indicates a shared cultural understanding
of color. Furthermore, formal properties of the cultural consensus model and 
the theory-driven sampling strategy permit confidence in the findings, even with 
sample sizes that are small by comparison to standard approaches (Romney et al. 
1986). Yet it remains to be determined whether the findings from southeastern 
Puerto Rico generalize to other parts of the island, including the mountainous 
interior and the San Juan metropolitan area. Another important extension 
would be to investigate whether mainland and island Puerto Ricans participate 
in different cultures of color and how these cultures change with the experience 
of migration and acculturation. 

Direct comparisons with previous research in Puerto Rico are difficult because 
of differences in research setting (e.g., Guayama versus San Juan) and historical 
confounds related to the growing U.S. political and economic influence on the 
island over the last half century. Even so, the evidence for a coherent cultural 
model of color is consistent with some prior research. In the 1960s, Seda Bonilla 
(1991) asked a nationally representative sample of approximately 1,800 people to 
group 14 black-and-white photographs according to racial similarity. In contrast 
to others' emphasis on ambiguity and disagreement (e.g., Rogler 1944), Seda 
Bonilla (1991:184) reported “a high level of consensus.” Ginorio and Berry (1972) 
asked 250 high school students in Puerto Rico to rate 60 color photographs on 
a scale from “más blanco” (whiter) to “más negro” (blacker). They found that
students rated photographs “with extraordinary consistency” (1972:288). 

This study also relates to recent work on the significance of ambiguity in 
the everyday experience of color. For example, Godreau (2000) argues that 
Puerto Ricans use the “slippery semantics” of color for many reasons, including 
to avoid being victims of racism or to build distance and intimacy in mundane 
social interaction. Yet Godreau’s notion of “slippery semantics” does not imply 
referential ambiguity in the sense of Harris’s classic Brazilian study. Instead, she 
draws attention to how people manipulate shared understandings of color in 
response to changing social contexts: “There are social norms that guide which 
criteria are used to establish phenotypic distinctions between negros and trigueños”
in Puerto Rico, even if “these distinctions and norms are complicated by the use 
of euphemism” (2000:56, my translation). This study complements Godreau’s 
analysis by treating the existence of “social norms” as an empirical matter. By 
establishing the coherence and structure of these norms, this study enhances 
Godreau’s emphasis on how Puerto Ricans embrace or obscure shared meanings 
of color to meet different ends in everyday social interaction. 

My replication of Harris’s method invites comparison with his seminal study, 
which argues that “the most distinctive attribute of the Brazilian ‘racial’ calculus 
is its uncertain, indeterminate, and ambiguous output” (1970:1). There are many 
obvious differences between Brazil in the 1960s and Puerto Rico in 2001, even if 
earlier scholars argued that race relations in Puerto Rico “offers certain similarities 
to Brazil” (Hoetink 1967:38). Still, one wonders whether the application of
methods that were unavailable to Harris might yield new insights into the
Brazilian “racial calculus.” Byrne and Forline (1997) addressed this question by 
reanalyzing Harris's data with methods similar to those used here. Byrne and 
Forline verified Harris's finding that there was no single, shared cultural model 
across the entire sample. However, when they analyzed respondents from different 
parts of Brazil separately, they discovered evidence of cultural consensus within 
regional subsamples. Byrne and Forline also estimated the relative salience of skin 
color, hair type, and facial features in the categorization of Harris's standardized 
faces. In contrast to Harris, they find “an orderliness in the dimensions shaping 
the cognitive domain” (1997:24) and establish the primacy of skin color and hair 
form as the organizing principles of the domain. 

This finding closely parallels Sanjek’s (1971) results. Indeed, Byrne and 
Forline’s discovery of regional subcultures probably explains why Harris and 
Sanjek reached different conclusions: Harris sampled from across Brazil, while 
Sanjek worked in a single locale. Although Sanjek identified a large corpus of 
terms (116), he found that just ten terms accounted for 85% of all categorizations. 
Only six terms were used by half or more of Sanjek’s respondents—a striking 
parallel to the results in Puerto Rico. In addition, Sanjek reported that more 
than 80% of his informants shared a basic cognitive map of the Brazilian “racial 
lexicon” (1971:1128) and that there are regular patterns in how children learn to 
discriminate according to this map. He also speculated that “skin color and hair 
form are the two basic components which order this domain” (1971:1130), but 
he lacked analytic methods to measure the amount of interinformant agreement 
or to test his hypothesis regarding the semantic structure of the domain. 

To place this and earlier ethnographic studies in context, it is useful to 
consider Handwerker’s (2002) distinction between “life experience” and “cultural” 
data. Life experience data include personal attributes and events that reflect an
individual’s unique life history. Questions such as “What is your race?” and “How
old are you?” elicit such information. Cultural data, on the other hand, include
information about the shared meanings that people acquire in social interaction
and use to guide and interpret their experience of the world. Cultural data come
from questions like “How do you know what race someone belongs to?” or “What
changes do people experience as they age?” In other words, life experience data
deal with the labels (e.g., race, age) people apply to themselves, while cultural
data concern the shared meaning and definitions associated with those labels.

Recent sociological studies of “race” in Puerto Rico and elsewhere focus on 
the labels people apply to themselves and others. For example, Landale and 
Oropesa (2002) study differences in how mainland and island Puerto Rican
women identify their race. Telles (2002) examines the concordance between 
self-identification and observer categorization of race in Brazil. Harris (2002) 
studies racial identification across different social contexts among adolescents in 
the United States. These studies of racial identity examine life experience data, 
yet implicit in the studies is a concern for cultural data: how racial classification
is affected by “shifting racial regimes” (Harris 2002:624), “popular beliefs about
race" (Telles 2002:417), or the fact that Puerto Rican “definitions of race are 
more flexible and ambiguous than is the case in the U.S.” (Landale and Oropesa 
2002:234). Ethnographic research on these cultural phenomena thus can help to 
clarify the meaning and measurement of “race” across disciplines. 

One relevant example is the decennial census of Puerto Rico. In 2000, for
the first time in 50 years, the census asked Puerto Ricans to identify their “race.” 
Over 80% self-identified as “white” (United States Bureau of the Census 2001). 
The meaning of this result is unclear, however, since islanders were asked to 
choose from official racial and ethnic categories established by the U.S. federal 
government. Duany (2002:244) recently noted that “no published studies have 
yet explored the congruence between popular representations of race in Puerto 
Rico and the official racial categories of the United States.” Preliminary evidence 
from this study suggests little correspondence between census categories and 
locally salient distinctions of color. An experimental comparison of measures 
using census categories versus the core categories identified in this study (cf. 
Byrne et al. 1995; Harris et al. 1993) would better test whether census estimates 
accurately describe the demographic profile of Puerto Rico. 

Finally, this study contributes to the debate about the utility of “race” as an 
analytic framework in cross-cultural research. Valid cross-cultural comparisons 
require a distinction between folk concepts and abstract theoretical constructs 
that transcend the limits of a particular cultural context (Banton 2001:174). 
Yet comparisons between the United States and Latin American societies are 
typically framed in terms of race, a culture-bound concept that presupposes a 
set of meanings rooted in the American experience (Smedley 1998). Others have 
hinted at the incompatibility of race with emic constructs in other societies. 
Nobles (2000:86) points out that “Brazilian censuses have not counted by race as 
such. The Portuguese word côr (“color”) refers to physical appearance, not racial
origins.” Seda Bonilla warned that “what North Americans call ‘race’ is not . . .
synonymous with what Latin Americans designate with the same name” (1972:90,
my translation). For this reason, Harris et al. (1993:460) use the expression
“race-color” to convey the difference between the Brazilian emic concept of côr
and the North American emic concept of race.

Imposing race as an analytic framework in societies where it is not a salient 
emic construct poses problems of measurement and interpretation (Hoetink 
1967:34, 51-2; Bourdieu and Wacquant 1999; Seda Bonilla 1972, 1991). Consider
Landale and Oropesa's (2002) study of racial identification among mainland and
island Puerto Rican women, which uses two measures of “race”: a closed-ended 
question based on U.S. federal categories and an open-ended item, "What race do 
you consider yourself?" Response choices for the closed-ended question—white, 
black, five Asian groups, American Indian, and other—clearly do not capture 
locally meaningful distinctions. It is also unlikely that the open-ended question 
elicits identities that are comparable to “race” in the United States. In Puerto Rico,
the concept of raza is closely linked to an ideology of mestizaje, or intermixture,
that rejects racial differences among Puerto Ricans and emphasizes instead the
blending of Spanish, Taíno, and African influences into a singular form of Puerto
Ricanness: la raza puertorriqueña (Dávila 1997). It is not surprising, therefore,
that more than half of Landale and Oropesa’s respondents identified their raza as
Puerto Rican. Yet most Puerto Ricans have a heightened awareness of differences 
in color, even if they form a single, unique raza. It remains to be determined how 
the use of more appropriate emic categories may affect our understanding of 
self-identification and categorization on the island. 

A corollary of this point is that race should be examined as an emic concept 
alongside relevant emic constructs in other societies. Mukhopadhyay and Moses 
(1997:521) make this point in noting anthropologists’ blind eye to race: “Nor 
have efforts to understand indigenous systems of classification (of plants, colors, 
animals, or kin) been extended and applied to racial and other Euro-American 
human classificatory principles.” Gil-White (1999) likewise notes that social 
scientists almost uniformly accept Barth's (1969) contention that ethnicity is 
about ascriptive cognitive boundaries, making the prevailing approach “self- 
consciously emic" (Gil-White 1999:792, emphasis in original). Yet surprisingly 
little work exists on the emic cognition of ethnic boundaries in the United States 
or elsewhere. This "glaring methodological gap" (Gil-White 1999:792) creates a 
need for more studies like this one to take advantage of advances in methods for 
studying the content and distribution of cultural knowledge. 


Notes 


1. Harris provided me with the original portraits and encouraged me to produce copies 
for my use in Puerto Rico. Digital images of the portraits are available at
http://qualquant.net/harris. Unstructured pretesting established that the drawings tapped a salient cultural
domain in Puerto Rico. Key informants spontaneously described the color of Harris’s faces 
and frequently commented on the similarity of portraits to friends or family. In one small 
group, a young woman remarked, “you see a lot of that in the street here.” Another confirmed,
“Yes, they look a lot like people from around here,” and an older man identified a younger
version of himself in one of the portraits. All were surprised that the drawings originally 
were meant to portray Brazilians. 


2. In their reanalysis of Harris’s (1970) data, Byrne and Forline (1997) aggregated terms by 
their principal lexemes. My decision to treat modified terms as a separate category (e.g., 
blanco versus blanco+) is a more conservative analytic approach that errs on the side of
underestimating the coherence of the domain. 


3. According to rule-of-thumb guidelines, the stress of .186 is somewhat high, suggesting 
that the best fit for the data may require more than two dimensions. Plotting the data in 
three dimensions lowers the stress to .129. However, a recent simulation study (Sturrock and 
Rocha 2000) suggests that the stress of .186 in two dimensions is acceptably low, given the 
number of items. We can regard this graph, therefore, as a reasonably good representation 
of the semantic structure in this domain.


The Unintended Significance of Race:
Environmental Racial Inequality in Detroit* 


LIAM DOWNEY, University of Colorado 





Abstract 


This article addresses shortcomings in the literature on environmental inequality 
by (a) setting forth and testing four models of environmental inequality and (b) 
explicitly linking environmental inequality research to spatial mismatch theory 
and to the debate on the declining significance of race. The explanatory models 
ask whether the distribution of blacks and whites around environmental hazards 
is the result of black/white income inequality, racist siting practices, or residential 
segregation. The models are tested using manufacturing facility and census data 
from the Detroit metropolitan area. It turns out that the distribution of blacks and 
whites around this region's polluting manufacturing facilities is largely the product 
of residential segregation which, paradoxically, has reduced black proximity to 
manufacturing facility pollution. 


Over the last 15 years, an expanding body of research has sought to ascertain 
whether environmental hazards are distributed inequitably according to race and 
income. In particular, researchers want to know whether minorities, the poor, 
and the working class are more likely than whites and wealthier individuals to 
live near environmental hazards. Researchers are interested in these questions 
because of the possible health impacts of exposure to environmental pollutants 
and because survey data suggest that regardless of race or income, people do not 
want to live near hazardous waste sites or polluting industrial facilities (Mohai 
and Bryant 1998). 

Despite increased levels of concern about environmental inequality, mainstream 
sociology has paid relatively little attention to this topic. This is unfortunate in
a couple of respects. On the one hand, the study of environmental inequality
provides sociologists with an opportunity to introduce sociological thinking to 
a field of study with important public policy and health ramifications. On the 
other hand, environmental inequality research has much to teach sociologists 
about the relationship between race and class and about the role racism plays in 
shaping social outcomes. 

For example, several environmental inequality studies have found 
environmental hazards to be distributed equitably according to race in areas 
where they are distributed inequitably according to income, despite the fact that 
minorities generally earn significantly less than whites (Anderton et al. 1994a; 
Oakes, Anderton, and Anderson 1996; Yandle and Burton 1996). But if income is 
negatively associated with environmental hazard presence and minorities earn less 
than whites, then minority presence should be positively associated with hazard 
presence. That this is not always the case suggests that racial status sometimes 
decreases minority representation in environmentally hazardous neighborhoods 
below what we would expect based on income alone. This raises interesting 
and important theoretical questions. Are there situations in which racial status 
separates minorities from socially undesirable goods such as pollution? More 
specifically, can racism in one institutional arena (for example, the housing 
market) weaken racial inequality in another arena (the environment)? 

In order to answer these questions, I use 1970, 1980, and 1990 data to track the 
flow of social groups and polluting manufacturing facilities through the Detroit 
metropolitan area, a region where environmental racial inequality is relatively 
weak despite the fact that black/white income inequality and income-based 
environmental inequality are relatively strong. I use census and manufacturing 
facility data to test four models of environmental inequality that make predictions 
about the determinants of manufacturing facility siting decisions and the impact 
that manufacturing facility presence has on neighborhood demographics. Taken 
collectively, the models ask whether the distribution of blacks and whites around 
polluting manufacturing facilities is the result of black/white income inequality, 
racist siting practices, or the biased operation of the housing market. 

As we shall see, the distribution of blacks and whites around Detroit’s polluting 
manufacturing facilities is not the result of black/white income inequality or 
racist siting practices. Instead, it is largely the product of residential segregation 
which, paradoxically, has reduced black proximity to manufacturing facility 
pollution. Thus, it turns out that racial status and racism play an important 
role in shaping environmental inequality in the Detroit metropolitan area, but 
they operate indirectly through the housing market, and their effect has been to 
separate blacks from manufacturing facility pollution.


Environmental Inequality


Environmental inequality is a relatively new concept that gained national attention 
in the late 1980s and early 1990s due to the efforts of grassroots environmental
justice activists and professional researchers. Calling into question the mainstream
environmental movement’s assumption that environmental degradation 
affects everyone equally, environmental justice activists and researchers argue, 
among other things, that the poor, the working class, and people of color are 
disproportionately burdened by environmental hazards. 

This assertion has become the subject of a growing body of academic research 
(Anderton et al. 1994a, b; Bowen 2002; Bowen et al. 1995; Brown, Ciambrone, and 
Hunter 1997; Hunter 1998; Lester, Allen, and Hill 2001; Mohai and Bryant 1992; 
Morello-Frosch, Pastor, and Sadd 2001; Pastor, Sadd, and Hipp 2001; Ringquist 
2000; Sadd et al. 1999; Szasz and Meuser 1997; Yandle and Burton 1996). An 
important debate in this new field centers on the relative importance of race 
and income in predicting who lives near environmental hazards (Mohai and 
Bryant 1992). The debate takes the following form: Are minorities more likely 
than whites to live near environmental hazards, and if so, is it because of their 
racial status or because minorities tend to have lower incomes than whites? While 
this debate has generated some important insights, it has also been plagued by 
several critical shortcomings, three of which are discussed here. 

First, because researchers have been preoccupied with determining the 
predictive power of race and income at a single point in time, they have 
devoted little attention to studying the historical processes that have given rise 
to environmental inequality (exceptions include Krieg 1995; Oakes, Anderton, 
and Anderson 1996; Pastor, Sadd, and Hipp 2001). Second, hypotheses about 
the determinants of environmental inequality have not been organized into 
coherent models that can be formally tested. Third, environmental justice research 
is not theoretically grounded. For example, although many environmental 
inequality researchers study manufacturing facility pollution, few, if any, discuss 
deindustrialization or class conflict, factors that are likely to play an important 
role in shaping environmental inequality. Similarly, few links have been made 
between environmental inequality research, the declining significance of race 
debate, and spatial mismatch theory, despite the fact that the debate on the 
declining significance of race focuses closely on the relative importance of race 
and income in explaining racial inequality and the fact that spatial mismatch 
theory hypothesizes a spatial disjuncture between manufacturing facilities and 
poor, urban black neighborhoods. 

This article advances our understanding of environmental inequality by (a) 
setting forth four models of environmental inequality, (b) using these models 
to isolate the historical determinants of environmental inequality in the Detroit 
metropolitan area, and (c) explicitly linking environmental inequality research 
to the declining significance of race debate and spatial mismatch theory. 


Spatial Mismatch Theory and Environmental Inequality 


Spatial mismatch theory highlights the manner in which industrial investment 
patterns since World War II have differentially affected black and white workers. 
According to spatial mismatch theorists, industrial flight from the urban core to 
the suburbs and rural areas after World War II hurt black workers more than white 
workers because residential segregation prevented blacks from leaving the central 
city, creating a spatial mismatch between the location of black neighborhoods 
and the location of manufacturing jobs, and because blacks relied more heavily 
on manufacturing employment than did whites (Darden et al. 1987; Farley, 
Danziger, and Holzer 2000; Frey 1984, 1987; Ihlanfeldt and Sjoquist 1998; Kain 
1968; Kasarda 1995; Massey and Denton 1993; Mouw 2000; Sugrue 1996; Wilson 
1987). 

But if residential segregation and industrial flight have created a spatial 
mismatch between the location of black neighborhoods and the location of 
manufacturing jobs, it stands to reason that these factors have also created a spatial 
mismatch between black neighborhoods and manufacturing facility pollution. 
Thus, if spatial mismatch theory is correct, we would expect the association 
between neighborhood racial composition and manufacturing facility pollution 
levels to be weak, nonexistent, or biased against whites, an expectation that clearly 
contradicts that put forth by environmental inequality researchers. 


The Declining Significance of Race Debate 


The declining significance of race debate arose in response to Wilson's (1978) 
claim that the importance of race has declined in the modern industrial period. 
According to Wilson, prior to World War II there was little differentiation among 
blacks in the labor market. Since then, however, labor market discrimination has 
decreased and occupational differentiation among blacks has increased, so much 
so that the life chances and experiences of blacks now depend much more on 
their class position than their racial status. The increased residential mobility of 
middle-class blacks since the civil rights era has reinforced the salience of class by 
allowing middle-class blacks to flee urban ghettos, thereby isolating poor blacks. 
As a result, the life experiences and opportunities of middle-class blacks are very 
similar, if not identical, to those of middle-class whites and increasingly different 
from those of poor and working-class blacks (Wilson 1987). 

Wilson's thesis has been critiqued by many researchers (Collins 1997; Feagin 
and Sikes 1994; Sugrue 1996; Waldinger 1996). But the most prominent critique of 
his thesis is probably that put forth by Massey and Denton (1993), who document 
in great detail the extreme and unique degree to which blacks historically have 
been and currently are segregated from whites. They argue that in the post-civil 
rights era, residential segregation and high black poverty rates have combined 
to concentrate poverty in black neighborhoods at much higher rates than in 
white neighborhoods. Segregation has also isolated poor, urban blacks from 
good schools, other social groups, and the social and occupational networks 
that help individuals achieve economic success. Finally, segregation has made it 
difficult for middle-class blacks to distance themselves spatially from the poor 
and working class and prevented them from residing in the same neighborhoods 
as middle-class whites. 

In taking this position, Massey and Denton are arguing not only that racism 
and racial status continue to play a decisive role in shaping blacks’ lives but also 
that residential segregation consistently aggravates racial inequality through 
its impact on blacks’ social and political networks, neighborhood institutions, 
educational opportunities, and residential mobility. Thus, the debate between 
Wilson and his critics revolves around two issues. First, is racial inequality 
caused by racial discrimination or is it merely a byproduct of class inequality? 
And second, does racial inequality in specific institutional arenas arise from 
intentionally racist acts in those arenas or from the racially biased operation of 
other institutions? 


Explanatory Models 


In order to understand what role, if any, racial status and racism play in 
determining who lives near polluting manufacturing facilities, I set forth and 
test four models of environmental inequality: an economic model, a racist intent 
model, a simple segregation model, and a racial succession model. The first 
three models are drawn from the literature on environmental inequality, while 
the fourth is drawn from the broader sociological literature on race and urban 
development. 

Before describing the models, two points must be made. First, although each 
of the four models provides a logically distinct explanation of environmental 
inequality and although predictions made by one model often contradict 
predictions made by the other models, the models are not entirely exclusive. 
For example, as we shall see, the economic model predicts that industrial facilities 
will be sited in low-income neighborhoods, while the racial succession model 
predicts that blacks will be drawn into neighborhoods immediately adjacent to 
already-black neighborhoods. Although it is certainly possible that only one of 
these predictions will be supported by the evidence, it is also possible that both 
(or neither) will be supported by the evidence. 

As we shall also see, the models focus on different aspects of the environmental 
inequality formation process. (This term is borrowed from Pellow 2000.) 
For example, one explanatory model focuses solely on siting decisions, and 
two explanatory models focus solely on demographic changes in facility 
neighborhoods. Thus, it is quite possible that more than one model will be 
supported by the evidence presented below. 

Second, the four models make predictions about individual and firm-level 
behavior and the aggregate, neighborhood-level consequences of these predicted 
behaviors. For example, the economic model makes predictions about firm-level 
siting decisions and the neighborhood-level consequences of these decisions, 
and the racial succession model makes predictions about the housing choices 
of individual whites and the neighborhood-level consequences of these choices. 
However, because appropriate data are not available to test the individual and 
firm-level predictions of the models, I test the neighborhood-level predictions 
instead. 


AN ECONOMIC MODEL OF ENVIRONMENTAL INEQUALITY 


The economic model makes income-based predictions about industrial siting 
decisions and the impact of facility location on neighborhood demographics. 
Specifically, the model holds that industrial facilities tend to be sited in low-income 
neighborhoods because property values tend to be low in these neighborhoods 
and firms attempt to site facilities where property values are low (Mohai and 
Bryant 1992). The model predicts, therefore, that neighborhood incomes and 
property values should be negatively associated with the number of new facilities 
in a neighborhood, the total number of facilities in a neighborhood, and changes 
over time in the number of facilities in a neighborhood. Moreover, because blacks 
tend to have lower incomes than whites, blacks should be more likely than whites 
to live in facility neighborhoods and in neighborhoods with new facilities. As a 
result, the percentage of blacks in a neighborhood should be positively associated 
with the number of facilities in a neighborhood and changes over time in the 
number of facilities in a neighborhood. However, blacks and whites with the 
same incomes should be equally represented in facility neighborhoods and in 
neighborhoods with new facilities. 

The model goes on to note that once industrial facilities are sited in a 
neighborhood, its property values, rental values, and environmental integrity 
begin to decline, leading wealthier individuals to flee the neighborhood and poorer 
ones to move in (Hamilton 1995; Mohai and Bryant 1992). Therefore, income 
levels in neighborhoods surrounding polluting facilities should decline over time 
relative to overall area incomes, and blacks should be drawn disproportionately 
into facility neighborhoods because of their relatively low incomes. Put more 
formally, the model predicts that neighborhood property values and rents should 
be positively associated with changes over time in neighborhood income levels 
and negatively associated with changes over time in the percentage of blacks in 
a tract. It also predicts that facility presence should be negatively associated with 
changes over time in neighborhood income levels and positively associated with 
changes over time in the percentage of blacks in a neighborhood. 

Table 1 summarizes these predictions in terms of the independent and 
dependent variable concepts embedded in each of them. The table is divided into 
four sections, one for each explanatory model under consideration, and lists the 
dependent and independent variable concepts embedded in each prediction as 
well as the expected relationship between them. Variable concepts are listed rather 
than the actual indicators employed below in order to keep my theoretical and 
methodological discussions separate. Later in the article, I include a table (Table 
2) that organizes these predictions according to the dependent variables used to 
test them and that lists the indicators used in my regression models. Tables 1 and 
2 in combination help clarify how I translate the explanatory model predictions 
discussed in this section into the regression models I estimate below. 


Table 1. Explanatory Model Predictions 


Economic Model Predictions 

Dependent variable concepts             Independent variable concepts   Expected relationship 
Facility presence and facility change   Neighborhood incomes            Negative 
                                        Neighborhood property values    Negative 
                                        Percent black                   Positive 
Income change                           Facility presence               Negative 
                                        Property values                 Positive 
                                        Rents                           Positive 
Percent black change                    Facility presence               Positive 
                                        Property values                 Negative 
                                        Rents                           Negative 


Racist Intent Model Predictions 

Dependent variable concepts             Independent variable concepts   Expected relationship 
Facility presence and facility change   Percent minority                Positive 


Simple Segregation Model Predictions 

Dependent variable concepts             Independent variable concepts   Expected relationship 
Percent black change                    Facility presence               Positive 


Racial Succession Model Predictions 

Dependent variable concepts             Independent variable concepts   Expected relationship 
Percent black change                    Neighborhood adjacency          Positive 
                                        Facility presence               Not positive 


A RACIST INTENT MODEL OF ENVIRONMENTAL INEQUALITY 


The racist intent model focuses solely on siting decisions. According to the model, 
environmental hazards are intentionally placed in minority neighborhoods 
because decision makers (politicians, government bureaucrats, and businessmen) 
and their constituents are racially prejudiced. Thus, the model holds that the 
percentage of minorities in a neighborhood should be positively associated with 
the number of new facilities in a neighborhood, the total number of facilities 
in a neighborhood, and changes over time in the number of facilities in a 
neighborhood (see Table 1; Been 1994; Hamilton 1995). 


A SIMPLE SEGREGATION MODEL OF ENVIRONMENTAL INEQUALITY 


The simple segregation model focuses on the racial impact of facility location 
on demographic change. According to the model, residential segregation helps 
produce environmental racial inequality because whites have greater residential 
choice than blacks and are therefore better able than blacks to flee environmentally 
hazardous neighborhoods. Due to their relatively limited residential options, 
blacks are also more likely than whites to move into environmentally hazardous 
neighborhoods (Bullard 1993; Godsil 1991; Mohai and Bryant 1992). Thus, 
the model predicts that blacks will be drawn disproportionately into facility 
neighborhoods and that whites will be drawn disproportionately away from such 
neighborhoods. As a result, facility presence should be positively associated with 
changes over time in the percentage of blacks in a neighborhood (see Table 1). 


A RACIAL SUCCESSION MODEL OF ENVIRONMENTAL INEQUALITY 


According to the racial succession model, facility location has relatively little 
impact on neighborhood racial change. Instead, the distribution of racial groups 
around manufacturing facilities is believed to be an outcome of general processes 
of demographic change and racial succession that apply to urban areas as a 
whole. Racial succession can be defined as “... the replacement of whites by 
nonwhites within the boundaries of a given neighborhood” (White 1984:165). 
In this model the proximity of white neighborhoods to black neighborhoods is 
seen as the prime determinant of racial succession. 

Specifically, the model predicts that racial succession is most likely to occur 
in white neighborhoods that border black neighborhoods. This is because racial 
succession tends to occur, according to the model, when panicked whites flee their 
homes in the belief that their neighborhoods are about to be “overrun” by black 
settlers, thereby opening up their neighborhoods for black settlement. Since white 
panic is likely to be greatest and is easiest to produce in white neighborhoods that 
are adjacent to primarily black neighborhoods, these are the neighborhoods where 
the model predicts racial succession is most likely to occur (Massey and Denton 
1993; Sugrue 1996). The result is (a) the persistence of black-white residential 
segregation and (b) the expansion of black neighborhoods and ghettos at their 
edges, with blacks moving into manufacturing neighborhoods and corridors 
solely because of the ghetto's overall expansion. 

Thus, the racial succession model holds that residential segregation prevents 
blacks from being drawn disproportionately into manufacturing neighborhoods 
and corridors by channeling racial succession into neighborhoods immediately 
adjacent to the black ghetto regardless of whether or not those neighborhoods 
house manufacturing facilities. This, in turn, reduces black proximity to polluting 
manufacturing facilities below what we might otherwise expect. 

Put more formally, the racial succession model predicts that changes over time 
in the percentage of blacks in a tract should be greater in white neighborhoods 
that are adjacent to black neighborhoods than in white neighborhoods that are 
not adjacent to black neighborhoods. Thus, neighborhood adjacency should be 
positively associated with changes over time in the percentage of blacks in a tract. 
Finally, facility presence should not be positively associated with changes over 
time in the percentage of blacks in a tract. 


Data 


I obtained demographic data for the Detroit metropolitan area from the 1970, 
1980, and 1990 U.S. Census and data for manufacturing facilities from the 1970, 
1980, and 1990 Michigan manufacturers directories (Manufacturer Publishing 
Co. 1970; Pick Publications 1980, 1990). Researchers who study manufacturing 
facility pollution normally use the Environmental Protection Agency’s (EPA) 
Toxic Release Inventory (TRI), which records the number of pounds of toxic 
chemicals released into the environment each year by individual manufacturing 
facilities. However, because the TRI has only been available since 1987, it cannot 
be used to evaluate historical hypotheses. Michigan manufacturers directories, 
which list the names, addresses, and standard industrial classification codes of 
manufacturing facilities throughout the state, do not have this limitation, so I 
use them instead. 

In order to ensure that the manufacturing facilities included in this study are 
among the most hazardous in the region, I restricted the study to facilities in four 
highly polluting industrial sectors: transportation equipment, chemicals, primary 
metals, and fabricated metal products. These industries are well represented 
both currently and historically in Detroit’s industrial economy, and according 
to the TRI were the four most highly polluting industrial sectors in the Detroit 
metropolitan area in 1992, responsible for approximately 99% of the region’s 
TRI emissions that year. 

One potential problem with using data on manufacturing facility location 
rather than data on manufacturing facility pollution to create environmental 
hazard proximity indicators is that manufacturing facilities do not all produce 
the same amount of pollution. Moreover, older facilities, which are more likely 
to be located in inner-city Detroit than in Detroit's suburbs, may produce more 
pollution than newer facilities, which are more likely to be located in the suburbs. 
However, when I correlate 1990 manufacturing facility pollution indicators 
with the demographic variables used in this study (results not shown here), I 
obtain results almost identical to those I get when I correlate these demographic 
variables with the facility presence indicators used in this study (results for these 
latter correlations are shown in Table 3a). Moreover, I am not alone in using 
facility proximity (rather than pollution proximity) to measure environmental 
inequality. Many researchers have done so (for examples, see Anderton et al. 
1994a, b; Mohai and Bryant 1992; Oakes, Anderton, and Anderson 1996; Szasz 
and Meuser 1997). 


Study Area 


The Detroit metropolitan area is defined here as the city of Detroit and the 
cities and townships immediately surrounding Detroit. I use this study area 
definition rather than the six-county definition currently employed by the U.S. 
Census Bureau because this article is part of a broader study of Detroit that uses 
tract-level census data and digital census tract maps going back to 1940. Since 
the Detroit metropolitan area was much smaller in 1940 and 1970 than it was 
in 1990, and because of the great difficulties inherent in creating usable digital 
tract maps prior to 1980 and matching census tract observations over time, it 
was necessary to restrict the study area as described above. 

One possible objection to restricting the study area in this way is that in doing 
so I have limited variation in the data by restricting the study to that part of the 
six-county region with extremely high minority concentrations. However, in 1990 
the study area population was still majority white (54% white, 43% black), and 
in 1970 more than half of all Detroit city residents were white, while virtually 
none of the suburban population was black. Thus, there is considerable racial 
variation in the data in 1970, 1980, and 1990, and the same is true for the other 
variables included in the analysis. 

Moreover, virtually every environmental inequality study I am aware of 
arbitrarily restricts itself in some way to a study area that limits variation in 
the data. For example, most quantitative environmental inequality studies focus 
on urban areas rather than rural areas, several urban environmental inequality 
studies exclude suburban areas from consideration, and only a few environmental 
inequality researchers have ever conducted statewide or national environmental 
inequality studies that would maximize data variation (see Szasz and Meuser 
1997 for a review of the literature). 

Nevertheless, a few words about the possible impacts of using my study 
area definition are in order. From 1970 to 1990, the metropolitan area suburbs 
outside my study area were virtually all white and by 1990 the city of Detroit 
was virtually all black. At the same time, the number of factories in Detroit, the 
county housing Detroit (Wayne County), and the study area cities just to the 
north of Detroit was declining, while the number of metropolitan area factories 
outside my study area was increasing. (Despite factory loss in the suburban 
cities included in my study area, the number of factories in the region's five 
suburban counties increased from 3,257 to 5,217 between 1970 and 1990.) Thus, 
the representation of blacks in facility neighborhoods was probably lower in the 
six-county metropolitan area than it was in my study area, and the proportion 
of factories sited in white neighborhoods between 1970 and 1990 was probably 
greater in the census-defined metropolitan area than it was in my study area. As 
a result, the regression models reported below are more likely to provide support 
for the simple segregation, racist intent, and economic models than they would 
if I used the Census Bureau's six-county definition. 


Unit of Analysis 


In this study, the unit of analysis is the census tract, except where census tracts 
were merged to examine change over time. Tract merging was necessary because 
census tract boundaries change from one decade to the next. I overcame this 
problem by creating dual-decade data sets for 1970/1980 and 1980/1990, 
preserving tract boundaries and data where possible and merging tract boundaries 
and data when necessary. 

Merging was accomplished as follows. Count variables such as total population 
and total number of families were summed across merged observations, while 
population-weighted means were calculated for variables representing average 
values. For example, if tracts A, B, and C contain 80, 100, and 120 families 
respectively, and average family income in these tracts equals $20,000, $30,000, 
and $40,000 respectively, the number of families in the merged super-tract 
would equal 80 + 100 + 120, or 300, and the population-weighted mean family 
income in the merged super-tract would equal [($20,000*80) + ($30,000*100) + 
($40,000*120)]/300. Variables representing average rather than median values 
were used in the merged data sets because the population-weighted mean of 
merged median values is not an accurate indicator of the true median value of 
the merged population. 
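
The merging rule just described can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the procedure actually used to build the data sets; the function name `merge_tracts` and the field names are hypothetical, and the three tracts are the ones from the worked example above.

```python
def merge_tracts(tracts):
    """Merge tract records into a single super-tract.

    Each record is a dict with 'families' (a count variable) and
    'avg_family_income' (a variable representing an average value).
    Counts are summed; averages are weighted by the family count.
    Field names are hypothetical, chosen for this illustration only.
    """
    total_families = sum(t["families"] for t in tracts)
    weighted_income = sum(t["families"] * t["avg_family_income"] for t in tracts)
    return {
        "families": total_families,
        "avg_family_income": weighted_income / total_families,
    }


# Tracts A, B, and C from the example in the text.
tracts = [
    {"families": 80, "avg_family_income": 20_000},
    {"families": 100, "avg_family_income": 30_000},
    {"families": 120, "avg_family_income": 40_000},
]

merged = merge_tracts(tracts)
print(merged["families"])                     # 300
print(round(merged["avg_family_income"], 2))  # 31333.33
```

The weighted mean ($31,333.33) is higher than the simple mean of the three tract averages ($30,000) because the highest-income tract contains the most families.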


Measuring Facility Presence 


Since environmental inequality researchers do not know the actual health risk 
associated with specific environmental hazards or the residential location and 


982 / Social Forces 83:3, March 2005 


demographic characteristics of every individual in their study area, they typically 
(a) aggregate the number of hazards or quantity of releases that exist in some 
specified census unit to create variables such as the pounds of TRI emissions in 
a census tract or the number of hazardous waste sites in a zip code (Bowen et 
al. 1995) or (b) use a dummy variable to record the presence or absence of some 
type of environmental hazard in specific analysis units (Anderton et al. 1994a, b). 
These variables are then correlated with demographic variables to see whether 
environmental hazard levels vary with characteristics such as percent black and 
median household income. 

Considerable information is lost, however, when environmental hazard 
indicators are constructed in these ways. This is because the actual location of 
each hazard is ignored when we create dummy variable hazard indicators or 
sum together the number of facilities or pounds of releases in each analysis unit. 
As a result, although many manufacturing facilities are located on or near tract 
boundaries, we are forced to assume either that these facilities do not impact 
people residing in adjacent tracts or that they impact every square inch of 
their home tract and adjacent tracts equally. We are also forced to assume that 
manufacturing facilities are evenly distributed within tracts, even though we know 
that they tend to cluster along major transportation routes (Northam 1979). 

Finally, simple summing fails to take tract size into account, even though tract 
size increases with distance from Detroit's central business district. Percent 
black and median household income are also correlated with distance from the 
central business district. Thus, percent black and median household income are 
correlated with tract area, a potentially important determinant of the number 
of facilities in a tract. 

In order to overcome these problems, I created a new set of variables, not 
used in prior environmental inequality research, that take tract size, facility 
location, and the boundary problem into consideration. In order to create these 
new variables, I used geographic information system (GIS) mapping software 
to convert Detroit metropolitan area census tract maps into raster maps. Raster 
maps are rectangular grids typically composed of square cells. I created the new 
variables using grid cells with a 25-meter resolution, that is, cells measuring 
25 meters on each side. 

After converting the census tract maps to grids, I computed the number 
of facilities within a 1/4 kilometer radius of the center of each grid cell and the 
distance, in meters, from the center of each grid cell to its nearest facility. Finally, 
for each of these variables, I took the sum of the values of all the cells in each 
tract and divided this sum by the number of cells in each tract. This gave me 
two new tract-level variables: the average exposure in a tract, or the number of 
facilities within a 1/4 kilometer radius of the average tract cell, and the average 
minimum distance in a tract, or the distance from the average tract cell to its 
nearest facility. 
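
The two tract-level indicators can be illustrated with a minimal Python sketch. It assumes that cell centers and facility locations are already available as planar coordinates in meters and that "within" the radius includes the boundary; the function name `tract_indicators` and the toy coordinates are hypothetical, and the sketch stands in for, rather than reproduces, the GIS workflow described above.

```python
import math


def tract_indicators(cell_centers, facilities, radius_m=250.0):
    """Return (average exposure, average minimum distance) for one tract.

    cell_centers: (x, y) centers, in meters, of the grid cells in the tract.
    facilities:   (x, y) locations, in meters, of all facilities in the study
                  area, including those that lie outside the tract.
    radius_m:     search radius; 250 m corresponds to 1/4 kilometer.
    """
    exposure_sum = 0
    min_distance_sum = 0.0
    for cx, cy in cell_centers:
        distances = [math.hypot(cx - fx, cy - fy) for fx, fy in facilities]
        # Exposure: number of facilities within the radius of this cell.
        exposure_sum += sum(1 for d in distances if d <= radius_m)
        # Minimum distance: distance from this cell to its nearest facility.
        min_distance_sum += min(distances)
    n_cells = len(cell_centers)
    return exposure_sum / n_cells, min_distance_sum / n_cells


# Toy tract with two widely spaced cells so the result can be checked by hand.
cells = [(0.0, 0.0), (300.0, 0.0)]
facilities = [(0.0, 0.0), (600.0, 0.0)]

avg_exposure, avg_min_distance = tract_indicators(cells, facilities)
print(avg_exposure)      # 0.5
print(avg_min_distance)  # 150.0
```

Because distances are measured from each cell to every facility regardless of tract membership, a facility just across a tract boundary contributes to the indicators of both tracts, which is how this construction handles the boundary problem.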

These new indicators take facility location and tract size into account. They 
do not force us to assume that facilities are evenly distributed within tracts, that 
facilities have no effect on people residing in adjacent tracts, or that facilities 
impact every square inch of their host tract and adjacent tracts equally. Thus, these 
new indicators are clearly superior to the facility presence indicators typically used 
by environmental inequality researchers (see Downey 2003 for a more detailed 
discussion of this variable construction technique). 


Analytic Strategy 


As noted earlier, although the four models of environmental inequality make 
predictions about individual and firm-level behavior, data are available to test only 
the neighborhood-level predictions of these models, which I do using ordinary 
least squares (OLS) regression and visual map analysis (the visual inspection of 
maps). I use maps to examine the changing distribution of black neighborhoods 
and factory neighborhoods in the Detroit metropolitan area from 1970 to 1990 
and OLS regression to test causal predictions derived from the explanatory 
models discussed above. 

Because the explanatory models make predictions about the determinants of 
facility siting, racial succession, and income succession, the dependent variables 
in the regression models that follow are changes from one decade to the next 
(1970-1980 and 1980-1990) in percent black, average family income, and facility 
presence. 

I use changes in facility presence rather than actual siting decisions to test 
economic and racist intent model predictions about facility siting decisions 
because I do not have siting decision data and cannot accurately measure siting 
decisions from the data I do possess. Nevertheless, because these explanatory 
models predict that the number of facilities sited in a tract should be negatively 
associated with tract incomes and positively associated with percent black, they 
also predict, ceteris paribus, that changes in facility presence from one decade 
to the next should be negatively associated with tract incomes and positively 
associated with percent black. 

The regression equations take the following form: 


$$\Delta\,\%\text{Black}_{(t,\,t-1)} = B_0 + B_1\,\%\text{Black}_{t-1} + B_2 M_{t-1} + B_3 C_{t-1} \qquad (1)$$

$$\Delta\,\text{Average Income}_{(t,\,t-1)} = B_0 + B_1\,\text{Average Income}_{t-1} + B_2 M_{t-1} + B_3 C_{t-1} \qquad (2)$$

$$\Delta\,\text{Facility Presence}_{(t,\,t-1)} = B_0 + B_1\,\text{Facility Presence}_{t-1} + B_2 M_{t-1} + B_3 C_{t-1} \qquad (3)$$

where $M_{t-1}$ represents a vector of independent variables used to test the 
four models of environmental inequality and $C_{t-1}$ represents a vector of control 
variables drawn from the literatures on environmental inequality, neighborhood 
succession, and urban sociology. 
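
To make the change-score form of these regressions concrete, the following Python sketch regresses a decade-to-decade change in percent black on the prior-decade level and one test variable. It is an illustration only: the five tracts are made-up numbers constructed so that the true coefficients are known, the variable names are hypothetical, and the small `ols` helper (normal equations solved by Gauss-Jordan elimination) is a stand-in for whatever statistical software was actually used.

```python
def ols(X, y):
    """Ordinary least squares via the normal equations (X'X)b = X'y."""
    k = len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [
        [sum(row[i] * row[j] for row in X) for j in range(k)]
        + [sum(row[i] * yi for row, yi in zip(X, y))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]


# Made-up tracts: (percent black at t-1, average exposure at t-1, percent black at t).
# Built so that the change equals 1.0 + 0.1 * percent black - 2.0 * exposure.
tracts = [
    (10.0, 0.5, 11.0),
    (20.0, 1.0, 21.0),
    (30.0, 0.2, 33.6),
    (40.0, 2.0, 41.0),
    (50.0, 0.0, 56.0),
]

# Dependent variable: change in percent black from t-1 to t.
y = [pb_t - pb_t1 for pb_t1, _, pb_t in tracts]
# Regressors: intercept, lagged percent black, lagged test variable.
X = [[1.0, pb_t1, exposure_t1] for pb_t1, exposure_t1, _ in tracts]

b0, b1, b2 = ols(X, y)
print(b0, b1, b2)  # approximately 1.0, 0.1, -2.0
```

In this toy setup the negative coefficient on exposure would count against a prediction that facility presence is positively associated with percent black change; control variables and significance tests are omitted for brevity.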

Table 2 recategorizes the causal predictions listed in Table 1 according to 
the dependent variables used in the regression models, thereby providing a link 













Table 2. Regression Model Predictions 


Dependent Variable: Facility Presence Change 

Independent variable concepts     Independent test variables      Explanatory model      Expected relationship 
1. Percent black                  Percent black                   Racist intent model    Positive 
                                  Percent black difference        Economic model         Positive 
2. Neighborhood incomes           Average family income logged    Economic model         Negative 
3. Neighborhood property values   Average property value          Economic model         Negative 


Dependent Variable: Income Change 

Independent variable concepts     Independent test variables      Explanatory model      Expected relationship 
1. Facility presence              Average exposure                Economic               Negative 
                                  Average exposure change 
2. Neighborhood property values   Average property value          Economic               Positive 
                                  Average property value change 
2. Neighborhood rents             Average gross rent              Economic               Positive 
                                  Average gross rent change 


Dependent Variable: Percent Black Change 

Independent variable concepts     Independent test variables      Explanatory model      Expected relationship 
1. Facility presence              Average exposure                Economic               Positive 
                                  Average exposure change         Simple segregation     Positive 
                                                                  Racial succession      Not positive 
2. Neighborhood property values   Average property value          Economic               Negative 
                                  Average property value change 
2. Neighborhood rents             Average gross rent              Economic               Negative 
                                  Average gross rent change 
3. Adjacency                      Black ghetto                    Racial succession      - 
                                  Black ring 2                                           Negative 
                                  Black ring 3                                           Negative 


The Unintended Significance of Race / 985 


between the explanatory models discussed above and the regression models 
presented below. It lists the independent variable concepts from Table 1 and their 
corresponding test variable indicators and tells us which explanatory models are 
tested by each indicator as well as the expected relationship between each test 
variable and its associated dependent variable. 

For example, the top third of Table 2 lists the predictions that are tested when 
facility presence change is the dependent variable. Columns 1, 3, and 4 tell us that 
this dependent variable is used to test the prediction of the racist intent and 
economic models that the percentage of blacks in a neighborhood is positively 
associated with changes over time in facility presence and the prediction of the 
economic model that neighborhood incomes and property values are negatively 
associated with changes over time in facility presence. Column 2 tells us that 
the first prediction is tested using two test variable indicators, percent black and 
percent black difference (the change in percent black from time one to time two), 
and that the second prediction is tested using average family income logged and 
average property value. 

Thus, the racist intent model is supported if percent black and percent black 
difference are positively associated with the dependent variable, and the economic 
model is supported if percent black and percent black difference are positively 
associated with the dependent variable and average family income logged and 
average property value are negatively associated with the dependent variable. 

The number to the left of each independent variable concept in Table 2 
indicates the order in which the independent test variables are inserted into the 
regression equations. In the facility presence change regression models, percent 
black and percent black difference are the first test variables inserted into the 
equation, followed by average family income logged then average property value. 
The control variables are inserted last. The logic behind the ordering differs across 
regression equations and is, therefore, discussed separately for each regression 
equation in the results section. Test and control variable definitions, as well as 
the logic behind the inclusion of specific control variables, are also discussed 
separately for each regression equation in the results section. 
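
The insertion order described above amounts to fitting a sequence of nested OLS models and watching how the earlier coefficients move as later test variables enter. The following is a minimal sketch of that procedure, not the code used for this article. The tract values are simulated, the helper names `solve` and `fit_ols` are illustrative, and the simulated relationships (facility change driven by income only, with percent black correlated with income) are assumptions chosen so that a coefficient visibly washes out.

```python
import random

def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x

def fit_ols(y, columns):
    """Return OLS coefficients (intercept first) from the normal equations X'X b = X'y."""
    rows = [[1.0] + list(vals) for vals in zip(*columns)]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    return solve(xtx, xty)

rng = random.Random(0)
n = 500  # hypothetical number of tracts

# Simulated, centered tract variables (illustrative assumptions only).
income_logged = [rng.gauss(0.0, 0.5) for _ in range(n)]
percent_black = [-20.0 * inc + rng.gauss(0.0, 5.0) for inc in income_logged]
property_value = [3.0 * inc + rng.gauss(0.0, 1.0) for inc in income_logged]
facility_change = [-2.0 * inc + rng.gauss(0.0, 0.5) for inc in income_logged]

# Test variables in the order they are inserted into the equation.
insertion_order = [
    ("percent_black", percent_black),
    ("income_logged", income_logged),
    ("property_value", property_value),
]

coef_by_step = []
for step in range(1, len(insertion_order) + 1):
    entered = insertion_order[:step]
    beta = fit_ols(facility_change, [col for _, col in entered])
    names = ["intercept"] + [name for name, _ in entered]
    coef_by_step.append(dict(zip(names, beta)))

for step, coefs in enumerate(coef_by_step, start=1):
    print(step, {name: round(value, 3) for name, value in coefs.items()})
```

In this simulation the percent black coefficient is positive in the first step and falls toward zero once income enters, which is the "wash out or weaken" pattern the explanatory models are tested against.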

Table 2 shows that I use a different set of test variables in each regression 
equation. I do this because the explanatory models, taken as a whole, do not make 
the same predictions about each dependent variable. For example, although the 
racial succession model predicts that changes in percent black from one decade 
to the next will be greatest in neighborhoods immediately adjacent to already 
established black neighborhoods, none of the explanatory models predict that 
changes in average family income or facility presence will be affected by adjacency 
to already established black neighborhoods. Thus, it makes little sense to use the 
same test variables in each equation. 

Finally, these regression models are potentially problematic because they 
contain temporally lagged endogenous variables (the dependent variable at time 
t-1) which, in the presence of temporal autocorrelation, are correlated with the 
error term, resulting in inconsistent and generally inflated parameter estimates 
(Ranger-Moore et al. 1995). Spatial autocorrelation is also a potential problem 
in these regression models. In order to determine whether spatial and temporal 
autocorrelation affected regression results, I ran models controlling for both. Since 
substantive results are nearly identical regardless of whether or not I control for 
these potential sources of bias, I present nonadjusted OLS results, with which 
readers are more familiar, in this article. (I note in the text the one case where 
results differed after controlling for spatial and temporal autocorrelation.) 
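
The text does not say how spatial autocorrelation was assessed or controlled. One common diagnostic, offered here only as an illustration and not as the procedure used in this article, is Moran's I computed on regression residuals with binary contiguity weights. In the sketch below the function name `morans_i`, the four-tract chain, and the residual values are all hypothetical.

```python
def morans_i(values, neighbors):
    """Moran's I with binary contiguity weights.

    values: list of residuals, one per tract.
    neighbors: dict mapping each tract index to a list of adjacent tract
        indices; adjacency is assumed to be symmetric.
    """
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    # Sum of cross products over every ordered pair of adjacent tracts.
    cross = sum(dev[i] * dev[j] for i, adj in neighbors.items() for j in adj)
    total_weight = sum(len(adj) for adj in neighbors.values())
    return (n / total_weight) * cross / sum(d * d for d in dev)

# Hypothetical chain of four tracts: 0 - 1 - 2 - 3.
chain = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}

clustered = morans_i([1.0, 2.0, 3.0, 4.0], chain)      # similar values sit side by side
alternating = morans_i([1.0, -1.0, 1.0, -1.0], chain)  # neighbors always differ
print(clustered, alternating)
```

Values near zero suggest little spatial patterning in the residuals; clearly positive values indicate that neighboring tracts have similar residuals, which is the situation in which spatially adjusted models are worth comparing with unadjusted OLS.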


Results 


Table 3a correlates percent black and median family income with average 
minimum distance and average exposure in 1970, 1980, and 1990. I use median 
family income rather than average family income in Table 3a because 
single-decade census tract data does not need to be merged and median income data 
is generally considered to be preferable to average income data. Nevertheless, 
correlation results are nearly identical regardless of whether I use median or 
average income data. 

Table 3a shows that in 1970, 1980, and 1990, average minimum distance and 
average exposure were both correlated with median family income and percent 
black in the expected direction. As median family income increased, average 
exposure decreased and average minimum distance—the distance from the 
average tract cell to its nearest facility—increased. As percent black increased, 
average exposure increased and average minimum distance decreased. 
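
For readers unfamiliar with the statistic reported in Table 3a, the following is a minimal sketch of Kendall's tau-b, which counts concordant and discordant pairs of tracts and corrects for ties. The function name and the five-tract income and exposure values are hypothetical and are not drawn from the Detroit data.

```python
from itertools import combinations
from math import sqrt

def kendall_tau_b(x, y):
    """Kendall's tau-b for two equal-length sequences, using all pairs (O(n^2))."""
    concordant = discordant = tied_x_only = tied_y_only = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied on both variables: counted nowhere
        elif dx == 0:
            tied_x_only += 1
        elif dy == 0:
            tied_y_only += 1
        elif dx * dy > 0:
            concordant += 1
        else:
            discordant += 1
    untied = concordant + discordant
    denominator = sqrt((untied + tied_x_only) * (untied + tied_y_only))
    return (concordant - discordant) / denominator

# Hypothetical tracts: income in thousands of dollars and an exposure score.
median_income = [20, 25, 30, 35, 40]
average_exposure = [0.9, 0.7, 0.8, 0.3, 0.1]

tau_income_exposure = kendall_tau_b(median_income, average_exposure)
tau_with_ties = kendall_tau_b([1, 2, 2, 3], [1, 2, 3, 3])
print(tau_income_exposure, tau_with_ties)
```

A negative coefficient, as in the first hypothetical example, means that tracts with higher incomes tend to rank lower on exposure, which is the direction reported for income and average exposure in Table 3a.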

Table 3a also shows that the correlations between median family income, 
percent black, and the two facility presence indicators were considerably stronger 
in 1970 than in 1990. For example, the significant correlations between average 
minimum distance and percent black, which equaled -.15 in 1990, equaled -.29 
in 1970, and the significant correlations between average exposure and percent 
black, which equaled .07 in 1990, equaled .17 in 1970.* 

These results demonstrate that race- and income-based environmental 
inequality both existed in the Detroit metropolitan area throughout the period 
under study. But they also demonstrate that by 1990, race-based environmental 
inequality in the Detroit metropolitan area was relatively weak (as indicated 
by the weak correlations between percent black on the one hand and average 
exposure and average minimum distance on the other), and that this was so 
despite the fact that in 1990 income-based environmental inequality was still 
relatively strong and black families were still considerably poorer than white 
families (see Table 3b). 

The maps and regression analyses that follow help us determine why black 
environmental inequality was so weak in 1990. 

Table 3(a). Kendall Tau-b Correlation Coefficients for Average Exposure, 
Average Minimum Distance, Percent Black, and Median Family 
Income, 1970-1990 


Median Family Income 

                            1990          1980          1970 
Average minimum distance    [illegible]   [illegible]   [illegible] 
Average exposure            -.171***      -.239**       [illegible] 


Percent Black 

                            1990          1980          1970 
Average minimum distance    -.152         -.187***      -.29 
Average exposure            .074*         .079          .17 


[Significance legend and some cell entries are illegible in the source scan.] 





Table 3(b). Mean Family Income and Family Poverty Rates for Blacks and 
Whites in the Detroit Metropolitan Area, 1990* 


                                    Black      White      Black/White Ratio 
Mean family income                  $26,059    $36,969    .70 
Percentage of families in poverty   31         8          3.87 


*I use mean rather than median family income because it is not possible to calculate a single 
median family income for Detroit and the cities immediately surrounding Detroit without access to 
individual census records. 


THE CHANGING DISTRIBUTION OF BLACK NEIGHBORHOODS AND 
FACTORY NEIGHBORHOODS, 1970-1990 


Figure 1 examines tract-level changes in facility presence and the location of 
black neighborhoods from 1970 to 1980. It tests the prediction of the economic 
and racist intent models that changes in facility presence should be greater in 
black neighborhoods than white neighborhoods, the prediction of the economic 
and simple segregation models that blacks should be drawn disproportionately 
into facility presence tracts, and the prediction of the racial succession model 
that black neighborhoods and ghettos expand at their edges rather than into or 
along manufacturing neighborhoods and corridors. 

In order to test the first prediction, the census tracts in maps (a) and (b) of 
Figure 1 are categorized according to changes in the total number of facilities in 
each tract from 1970 to 1980. The lightest-colored tracts in maps (a) and (b) 
represent tracts that lost between 1 and 15 facilities during the 1970s. However, 
only 3 of these 119 tracts lost more than 10 facilities, and only 9 lost more than 
4 facilities. The tract that lost 15 facilities was in an all-white neighborhood, and 
the 2 tracts that lost between 10 and 15 facilities were in predominantly black 
neighborhoods. Finally, each dot in map (b) represents the residential location 
of 2,000 blacks in 1970. 

Figure 1. Tract-Level Changes in Facility Presence and the Location of 
Black Neighborhoods, 1970-1980 

Maps (a) and (b) contradict the prediction that changes in facility presence 
will be greater in black neighborhoods than in white neighborhoods. Although 
several black neighborhoods gained manufacturing facilities in the 1970s, it is 
apparent that in most black tracts, the number of facilities declined or remained 
constant. It is also apparent that facility increases occurred primarily outside 
Detroit's black ghetto. (The neighborhoods outside Detroit's black ghetto were 
overwhelmingly white in 1970 and 1980.) 

Maps (c) and (d) test the prediction of the economic and simple segregation 
models that blacks will be drawn disproportionately into facility presence tracts 
and the prediction of the racial succession model that black neighborhoods 
and ghettos will expand at their edges rather than into or along manufacturing 
neighborhoods and corridors. In order to test these predictions, the census tracts 
in maps (c) and (d) are categorized according to average exposure.* (In the 
following section, I explain why average exposure is used to test some explanatory 
model predictions but not others.) Each dot in map (c) indicates the residential 
location of 2,000 blacks in 1970, and each dot in map (d) indicates a 2,000-person 
increase from 1970 to 1980 in the number of blacks residing in a tract. (While 
it would be ideal to present an "average exposure" map without any dots, space 
considerations prevent me from doing so.) 

Maps (c) and (d) contradict the prediction that blacks will be drawn 
disproportionately into facility presence tracts. While blacks clearly lived in 
many high-exposure tracts in 1970, blacks do not appear to have been drawn 
disproportionately into the metropolitan area's other high-exposure tracts during 
the 1970s. In fact, virtually no blacks moved into the band of medium- to 
high-exposure tracts located just to the north of the metropolitan area's 1970 black 
ghetto or into the pocket of low- to medium-exposure tracts to the southwest 
of the region's 1970 black ghetto. Instead, Detroit's black ghetto appears to have 
expanded at its northern, northeastern, and northwestern edges, as predicted by 
the racial succession model. 

The same basic patterns hold in Figure 2, though changes in the total number 
of facilities in a tract appear to be more evenly distributed across black and 
white tracts in the 1980s than the 1970s. Thus, Figures 1 and 2 both contradict 
the prediction of the economic and racist intent models that changes in facility 
presence will be greater in black neighborhoods than in white neighborhoods 
and the prediction of the economic and simple segregation models that blacks 
will be drawn disproportionately into facility presence tracts. Conversely, Figures 
1 and 2 both support the prediction of the racial succession model that black 
neighborhoods and ghettos expand at their edges rather than into or along 
manufacturing neighborhoods and corridors. (Maps not shown here demonstrate 
that the same basic spatial patterns held from 1920 to 1970.) Thus, the racial 
succession model is the only explanatory model supported by the evidence 
presented in this section. 

In order to confirm this conclusion quantitatively and test additional 
explanatory model predictions, I now turn to regression analysis. 

Figure 2. Tract-Level Changes in Facility Presence and the Location of 
Black Neighborhoods, 1980-1990 


FACILITY PRESENCE 


The dependent variable in the first set of regression analyses is the change from 
one decade to the next in the total number of facilities in a tract (the total number 
of facilities in a tract at time two minus the total number of facilities in a tract 
at time one). I use this facility presence indicator rather than changes in average 
exposure or average minimum distance because average exposure and average 
minimum distance are affected by the presence and distribution of facilities in 
adjacent tracts. Thus, in any given tract, the values of these variables and changes 
in their values from one decade to the next are determined by the values of 
test and control variables in multiple tracts. Moreover, total facility change is a 
better proxy for siting decisions than are changes in average exposure or average 
minimum distance. Nevertheless, I obtain essentially the same results no matter 
which facility presence indicator I use, and this is true for all the regression 
models included in this article. 

The results are listed in Table 4 (see Table 2 for predicted relationships). The 
first two test variables, percent black and percent black difference (the change in 
percent black from one decade to the next), test the racist intent model, which 
holds that facilities are more likely to be sited in black neighborhoods than in 
white neighborhoods because decision makers and their constituents are racially 
prejudiced. Thus, percent black and percent black difference should be positively 
associated with the dependent variable. 

Average family income logged tests the prediction of the economic model that 
facilities will be sited in neighborhoods with low incomes. Thus, this variable 
should be negatively associated with the dependent variable. Moreover, because 
the economic model holds that racially inequitable siting decisions are the result 
of black-white income inequality rather than overt racial bias, the inclusion of 
this variable in the model should wash out or weaken any positive association 
that exists between the dependent variable and the first two test variables. 

Finally, average property value tests the prediction of the economic model that 
facilities will be sited in low-income neighborhoods because property values are 
low in these neighborhoods and firms site facilities where property values are low. 
Thus, average property value should be negatively associated with the dependent 
variable, and its inclusion in the regression equation should wash out or weaken 
any negative association that exists between average family income logged and 
the dependent variable. 

The control variables are listed below the test variables. The first control 
variable is the dependent variable at time one. Detroit is a dummy variable 
set equal to one if a tract is located in the city of Detroit and zero otherwise. 
It is included because central-city location has become an important negative 
determinant of manufacturing facility siting decisions in recent decades (Darden 
et al. 1987; Farley, Danziger, and Holzer 2000; Sugrue 1996). Facility density 
(the number of manufacturing facilities per km²) and population density (the 
number of people per km²) are included because manufacturing facilities are 
often sited near other manufacturing facilities but in areas with plenty of open 
space (Northam 1979; Sinclair 1972).’ The percentage of housing units that are 
owner occupied and the percentage of people five years old and older living in the 
same house as five years ago are included because it is hypothesized that residential 
stability is negatively associated with facility presence change.* The percentage of 
workers employed in manufacturing is included because it is hypothesized that 
firms attempt to site factories near potential employees (Anderton et al. 1994a, 
b), and railroad distance logged is included because Detroit metropolitan area 
manufacturing facilities tend to be located near the region’s railroad lines (Sinclair 
1972).° Finally, all the independent variables except percent black difference are 
measured solely at time one. 

The results listed in Table 4 contradict both the racist intent model and the 
economic model: regardless of whether the control variables are included in the 
regression model, percent black and percent black difference are never positively 
associated with the dependent variable, and average family income logged and 
average property value are never negatively associated with the dependent 
variable. 

One possible explanation for these results is that low-income tracts, low 
property value tracts, and tracts with high percentages of blacks were already 
saturated with manufacturing facilities in 1970 and 1980, making it impossible 
for new facilities to be sited in them. However, this was not the case. Data not 
reported here show that in the Detroit metropolitan area in 1970, 1980, and 1990, 
there were many low- and medium-income tracts, low and medium property 
value tracts, and tracts with high percentages of blacks with relatively few or no 
manufacturing facilities. 


ECONOMIC SUCCESSION 


The dependent variable in the second set of regression analyses is the change 
in average family income from one decade to the next (average family income at 
time two minus average family income at time one, with 1970 and 1980 average 
family income converted to 1990 dollars). All the independent variables except 
for average exposure change, average property value change, and average gross rent 
change are measured solely at time one. 
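
The construction of this dependent variable can be sketched as follows. The article does not state which price index was used for the conversion to 1990 dollars; the sketch assumes the CPI-U annual averages (1982-84 = 100) of 38.8 for 1970, 82.4 for 1980, and 130.7 for 1990, which should be checked against the official series before reuse. The function names and the example incomes are hypothetical.

```python
# Assumed deflator: CPI-U annual averages, 1982-84 = 100 (an assumption, not from the article).
CPI = {1970: 38.8, 1980: 82.4, 1990: 130.7}

def to_1990_dollars(amount, year, cpi=CPI):
    """Convert a nominal dollar amount from `year` into 1990 dollars."""
    return amount * cpi[1990] / cpi[year]

def income_change(income_t1, year_t1, income_t2, year_t2):
    """Average family income at time two minus time one, both in 1990 dollars."""
    return to_1990_dollars(income_t2, year_t2) - to_1990_dollars(income_t1, year_t1)

# Hypothetical tract: nominal average family income of $12,000 in 1970 and $24,000 in 1980.
change_1970s = income_change(12_000, 1970, 24_000, 1980)
print(round(change_1970s, 2))
```

The example illustrates why the conversion matters: a tract whose nominal income doubled during the 1970s can still show a decline in real income once both years are expressed in 1990 dollars.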

The results are listed in Table 5 (see Table 2 for the predicted relationships). 
All six test variables test the economic model, which predicts that neighborhood 
incomes will decline over time, relative to overall area incomes, in facility 
neighborhoods and in neighborhoods with low property values and low rents. 
Specifically, the model holds that average exposure and changes in average 
exposure from one decade to the next (average exposure change) should be 
negatively associated with neighborhood income change because facility presence 
lowers neighborhood property values and rents, leading wealthier individuals 
to flee facility neighborhoods and poorer ones to move in. Thus, the dependent 
variable should be negatively associated with the two facility presence indicators 
and positively associated with average property value, average property value 
change, average gross rent, and average gross rent change. Moreover, including 
the property value and rent variables in the equation should wash out or weaken 
any negative association that exists between the facility presence variables and 
neighborhood income change. 

The control variables are listed below the test variables. The first control 
variable is the dependent variable at time one. Detroit is a dummy variable set 
equal to one if a tract is located in the city of Detroit and zero otherwise. It 
is included because inner-city incomes have declined dramatically relative to 
suburban incomes in many major metropolitan areas since 1970 (Massey and 
Denton 1993; Sugrue 1996; Wilson 1987). All the remaining control variables 
except the percentage of workers employed in manufacturing and railroad 
distance logged are drawn from the literature on economic succession. Economic 
succession can be defined as the "replacement of nonpoor residents by poor 
residents within the boundaries of a given neighborhood" (modification of 
White 1984:165) and is generally viewed as a function of household mobility, 
neighborhood racial composition, the age and condition of the housing stock, 
and the proximity of nonpoor neighborhoods to poor neighborhoods, which 
can be operationalized as the average family income in the poorest adjacent tract 
(Phillips 1981; Vandell 1981). 

The percentage of workers employed in manufacturing is included because 
it is hypothesized that economic succession occurred disproportionately in 
working-class neighborhoods due in large part to the declining fortunes of 
manufacturing workers in the 1970s and 1980s (Wilson 1987). Finally, railroad 
distance logged is included because it is hypothesized that economic succession 
occurred first near Detroit's railroad lines, where incomes tend to be relatively 
low and services poorly developed (Sinclair 1972), and over time progressively 
further from these lines. 

The results listed in Table 5 show that neither average exposure nor average 
exposure change is significantly associated with the dependent variable in any 
of the regression models. However, average property value and average property 
value change are both positively associated with the dependent variable in both 
decades (models 2, 3, 5, and 6). 

Thus, the economic model is only partially supported. Its prediction that 
average property value and average property value change are positively associated 
with income change is supported, but its predictions that facility presence and 
facility presence change are negatively associated with income change and that 
average gross rent and average gross rent change are positively associated with 
income change are not supported. Moreover, results not reported here show that 
neither average exposure nor average exposure change is a significant predictor 
of average property value change or average rent change when these latter two 
variables are inserted into the regression equation as dependent variables. Thus, 
the economic model prediction that facility presence has a negative impact on 
property values and rents is also contradicted. 

One might argue that facility presence is not negatively associated with average 
income change, average property value change, or average gross rent change 
because facility presence tracts had already experienced severe declines in incomes, 
property values, and rents prior to 1970. Perhaps incomes, property values, and 
rents in these tracts had declined as much as they could decline by 1970. 

Analyses not reported here demonstrate that this was not the case. These 
analyses compare the incomes, property values, and rents in the Detroit 
metropolitan area's highest facility presence tracts to the incomes, property 
values, and rents in the region's other tracts and find that in 1970, 1980, and 1990, 
incomes, property values, and rents were relatively high in many of the region's 
highest facility presence tracts. The same pattern of results is also found when 
we compare the highest facility presence tracts in the city of Detroit with other 
“city of Detroit” tracts. Thus, we can safely conclude that the economic model 
is only partially supported by the results presented in Table 5. 


RACIAL SUCCESSION 


The dependent variable in the last set of regression analyses is the change in 
percent black from one decade to the next (percent black at time two minus percent 
black at time one). All the independent variables except average exposure change, 
average property value change, and average gross rent change are measured solely 
at time one. The results are listed in Table 6. The first two test variables, average 
exposure and average exposure change, test the economic and simple segregation 
models, which predict that blacks will be drawn disproportionately into facility 
presence tracts. They also test the racial succession model, which predicts that 
blacks will not be drawn disproportionately into facility presence tracts. Thus, 
the economic and simple segregation models are supported if average exposure 
and average exposure change are positively associated with the dependent variable, 
and the racial succession model is supported if these two test variables are not 
positively associated with the dependent variable." 

The economic model, which predicts that blacks are drawn to facility 
neighborhoods because facility neighborhoods have low property values and 
low rents, is also tested by the following four variables: average property value, 
average property value change, average gross rent, and average gross rent change. If 
the economic model is correct, these four variables should be negatively associated 
with the dependent variable and their inclusion in the regression equation should 
wash out or weaken any positive association that exists between the facility 
presence variables and the dependent variable. 

The last three test variables, black ghetto, black ring 2, and black ring 3, are a 
set of dummy variables created by designating census tracts as belonging to a 
black ghetto, a ring of tracts immediately adjacent to the black ghetto (ring 1), 
a ring of tracts immediately adjacent to ring 1 (ring 2), and the remainder of 
the tracts in the region (ring 3). These variables test the prediction of the racial 
succession model that racial succession occurs primarily at the edges of the black 
ghetto. Thus, the model is supported if the change in percent black from one 
decade to the next is lower in rings 2 and 3 than it is in ring 1. In other words, 
the model is supported if rings 2 and 3 are both negatively associated with the 
dependent variable. 

A census tract is defined as belonging to a black ghetto if at least 15% of 
its residents are black. This threshold was chosen based on Emerson, Chai, and 
Yancey's (2001) finding that whites are unlikely to buy a house in a neighborhood 
that is more than 15% black, suggesting that whites consider such neighborhoods 
to be undesirable because they view them as either being black neighborhoods 
or being on the road to becoming black neighborhoods. Because some readers 
might find this threshold too low, I ran a set of models using a 30% black ghetto 
threshold. These models are not reported because their results are substantively 
identical to the results reported here. 
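
The ghetto and ring designations described above can be sketched as a breadth-first expansion outward from the tracts that meet the threshold. This is an illustration under stated assumptions, not the procedure actually used: it assumes that tract adjacency is available as a symmetric lookup table, and the tract identifiers, percentages, and function names are hypothetical. Ring 1 serves as the omitted reference category, consistent with the comparison of rings 2 and 3 against ring 1.

```python
def classify_tracts(percent_black, adjacency, threshold=15.0):
    """Label each tract 'ghetto', 'ring1', 'ring2', or 'ring3'.

    percent_black: dict mapping tract id -> percent black at time one.
    adjacency: dict mapping tract id -> iterable of adjacent tract ids.
    """
    labels = {t: "ghetto" for t, pct in percent_black.items() if pct >= threshold}
    frontier = set(labels)
    for ring in ("ring1", "ring2"):
        # Unlabeled tracts touching the current frontier form the next ring.
        next_frontier = {
            neighbor
            for tract in frontier
            for neighbor in adjacency.get(tract, ())
            if neighbor not in labels
        }
        for tract in next_frontier:
            labels[tract] = ring
        frontier = next_frontier
    for tract in percent_black:
        labels.setdefault(tract, "ring3")  # remainder of the region
    return labels

def ring_dummies(label):
    """Dummy variables with ring 1 as the omitted reference category."""
    return {
        "black_ghetto": int(label == "ghetto"),
        "black_ring_2": int(label == "ring2"),
        "black_ring_3": int(label == "ring3"),
    }

# Hypothetical chain of five tracts: A - B - C - D - E.
percent_black = {"A": 60.0, "B": 5.0, "C": 2.0, "D": 1.0, "E": 0.0}
adjacency = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C", "E"], "E": ["D"]}

labels = classify_tracts(percent_black, adjacency)
print(labels)
print(ring_dummies(labels["C"]))
```

Rerunning `classify_tracts` with `threshold=30.0` corresponds to the alternative ghetto definition mentioned above.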

The control variables are listed below the test variables. The first control 
variable is the dependent variable at time one. Detroit is a dummy variable set 
equal to one if a tract is located in the city of Detroit and zero otherwise. It is 
included because the boundary between Detroit and its suburbs has played an 
important role in shaping residential segregation in the region (Darden et al. 
1987; Sugrue 1996). All the remaining control variables except the percentage 
of workers employed in manufacturing and railroad distance logged are drawn 
from the literature on racial succession. Racial succession is generally viewed as 
a function of white geographic mobility (Steinnes 1977), the age and condition 
of the housing stock (Wilson 1983), and the proximity of white neighborhoods 
to nonwhite neighborhoods (Denton and Massey 1993; Steinnes 1977). The 
percentage of workers employed in manufacturing is included because it is 
hypothesized that racial succession is less likely in working-class neighborhoods 
(Sugrue 1996), and railroad distance logged is included because railroads have 
had a pronounced effect on residential development and succession in the Detroit 
metropolitan area (Sinclair 1972). 

The results listed in Table 6 show that neither average exposure nor average 
exposure change is significantly associated with the dependent variable in either 
decade. Thus, the prediction of the simple segregation and economic models that 
blacks are drawn disproportionately into facility presence tracts is contradicted, 
while the prediction of the racial succession model that blacks are not drawn 
disproportionately into facility presence tracts is supported. 

Of course, if high facility presence tracts were predominantly black in 1970 
and 1980, then the percentage of people in these tracts who were black could 
not have increased very much from 1970 to 1980 or from 1980 to 1990, making 
it highly unlikely that facility presence would be positively associated with the 
dependent variable in either decade. However, data not reported here demonstrate 
that in 1970, 1980, and 1990 many high facility presence tracts in the Detroit 
metropolitan area had few or no blacks. Moreover, as late as 1980, many high 
facility presence tracts within the city of Detroit had few or no blacks. Thus, we 
can safely conclude that the prediction of the simple segregation and economic 
models that blacks are drawn disproportionately into facility presence tracts is 
contradicted, while the prediction of the racial succession model that blacks are 
not drawn disproportionately into facility presence tracts is supported. 

The remaining models include all nine test variable predictors. I do not present 
the intermediate models that include the facility presence, property value, and 
rent variables, but not the black ghetto and ring variables, for two reasons. First, 
because the facility presence variables are not significantly associated with the 
dependent variable in models 1 and 4, it is not necessary to determine whether 
including the property value and rent variables in the regression equation weakens 
the association (that does not exist) between the facility presence variables and 
the dependent variable. Second, the substantive interpretations of the property 
value and rent coefficients are identical regardless of whether the black ghetto 
and ring variables are inserted into the equation. 

Model 2 includes all the test variable predictors but only one control variable. 
In this model, the 1970/1980 dependent variable is negatively associated with 
average property value, average property value change, and black rings 2 and 3 
and is positively associated with average gross rent, average gross rent change, 
and black ghetto. Results are only slightly different when the remaining control 
variables are inserted into the equation in model 3: with the remaining control 
variables included, average gross rent change and black ghetto are no longer 
statistically significant. 

In models 5 and 6, on the other hand, the only significant test variable 
coefficients are the coefficients for the black ghetto and ring variables. Black 
ghetto is positively associated with the 1980/1990 dependent variable in both 
models, and black rings 2 and 3 are negatively associated with the 1980/1990 
dependent variable in both models.” 

Thus, the racial succession model is fully supported by the results presented 
in Table 6: in neither decade were changes in percent black affected by facility 
presence levels and in both decades changes in percent black were higher in ring 
1 tracts than in ring 2 or ring 3 tracts. 

The economic model, on the other hand, is only weakly supported by the 
results presented in Table 6. Its prediction that average property value and average 
property value change are negatively associated with the dependent variable is 
supported in the 1970s but not the 1980s. Its prediction that average gross rent 
and average gross rent change are negatively associated with the dependent 
variable is contradicted in both decades. Moreover, as noted above, blacks were 
not drawn disproportionately into facility presence tracts as both the economic 
and simple segregation models predict should occur. 


Discussion 


The racial succession model is the only model of environmental inequality fully 
supported by the regression and map analyses reported above. Moreover, it is the 
only explanatory model tested in this article that receives more than weak support 
from these analyses. Consistent with its expectation that black ghettos expand at 
their edges rather than along urban manufacturing corridors, changes in percent 
black from one decade to the next were higher in tracts immediately adjacent 
to Detroit’s black ghetto than they were in other nonghetto tracts. But contrary 
to the expectations of the economic and simple segregation models, blacks were 
not drawn disproportionately into manufacturing facility neighborhoods, and 
contrary to the expectations of the economic and racist intent models, changes
in facility presence from one decade to the next were not associated in the 
expected direction with neighborhood racial status, neighborhood income levels, 
or neighborhood property values. Moreover, evidence not reported here shows 
that from 1920 to 1970 Detroit's black ghetto also expanded at its edges rather 
than along the city's manufacturing corridors. 

Thus, the 1990 distribution of blacks and whites around Detroit metropolitan 
area manufacturing facilities was not the result of racist siting practices or black/ 
white income inequality. This distribution was shaped, however, by residential
segregation, which limited blacks’ residential options to housing in or at the edges 
of already established black neighborhoods, thereby preventing them from being 
drawn disproportionately into manufacturing neighborhoods and corridors that 
lay far beyond the boundaries of Detroit’s black ghetto. 

As others have documented (Darden et al. 1987; Farley, Danziger, and Holzer 
2000), residential segregation also directed blacks away from Detroit’s most heavily 
industrialized suburb, Warren, and toward two relatively nonindustrial suburbs, 
Oak Park and Southfield, despite the fact that housing was significantly less 
expensive in Warren than in Oak Park and Southfield and that 30% of Warren’s 
workforce was black in 1970. According to Darden et al. (1987), blacks migrated 
to Southfield and Oak Park rather than to Warren because white resistance to 
black settlement was much stronger in working-class Warren than in middle-class 
Southfield and Oak Park. White resistance was exceptionally strong in Warren, 
they argue, because Warren’s white working-class population had fled Detroit 
in order to avoid living with blacks and was determined to neither flee nor live 
near blacks again. 

Figure 3, which maps out the 1990 distribution of blacks and manufacturing 
facilities in the Detroit metropolitan area, illustrates the degree to which 
residential segregation within suburbia separated the region’s blacks from 
suburban manufacturing activity. While almost all of Southfield’s census tracts 
were at least 10% black and more than half of Southfield’s and Oak Park’s 
census tracts were at least 20% black, only seven of the region’s other suburban 
census tracts were more than 5% black. (The map in Figure 3 does not show 
the 5% cutoff point.) As a result, although blacks surrounded almost all the 
manufacturing facilities located in Detroit in 1990, virtually no blacks lived in 
the heavily industrialized manufacturing corridors located just to the north of 
the eastern two-thirds of Detroit. 

It should be clear, then, that in the Detroit metropolitan area, residential 
segregation has played a dual role in shaping environmental racial inequality. 
It has prevented Detroit’s black ghetto from expanding along the region’s 
manufacturing corridors and allowed blacks to move into some suburbs but not 
others—and as many researchers have shown, residential segregation is a product 
of white racism, not black income levels or black housing preferences (Galster et 
al. 1987; Massey and Denton 1993; Yinger 1991a, b). Thus, racial status and racism
are important determinants of environmental racial inequality in the Detroit 
metropolitan area, but in this case they operated indirectly through the housing 
market and acted to separate blacks from manufacturing facility pollution. 


Conclusion


This conclusion is, of course, consistent with Massey and Denton's argument 
that racial status continues to play a decisive role in shaping racial inequality 
and that housing market racism has a significant impact on racial inequality 
in other institutional arenas. It is also consistent with spatial mismatch theory, 
which predicts that residential segregation and industrial flight have created 
a spatial mismatch between black neighborhoods and manufacturing facility 
employment. 

Nevertheless, this conclusion also highlights an important shortcoming 
common to both segregation theory and spatial mismatch theory: the need 
to recognize that residential segregation sometimes separates minorities from 
socially undesirable goods. Determining the conditions under which segregation 
is likely to do this is clearly beyond the scope of this article. However, I would 
hypothesize that such conditions are relatively rare, existing primarily in situations 
where socially desirable and undesirable goods are tied to one another spatially 
(as is the case with manufacturing jobs and manufacturing pollution) such that 
residential segregation allows whites to monopolize a socially desirable good 
while simultaneously forcing them to live near a spatially linked undesirable 
good. Thus, segregation is likely to have this surprising and unintended effect 
in only a limited number of situations. 

In addition to contributing to our understanding of residential segregation, 
this article demonstrates that environmental inequality researchers need to pay 
more attention to the broader sociological literature on race, urban development, 
and urban neighborhood ethnic dynamics from which the racial succession 
model was drawn. Doing so will not only help us develop more nuanced 
explanations of urban environmental inequality, it will also allow environmental 
inequality researchers to engage in important sociological debates concerning 
urban housing and labor markets, racial and ethnic competition, residential 
segregation, neighborhood succession, spatial mismatch, and group conflict over 
urban space. 

Finally, it should be noted that although this study examines only one type 
of environmental hazard, the spatial distribution of this hazard is quite similar 
to that of other Detroit metropolitan area environmental hazards: the region's 
superfund sites, large-quantity hazardous waste generators, and hazardous waste 
treatment, storage, and disposal facilities (TSDFs) are all located in the same set 
of industrial corridors as are polluting manufacturing facilities, though TSDFs 
are more highly concentrated in the city of Detroit than are the other hazards 
(maps available upon request). 

This suggests that in the Detroit metropolitan area, the factors and processes 
determining who lives near superfund sites and hazardous waste sites are probably 
quite similar to the factors and processes determining who lives near polluting 
manufacturing facilities. In addition, it suggests that although the location of 
urban environmental hazards is probably heavily constrained by the location
of urban industrial corridors, the factors affecting the distribution of hazards 
within these corridors probably vary across hazards. Determining what these 
factors are and whether they are related to market dynamics, the infrastructural 
or geophysical needs of different kinds of facilities, the social desirability of 
different types of hazards (i.e., factories vs. hazardous waste dumps), the ability 
of different neighborhoods to attract or repel differently valued hazards, or some 
other set of determinants would contribute greatly to our understanding of urban 
environmental inequality, as would replicating this study in other metropolitan 
areas to see if the patterns uncovered in Detroit hold in other regions of the 
country. 


Notes 


1. Throughout this article, siting refers to the placement of new manufacturing facilities in a 
neighborhood, while facility location refers to the presence of already-existing manufacturing 
facilities in a neighborhood. 


2. The geographic distribution of 1992 TRI facilities (all manufacturing sectors included) 
and facilities in the 1970, 1980, and 1990 four sector database is extremely similar, suggesting 
that the facility and TRI databases capture a common phenomenon. (Facility and TRI maps 
are available upon request).


3. Another potential problem with the data is that the factors shaping environmental 
inequality may vary across hazards, making it impossible to generalize the findings reported 
here to other hazards. The only way to determine whether these factors do vary is to study 
multiple hazards at the same time or conduct a series of studies, each with its own hazard 
indicator. Because the former option is not practical, this study focuses on a single, commonly 
studied environmental hazard, polluting manufacturing facilities. 


4. The 1970 and 1990 correlation coefficients for average minimum distance and percent 
black are significantly different at the 95% confidence level, the 1970 and 1990 correlation
coefficients for average exposure and percent black are significantly different at the 93%
confidence level, the 1970 and 1990 correlation coefficients for average minimum distance
and median family income are significantly different at the 95% confidence level, and the
1970 and 1990 correlation coefficients for average exposure and median family income are
significantly different at the 94% confidence level.


5. Although it would have been preferable to assign a single dot for every 500 or 1,000 blacks 
residing in a tract, doing so made it difficult to see the tracts underneath. Moreover, because 
most tracts in the Detroit metropolitan area were virtually either all white or all black in 1970 
and 1980, assigning each dot a lower value did not change the spatial relationships evident 
in Figures 1 or 2 in any substantively significant way. 


6. In Figure 1 and Figure 2, the spatial patterns observed in maps (c) and (d) are very similar 
regardless of whether I categorize census tracts according to average exposure or the total 
number of facilities in a tract. 


7. The facility density indicator really measures the density of facilities included in my data 
set. While it would have been preferable to have density data for all manufacturing facilities, 
such data were not available. Nevertheless, this indicator is better than no indicator and it is
negatively associated with the 1970/1980 dependent variable. 


8. I include the residential stability variables because residential stability has been shown 
to be positively associated with neighborhood collective efficacy (Sampson, Morenoff, and 
Earls 1999) which, in turn, is likely to be negatively associated with facility presence change. 
Neighborhood efficacy and facility presence change are likely to be negatively associated with 
one another because efficacious neighborhoods are more likely than other neighborhoods 
to organize themselves successfully against unwanted land uses. 


9. Percent employed in manufacturing is defined as the percentage of individuals in a tract who 
are 16 years old or older and are employed in one of the following occupational categories: 
precision production, craft, and repair; machine operators, assemblers, and inspectors; and 
handlers, equipment cleaners, helpers, and laborers. Railroad distance logged is the natural 
log of the distance in meters from the center of each tract (or merged “super-tract”) to the 
nearest railroad. 


10. Of course, percent black can increase if the number of blacks in a tract increases or the 
number of nonblack tract residents decreases. Thus, if any of the test variables are positively 
associated with percent black change it could be due to black in-migration, white out- 
migration, or both. Increases in percent black due to white out-migration are not inconsistent 
with economic or simple segregation model predictions, however, because both models expect 
that whites will be drawn away from facility presence tracts in disproportionate numbers. 
The economic model expects this to happen because whites are, on average, wealthier than 
blacks, and the model holds that wealthier individuals will flee facility neighborhoods, and 
the simple segregation model expects this to occur because whites have greater residential 
choice than blacks. Finally, the racial succession model holds that racial succession is due to 
black in-migration and white out-migration. 


11. I noted earlier that regression results are nearly identical regardless of whether or not I 
control for spatial and temporal autocorrelations. The one exception occurs in Table 6. When 
I control for spatial and temporal autocorrelation in Table 6, average property value is no 
longer significantly associated with the 1970/1980 dependent variable. 


12. If racist siting practices had played a role in shaping environmental inequality in the 
Detroit metropolitan area, changes in facility presence from one decade to the next would 
have been positively associated with percent black in Table 4. If black/white income inequality 
had played a role in shaping environmental racial inequality in the region, percent black 
would have been positively associated with facility presence change in Table 4, average family 
income logged would have been negatively associated with facility presence change in Table 
4, and the negative association that was expected between facility presence change and 
average family income logged in Table 4 would have washed out or weakened the positive 
association that was expected between percent black and facility presence change. Moreover, 
facility presence would have been negatively associated with average family income change 
in models 1 and 4 in Table 5. 


Abstract


The current controversy surrounding racial profiling in America has focused renewed 
attention on the larger issue of racial bias by the police. Yet little is known about the 
extent of police racial bias and even less about public perceptions of the problem. 
This article analyzes recent national survey data on citizens’ views of, and reported
personal experiences with, several forms of police bias—including differential 
treatment of individuals and neighborhoods, police prejudice, and racial profiling. 
We find that attitudes toward the prevalence and acceptability of these practices are 
largely shaped by citizens’ race, personal experiences with police discrimination, and 
exposure to news media reporting on incidents of police misconduct. The findings 
lend support to the group-position theory of race relations. 


Racial bias by the police includes such things as racial profiling of motorists, 
racial prejudice among police officers, and discriminatory treatment of minority 
individuals and minority neighborhoods. Little research exists on public 
perceptions of racially biased policing, though such perceptions may have 
important consequences. The perception of police practices as unfair or as racially 
motivated may lead to more frequent and severe confrontations between police 
and citizens and to greater distrust of the police. This article examines citizens’ 
views of and reported experiences with police bias and the major determinants 
of citizens’ perceptions. 


Explaining Public Perceptions of the Police


Race is one of the most consistent predictors of attitudes toward the police. 
African Americans are significantly more likely than whites to hold negative views 
of the police, but very little is known about Hispanics’ views. Do Hispanics tend 
to take a “minority group” perspective similar to that of African Americans; do 
they take an intermediate position in a white-Hispanic-black “racial hierarchy” 
pattern; or are their views more closely aligned with those of whites? Since only 
a few studies systematically compare police relations with Hispanics, blacks, and 
whites, the literature is insufficient to address these questions. 

A larger deficiency in the empirical literature concerns the question of why 
racial differences exist in citizens’ relations with the police. We argue that part 
of the explanation can be found in the group-position thesis, a variant of conflict 
theory. The group-position thesis focuses on intergroup competition over material 
rewards, status, and power. Racial attitudes reflect not merely individual-level 
feelings and beliefs but also a collective “sense of group position” vis-a-vis other 
racial groups (Blumer 1958), including (1) perceived threats: dominant group 
members’ fears that their group is at risk of losing privileges or resources to 
competing racial groups, and (2) perceived advantages: minority group members’ 
beliefs that their group interests will be enhanced by challenging the prevailing 
racial order. 

The group-position thesis has been used to explain intergroup racial attitudes 
(Blumer 1958; Bobo and Hutchings 1996; Kinder and Sanders 1996); we extend 
it to an analysis of group relations with social institutions. If the dominant group 
believes that it is entitled to valuable resources, it follows that the group will have 
an affinity with the institutions that serve their interests. One such institution is 
the criminal justice system. 

Dominant racial groups typically see the police as allies. This is especially 
transparent in deeply divided societies, like Northern Ireland and South Africa, 
where the police are or were a key institutional pillar in a sectarian sociopolitical 
system, and where the dominant racial or ethnic group traditionally views the 
police as an instrument for suppressing subordinate groups (Weitzer 1995). This 
affinity between the police and dominant groups is less pronounced in more 
democratic societies, but we argue that even in these societies, the general group- 
position dynamic structures group relations with the police. In the U.S., white 
support for the police has traditionally been robust. At the same time, whites 
tend to see blacks as inclined to criminal or violent behavior (Hurwitz and Peffley 
1997). In the 2000 General Social Survey, for example, half of whites viewed blacks 
as “violence-prone.” For whites who subscribe to these views, there is a tendency 
to condone police suspicion and disparate treatment of blacks as “rational 
discrimination” (Weitzer 2000). These attitudes may be more strongly held by 
some whites (the most racially prejudiced) than by others (Cohn and Barkan 
2004), but the group-position thesis predicts that these views are fairly common 
throughout the white population. Thus, for many whites, it seems sensible for
law enforcement to target minority individuals or minority neighborhoods. 

If whites tend to align themselves with the police, it follows that, when the 
police are criticized, whites may perceive their group interests as indirectly 
threatened (Bayley and Mendelsohn 1969; Jackson 1989). Our extension of 
group-position theory predicts that whites will tend to be dubious or dismissive 
of allegations of police misconduct. To accept that minorities are frequently 
mistreated would lend credence to reforms—reforms that might dilute crime 
control, thereby threatening whites. African Americans and Hispanics, on the 
other hand, should be more inclined to view the police as engaged in frequent 
abuse of minority citizens and, thus, as a “visible sign of majority domination” 
(Bayley and Mendelsohn 1969:195). This view of police does not mean that 
minorities are monolithically critical of the police, but it does increase the chances 
that they will see police misconduct as both a general problem and one that 
particularly afflicts Hispanics and blacks (Weitzer and Tuch 2004a). 

The group-position thesis stresses perceived (not necessarily real) threats 
to dominant group interests (Bobo and Hutchings 1996; Kinder and Sanders 
1996). While most blacks and Hispanics want more law enforcement, leaders
within the minority community often criticize the police in public, which may
reinforce whites’ impressions that minorities are trying to interfere with crime
control. In a nutshell, white skepticism of charges of police wrongdoing may 
be partly rooted in their attachment to the law-and-order status quo; minority 
perceptions of misconduct, on the other hand, may reflect their desire to gain 
better treatment from the police. 

Although we do not have direct measures of group interests and threats, our 
data do allow us to test the key predictions of group-position theory—namely, 
white defense of the police against charges of racial discrimination and minority 
belief that police racial bias is a serious problem. As noted above, it is unclear 
from the literature whether blacks and Hispanics hold roughly similar views or 
whether one group is more critical of the police. But just as African Americans 
have a deeper and more crystallized sense of relative group position vis-a-vis 
whites than is true for Hispanics (Bobo 1999), they also have a longer and more 
fractious history with the police in America. This is one reason that blacks’ 
opinions of the police might be expected to be significantly more negative than 
Hispanics’. Again, we lack a critical mass of studies on this issue.

Most of the literature is exclusively centered on the role of demographic 
characteristics in shaping views of the police (Brown and Benedict 2002; Gallagher 
et al. 2001; Weitzer and Tuch 1999). The role of nondemographic factors—both 
micro and macro—remains unclear. This is particularly true for “the public’s 
personal experiences with the police, what they learn second-hand from friends 
and acquaintances, and what they learn from the media” (Gallagher et al. 2001: 
v). A few studies suggest that these experiential and media factors may influence 
attitudes. Citizens’ personal contacts with the police, especially negative
experiences, appear to influence citizens’ larger views of the police (Tyler and
Huo 2002). One’s personal contacts thus seem to affect how people view the 
police more generally. Moreover, it is also possible that the experiences of family 
members and friends may be internalized and “vicariously experienced” by the 
actor, affecting his or her larger views of the police. Similarly, exposure to media 
reports on incidents of police misconduct (e.g., Rodney King) may adversely 
affect citizens’ confidence in the police, and this effect may be especially true for 
members of minority groups (Kaminski and Jefferis 1998; Sigelman et al. 1997; 
Tuch and Weitzer 1997; Weitzer 2002). But much more research is necessary to 
test these predictions. The present study examines whether citizens’ views of 
racially biased policing are shaped by citizens’ personal experiences, knowledge 
of others’ experiences, and exposure to media reporting on the police—and 
whether the effects of these factors are more cogent among blacks and Hispanics 
than among whites.

We test the following hypotheses:


Hypothesis 1a: Blacks and Hispanics are more likely than whites to believe 
that various forms of racially biased policing exist. 


Hypothesis 1b: These racial differences persist net of the influences of 
experiential, media, and control variables. 


Hypothesis 2a: Exposure to media reports of police misconduct increases 
perceptions of police bias among all groups. 


Hypothesis 2b: The effect, however, is stronger for blacks and Hispanics 
than for whites. 


Hypothesis 3a: Personal or vicarious experience with police bias increases 
perceptions of bias among all groups. 


Hypothesis 3b: The effect, however, is stronger for blacks and Hispanics 
than for whites. 


Data and Methods 


Data for this study come from the authors’ 2002 national survey of 1,792 white, 
African American, and Hispanic adult residents of U.S. metropolitan areas 
with at least 100,000 population. The sample is representative of adults living 
in households with telephones in urban and suburban areas that meet this 
population-size criterion. 

The data for this article were collected as part of a larger study by the authors 
of police-citizen relations in the U.S. The survey has the advantages of (1) 
oversampling African Americans and Hispanics, in contrast to the small number 
of minority respondents typically included in other surveys, and (2) tapping both 
attitudes toward police and personal and vicarious experiences with the police. 


SAMPLING


The survey was conducted for the authors by Knowledge Networks, a Web- 
based survey research firm that combines probability sampling with the reach 
and capabilities of the Internet to yield representative samples of respondents 
without sacrificing data quality. Research comparing the quality of data yielded 
by Knowledge Networks' Web-based survey methodology with that of random- 
digit dialing telephone surveys has found that Knowledge Networks yields 
representative samples that produce parameter estimates very similar to the 
estimates of random-digit dialing samples (Baker et al. 2003; Berrens 2003; 
Krosnick and Chang 2001). 

Knowledge Networks uses list-assisted random-digit dialing sampling 
techniques on a sample frame consisting of the entire telephone population, so 
that any household with a telephone can be selected for the Knowledge Networks 
panel, including computer users and nonusers alike. In exchange for free Internet 
hardware (such as a television set-top box), connectivity (an Internet connection 
paid for by Knowledge Networks), and on-site installation, participants agree to 
complete a maximum of three to four surveys per month. Selected households 
remain on the panel for two to three years. Currently, Knowledge Networks has 
more than 25,000 households in its Web-enabled panel. 


PANEL REPRESENTATIVENESS 


When using panels for survey research, potential sampling bias can occur at 
any of several stages. First, respondents consent to become panel members; this 
is referred to as the panel acceptance rate. At the time of the study, Knowledge 
Networks’ panel acceptance rate was 40%, calculated by standards established
by the American Association for Public Opinion Research. Second, the
within-survey completion rate (the percentage of panel members who completed
our questionnaire among those who received it) was 67%.

The Knowledge Networks panel is representative of and closely mirrors 
the U.S. population on key demographic, geographic, economic, and social 
characteristics. Four factors account for this representativeness. First, as mentioned 
above, the panel is selected using random-digit dialing telephone methodology, 
providing a probability-based starting sample of telephone households (according 
to the census, 98% of white households, 95% of Hispanic households, and 94% 
of African American households have telephone access). Second, the panel 
weights are adjusted to census demographic benchmarks to reduce error due 
to noncoverage of households without telephones and to reduce bias due to 
nonresponse and other nonsampling errors. Third, samples selected from the 
panel for individual studies are selected by probability methods, and sample 
design weights for each study are calculated to meet the design parameters. 
Fourth, nonresponse and poststratification weighting adjustments are applied 
to the final survey data to reduce the effects of nonsampling error. The result is
that the weighted demographic estimates from the census and the Knowledge 
Networks panel differ very little across gender, age, race/ethnicity, education, and 
region (see Table 1 for descriptive sample statistics). Moreover, analyses of panel 
attrition indicate that no significant differences separate those who remain on 
the panel from those who do not (Dennis and Li 2003).
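
The logic of a poststratification adjustment can be illustrated with a minimal sketch. It is not Knowledge Networks’ actual weighting procedure, which the text does not specify in detail; the function name, the toy sample, and the population shares below are all hypothetical. Each respondent receives a weight equal to the benchmark population share of his or her group divided by that group’s share of the sample, so that weighted group shares match the benchmark.

```python
from collections import Counter


def poststratification_weights(sample_groups, population_shares):
    """Return one weight per respondent: population share / sample share of the group.

    Hypothetical helper for illustration only.
    """
    n = len(sample_groups)
    counts = Counter(sample_groups)
    return [population_shares[g] / (counts[g] / n) for g in sample_groups]


# Toy sample that oversamples one group (values are invented, not survey data).
sample = ["white", "white", "black", "black", "black", "hispanic"]
# Hypothetical benchmark shares standing in for census figures.
shares = {"white": 0.70, "black": 0.15, "hispanic": 0.15}

weights = poststratification_weights(sample, shares)

# After weighting, the oversampled group's share equals its benchmark share.
weighted_share_black = (
    sum(w for w, g in zip(weights, sample) if g == "black") / sum(weights)
)
```

In this toy case the oversampled group is weighted down and the undersampled groups are weighted up, which is the sense in which weighting reduces bias from unequal selection and nonresponse.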


INDEPENDENT VARIABLES 


Race 


We use the term “race” broadly to include both racial and ethnic groups. Our 
sample consists of respondents who self-identify as African American, Hispanic, 
or non-Hispanic white on Knowledge Networks’ demographic profile of panel
members. 


Experiences with Police Discrimination 


Perceived personal and vicarious experiences with discriminatory police behavior 
were measured with the following questions: “Have you ever felt that you were 
treated unfairly by the police specifically because of your race in [your city/your 
own neighborhood]?” “Have you ever felt that you were stopped by the police 
just because of your race or ethnic background?” Parallel questions tapped 
vicarious experience, with reference to whether this had happened to anyone else 
in the respondent’s household. Responses to the personal experience questions
were combined to form an index of reported personal experience with police 
bias; responses to the vicarious experience questions were combined to create 
an index of reported vicarious experience with police bias. The alpha reliability 
coefficient is .80 for the personal experience index and .84 for the vicarious 
index. In the analysis, both indices are dichotomized (1 = experience with bias, 
0 = no experience). 
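
The index construction described above can be sketched as follows. The sketch makes two assumptions that go beyond the text: the items are scored 1 = yes and 0 = no, and a respondent is coded 1 on the dichotomized index if he or she answered yes to any item. The function names and the toy responses are hypothetical. The reliability coefficient is the standard Cronbach’s alpha.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a list of items, each a list of responses (one per respondent)."""
    k = len(items)
    totals = [sum(responses) for responses in zip(*items)]
    sum_item_variances = sum(variance(item) for item in items)
    return k / (k - 1) * (1 - sum_item_variances / variance(totals))


def dichotomize(items):
    """Code 1 if a respondent reports experience on any item, else 0 (assumed rule)."""
    return [1 if any(responses) else 0 for responses in zip(*items)]


# Invented yes/no answers from six respondents to two experience questions.
item_1 = [1, 0, 0, 1, 0, 1]
item_2 = [1, 0, 0, 1, 0, 0]

alpha = cronbach_alpha([item_1, item_2])
dichotomized = dichotomize([item_1, item_2])
```

Dichotomizing discards how many items a respondent endorsed, but it yields a simple contrast between respondents who report any experience with bias and those who report none.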


Media Exposure 


We asked the following question in order to gauge respondents’ exposure to media 
accounts of police misconduct: “How often do you hear or read about (on the 
radio, television, or in the newspapers) incidents of police misconduct (such as 
police use of excessive force, verbal abuse, and corruption) that occur somewhere
in the nation?” Response options were “never,” “rarely,” “sometimes,” and “often”
on a four-point scale coded so that higher scores indicate more frequent exposure.


Controls


We control on several demographic factors: age, in years; gender (1 = male, 0 = 
female); place of residence, measured with a dummy variable for city (coded 1) 
versus suburb (coded 0); region (1 = south, 0 = nonsouth); education, measured 
on a 9-point ladder ranging from less than high school (coded 1) to doctorate 
degree (coded 9); and household income, measured on a 17-step ladder ranging 
from less than $5,000 per year (coded 1) to $125,000 or more (coded 17). Three 
items measure respondents’ assessment of neighborhood crime conditions: 
“Overall, how safe do you feel being alone outside in your neighborhood [during 
the day/at night]—very safe, somewhat safe, somewhat unsafe, or very unsafe?” 
“How serious a problem is crime in your neighborhood—very serious, somewhat 
serious, not serious, or not a problem at all?” Responses to these questions are 
coded so that higher scores reflect less safety and more perceived neighborhood 
crime. 


DEPENDENT VARIABLES 


Perceptions of police racial bias were measured by four sets of questions, with
each set of items combined into a scale: (1) Racial bias against individuals: “Do 
you think the police in your [city/neighborhood] treat whites and blacks equally, 
do they treat whites worse than blacks, or blacks worse than whites?” A parallel 
question asked respondents to compare whites and Hispanics. Alpha for this 
index is .91. (2) Racial bias against neighborhoods: The police engage in positive 
and negative typifications of different locales (Smith 1986), which may lead to 
disparate treatment of neighborhoods. We asked, “In general in the U.S., do you 
think that police services in white neighborhoods are better, worse, or about the 
same as in black neighborhoods?” A parallel question asked for a comparison 
between white and Hispanic neighborhoods. Alpha for this index is .88. (3) 
Police prejudice: “How common do you think racial or ethnic prejudice is among 
police officers [throughout the U.S./in your city/in your neighborhood]?” Alpha 
for this index is .86. (4) Racial profiling: “Since many drivers engage in minor 
traffic violations like speeding, it is sometimes hard to tell why some drivers get 
stopped by the police while others do not. Do you think that black drivers are 
more likely to be stopped by the police than white drivers for the same types of 
violations?” A parallel question compared Hispanic and white drivers. Regarding 
approval of profiling, we replicated a 1999 Gallup poll question: “There have 
been reports that some police officers stop drivers from certain racial groups 
because they think members of these groups are more likely to commit crimes. 
This is known as ‘racial profiling.’ Do you approve or disapprove of the use of
this practice?” Regarding the scope of profiling, we asked, “Do you think that 
racial profiling is widespread or not widespread [in the U.S./in your city/in your 
neighborhood]?” Alpha for the profiling index is .73. 
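
The alpha values reported for each index are reliability coefficients. As a hedged illustration, the sketch below computes Cronbach's alpha from item-level scores using the standard formula, alpha = k/(k − 1) × (1 − sum of item variances / variance of the summed scale). The function name `cronbach_alpha` and the example scores are hypothetical; they are not the study's data or the authors' code.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a list of k items.

    `items` is a list of k lists; each inner list holds one item's scores
    for the same n respondents, in the same respondent order.
    """
    k = len(items)
    if k < 2:
        raise ValueError("need at least two items")
    # Each respondent's total score across the k items.
    totals = [sum(scores) for scores in zip(*items)]
    # Sum of the sample variances of the individual items.
    item_var_sum = sum(variance(scores) for scores in items)
    return (k / (k - 1)) * (1 - item_var_sum / variance(totals))


# Made-up scores for two items answered by four respondents.
example = [[1, 2, 3, 4], [2, 1, 4, 3]]
alpha = cronbach_alpha(example)
```

Items that move together across respondents push alpha toward 1; two identical items yield an alpha of exactly 1.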




Table 1. Descriptive Statistics: Means (and Standard Deviations) on All Variables

Columns: Total sample; Whites; Blacks; Hispanics.

Rows: Education; Income; Gender (1 = male); Age; Residence (1 = city); Region (1 = south); Safety (day); Safety (night); Neighborhood crime; Media exposure; Personal experience (1 = yes); Vicarious experience (1 = yes); Bias against individuals; Bias against neighborhoods; Police prejudice; Racial profiling.

Notes: The white and black means are significantly different, p < .05. The white and Hispanic means are significantly different, p < .05. The black and Hispanic means are significantly different, p < .05.

[Cell values are not legible in the source scan.]




Table 1 presents descriptive statistics for all study variables. Compared to 
whites, African Americans and Hispanics in our sample report significantly less 
education and lower incomes; are younger; reside in cities rather than suburbs; 
report less neighborhood safety and more crime; report greater exposure to media 
coverage of police misconduct; and have more frequent experiences with, and 
greater perceptions of, police bias. Blacks and Hispanics also differ significantly on 
each of these variables (except income, neighborhood safety and crime). Blacks 
are more likely than Hispanics to report media exposure to police misconduct 
and personal/vicarious experience with racially biased policing, and they are also 
more likely to perceive the existence of our four types of police racial bias. 


Results 


Hypothesis 1 predicted that blacks and Hispanics are more likely than whites to 
believe that racially biased policing exists. To test this hypothesis, we first
cross-tabulated with race each of the items that compose the various racial bias indices;
we briefly summarize those results here (not shown in a table). First, a majority
of blacks (75%) and Hispanics (54%) believe that police in their city treat blacks
worse than whites, and virtually the same proportions also believe that Hispanics
are treated worse than whites (74% and 53%, respectively). Few whites agree:
The overwhelming majority of whites (75%–77%) believe that police in their
city treat whites and the two minority groups “equally.”

A second set of questions pertains to perceived disparities in police services 
toward neighborhoods populated by the racial groups. A majority of blacks 
and Hispanics (but just one-third of whites) believes that police provide
“worse” services to black and Hispanic neighborhoods (in comparison to white 
neighborhoods) throughout the U.S. It is interesting that blacks are more likely 
than Hispanics (77% vs. 61%, respectively) to believe that Hispanic neighborhoods
are treated worse than white areas. The majority of whites believe that police treat 
neighborhoods similarly. 

Third, blacks and Hispanics are also much more likely than whites to believe 
that police prejudice is a problem. Three times as many blacks as whites believe 
that police prejudice is “very common” throughout the U.S., and blacks are 
about six times as likely as whites to believe it is very common in their own city 
(Hispanics take an intermediate position). And fourth, while large majorities of 
all three groups disapprove of racial profiling (73% of whites, 77% of Hispanics, 
90% of blacks) and believe that it is widespread in the U.S. (70% of whites, 
83% of Hispanics, 92% of blacks), only a third of whites believe that profiling is
pervasive in their own city (vs. 59% of Hispanics and 80% of blacks). 

Finally, a large gulf separates minorities and whites when it comes to their 
own experiences with racial discrimination by the police. For instance, significant 
numbers of Hispanics (23%) and blacks (37%), but almost no whites (1%), 
report being "treated unfairly" by police in their city specifically because of their
race, and similar percentages of each group report being stopped by police solely 
because of their race. The same disparities are found for vicarious experience, 
that is, racially biased treatment of someone in the respondent's household. 

These race differences are consistent with hypothesis 1a, but do they persist 
net of the influences of demographic, neighborhood, media, and experiential 
variables? And what other factors besides race shape views of police bias? To 
answer these questions, we turn to the multivariate results in Table 2. 

Table 2 displays results from regressing each of the police bias indices on race, 
the other demographics, and the neighborhood, media, and experience variables. 
Each model in the table presents coefficients both for the total sample and for 
each racial group separately. The total sample coefficients in Table 2 show that 
race differences in views of police bias persist net of the influence of the other 
predictors, supporting hypothesis 1b. In all four models—bias against individuals, 
bias against neighborhoods, police prejudice, and racial profiling—blacks and 
Hispanics are significantly more likely than whites to adopt a critical view of the 
police, and—consistent with the racial hierarchy thesis—Hispanics’ assessments 
of police bias are intermediate between the views of whites, who are less negative, 
and those of blacks, who are more negative.⁸

Several of the coefficients associated with the remaining predictors in the total 
sample columns of Table 2 are significant in at least three of the four models: 
younger people, persons exposed to media reports on police misconduct, and 
those who have personally experienced police bias are more likely to perceive all 
four types of bias. In addition, increasing education, city residence, and living 
in a neighborhood with a serious crime problem each significantly increases 
perceptions of bias in three of the four models. 

The group-specific coefficients in Table 2 allow for an examination of racial 
differences in the effects of the predictors on attitudes toward racialized policing. 
Model 1 presents results for bias against minority individuals. The demographic 
variables have generally limited and inconsistent effects on citizen perceptions 
of this type of police bias, although social class does have some effect. Among 
whites, higher education increases perceptions of bias against individuals; among 
blacks, higher income increases these perceptions; but neither education nor 
income shapes Hispanics’ attitudes in this area. Similarly, the three dimensions of 
neighborhood crime have limited effects, though the feeling that crime is serious 
in one’s neighborhood increases blacks’ and Hispanics’ perceptions of police bias, 
and fears about safety (during the day) do so for blacks.

As predicted by hypothesis 2a, exposure to media accounts of police 
misconduct significantly increases perceptions of police bias against minority 
individuals among all three racial groups, though difference-in-slopes tests
indicate that the effect of media exposure is not significantly stronger among
blacks and Hispanics than among whites, contrary to hypothesis 2b. Hypothesis
3a predicted that personal and vicarious experience with police bias would
amplify the perception that police discriminate against minority individuals, and
hypothesis 3b predicted that this effect would be strongest among blacks and 
Hispanics. We found that personal (but not vicarious) experience with police 
discrimination significantly shapes perceived bias among blacks and Hispanics 
(but not among whites), and, consistent with hypothesis 3b, these effects are 
significantly stronger among blacks and Hispanics than among whites. Moreover, 
as indicated by the magnitudes of the standardized coefficients associated with 
the media and personal experience variables, these effects are among the strongest 
in the model. 
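
The text does not specify which difference-in-slopes test was used. One common form, sketched here under that assumption, compares unstandardized coefficients from two independently estimated group models with z = (b1 − b2) / sqrt(SE1² + SE2²). The function name `slope_difference_z` and the coefficient values are hypothetical, not estimates from Table 2.

```python
import math


def slope_difference_z(b1, se1, b2, se2):
    """z-test for equality of two regression coefficients.

    Assumes b1 and b2 come from models fit to independent samples
    (for example, separate racial groups), with standard errors se1 and se2.
    Returns the z statistic and a two-sided p-value from the standard normal.
    """
    z = (b1 - b2) / math.sqrt(se1 ** 2 + se2 ** 2)
    # Two-sided p-value: 2 * (1 - Phi(|z|)) equals erfc(|z| / sqrt(2)).
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p


# Made-up coefficients for illustration only.
z, p = slope_difference_z(0.9, 0.3, 0.1, 0.4)
```

With these made-up inputs the statistic is 1.6, which falls short of the conventional 1.96 threshold, so the two slopes would not be judged significantly different at the .05 level.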

Model 2 in Table 2 reports the effects of the predictors on perceived bias 
against minority neighborhoods. In this model, higher education among whites, 
and higher education and income among blacks, increase perceptions of police 
bias; age is significant for whites and blacks, with older members of both groups 
reporting less perceived bias; and the remaining demographic predictors are 
largely unimportant. The view that neighborhood crime is high increases whites' 
perceptions of police bias against minority neighborhoods; and fears about safety 
(at night) increase blacks' perceptions of bias; safety fears (during the day), on the 
other hand, decrease blacks' and Hispanics' perceptions of bias. For each racial 
group, exposure to media reports of police misconduct significantly influences 
perceptions of differential police treatment of neighborhoods, strongly supporting 
hypothesis 2a. Contrary to the prediction of hypothesis 2b, however, we found 
no white-black or white-Hispanic differences in the effects of media exposure, 
though we did find that the effect of the media variable is significantly stronger 
for blacks than for Hispanics. Similarly, personal (but not vicarious) experience 
with police discrimination increases blacks’ and Hispanics’ perceptions of bias, 
lending partial support to hypothesis 3a. The differences with whites are not 
significant, despite hypothesis 3b’s predictions.

The third model in Table 2 summarizes findings for perceptions of police 
prejudice. Consistent with earlier results, the demographic and neighborhood 
crime factors have only sporadic effects on perceptions of prejudice, though black 
and Hispanic city dwellers have higher levels than their suburban counterparts; 
and neighborhood crime increases perceptions of prejudice among blacks and 
Hispanics. Media exposure and both personal and vicarious experience with 
police misconduct, on the other hand, increase perceptions among all three racial 
groups that police officers are racially prejudiced, again providing strong support 
for hypotheses 2a and 3a. As before, however, the largely uniform effects of these 
variables across racial groups are not consistent with hypotheses 2b and 3b. 

Model 4 in Table 2 reports results for the racial profiling index. Again, 
demographic⁹ and other control factors play only a small role in accounting for
citizens’ perceptions and evaluations of racial profiling. As predicted, however, 
media exposure strongly shapes perceptions; and either personal or vicarious 
experience with police bias significantly increases perceptions that racial profiling 
is widespread and unacceptable. Again, evidence of significant minority–white
differences of the kind predicted by hypotheses 2b and 3b is weak.

In sum, the demographic factors (other than race) tend to have only sporadic 
effects in each model; this finding is consistent with the literature on public 
perceptions of racial discrimination in other institutional arenas (housing, jobs, 
education) documenting limited or weak demographic effects (Kluegel and 
Bobo 2001; Sigelman and Welch 1991). Supporting hypothesis 2a, we found that 
exposure to media accounts of police misconduct is significant in all models 
for each group. Hypothesis 2b's prediction of stronger media effects among 
minority than among white citizens was not supported, however. Hypothesis 
3a predicted that personal or vicarious experience with police bias would 
increase citizens’ overall perceptions of racialized policing among all groups. The 
personal experience variant of hypothesis 3a receives strong support; evidence 
for the vicarious experience version of the hypothesis is somewhat weaker, 
with significant effects present in the prejudice and profiling, but not in the 
individual- or neighborhood-bias, models. Hypothesis 3b also received partial 
support. Compared to whites, personal experience with police bias significantly 
increases perceptions of biased policing for blacks and Hispanics in model 1, and 
for Hispanics in model 4. Vicarious experience with police bias, though not as 
consistent a predictor as personal experience, is nevertheless significant in two 
of the four models for each racial group, lending partial support to hypothesis 
3a—though the racial differences predicted by hypothesis 3b are not present. The 
fact that there are few racial differences with regard to the effect of personal and 
vicarious experience on perceptions of police bias is consistent with some other 
studies that have found that the effect of certain kinds of personal experience— 
such as racial profiling—is so powerful that it colors attitudes toward the police 
irrespective of race (Weitzer and Tuch 2002). 


Conclusion 


Race structures citizen views of police racial bias, as it does other aspects of 
policing. In all four models, blacks and Hispanics are more likely than whites to 
believe that police bias is a problem. Blacks, however, are more likely to perceive 
such bias than Hispanics, net of other factors. This finding helps address one 
unanswered question in the literature on police–minority relations: whether
blacks and Hispanics share a minority-group perspective or whether perceptions 
take the form of a white-Hispanic-black racial hierarchy. On the issue of racialized 
policing, we find that, consistent with the racial-hierarchy pattern, blacks and 
Hispanics do differ significantly. Indeed, on some questions, Hispanics are much 
less likely to perceive bias than are blacks. For instance, blacks are more likely to 
perceive police discrimination against Hispanics than Hispanics themselves are 
to hold this belief, and blacks are more likely than Hispanics to say that Hispanic 
neighborhoods are discriminated against (vs. white neighborhoods) and to believe 
that Hispanic drivers are racially profiled (compared to white drivers). Blacks are
thus more inclined to perceive racial bias against both minority groups. These 
findings indicate that “minority” perceptions are not monolithic: in terms of 
racially biased policing, Hispanics differ in some important ways from African
Americans. One reason that the two groups differ, at least with regard to their 
personal experiences, may have to do with their visibility. For instance, blacks 
may be more vulnerable than Hispanics to traffic stops by police because their 
skin color heightens their visibility. But further research is needed to account 
more fully for black–Hispanic differences in relations with police. It may be that
black and Hispanic views are issue-specific: on some issues most Hispanics and 
blacks may agree, while on other issues there may be less consensus (see Weitzer 
and Tuch 2004a, 2004b). 

Americans are overwhelmingly opposed in principle to racially biased law 
enforcement. When asked in one poll whether it is the responsibility of the 
federal government to ensure that minorities and whites receive equal treatment 
from the police and the courts, large majorities of whites, Hispanics, and African 
Americans answered affirmatively (Washington Post 1995). But support for the 
principle of equal justice does not necessarily mean that one believes the system 
dispenses unequal justice. Our data indicate that many whites believe that the 
system operates impartially. Over three-quarters of whites think that police 
treat individual blacks and Hispanics the same as whites; a substantial majority 
of whites take the same view of minority and white neighborhoods; among 
whites who believe that police officers are prejudiced, most take the position 
that prejudice is mild, “somewhat common” instead of “very common”; and 
only one-third of whites believe that police engage in racial profiling of minority 
drivers—stopping them more frequently than whites for the same kinds of traffic 
violations. That many whites are skeptical with regard to police discrimination, 
or see it as isolated and episodic rather than widespread, is consistent with their 
views of racial discrimination elsewhere in American society (Hochschild 1995; 
Schuman et al. 1997). In one poll, for instance, only 17% of whites—compared to 
44% of blacks—thought that blacks are discriminated against “a lot” in America 
(Washington Post, 12 June 1997). For most whites, racial discrimination in general, 
and police discrimination in particular, is not a serious problem in America. 
Minorities, by contrast, tend to perceive racial discrimination in a wide range of 
institutional arenas, including housing, employment, and education (Hochschild 
1995; Schuman et al. 1997). 

Blacks and Hispanics are also significantly more likely than whites to report 
that they have personally been discriminated against by the police and that this has 
happened to another member of their household. While Hispanics are less likely 
than blacks to report these kinds of experiences, the percentage of Hispanics who 
do so is closer to that of blacks than it is to that of whites—departing somewhat 
from the racial hierarchy pattern. 

Of course, perceived experience with police bias is not necessarily equivalent 
to actual discrimination, since the sheer exercise of police authority (typically in
a brusque and authoritarian manner) may be construed as racial bias by citizens 
(Sykes and Clark 1975; Wilson 1972). But, as is true for racial discrimination in 
other spheres, there is at least a rough aggregate correspondence between actual 
practice and blacks' and Hispanics' reported experiences of police treatment. Our 
respondents’ self-reports are consistent with evidence from street observations 
of police-citizen interactions and records of police stops, which indicate that 
police indeed tend to view minorities with a high degree of suspicion and as 
having criminal propensities. Minorities tend to be stopped more often than 
whites (Fagan and Davies 2000; Harris 2002) and to be treated more harshly 
in encounters (Hepburn 1978; Smith 1986; Terrill and Reisig 2003). Similarly, 
the high percentage of blacks and Hispanics who believe that police prejudice 
is widespread in the nation is consistent with data on police officers themselves. 
As Jefferson concludes, “All the major British and North American studies, from 
the early post-war period on, agree that negative, stereotypical, prejudiced, and 
hostile attitudes to blacks are rife amongst police officers” (1988:522).

To explore these racial differences further, we examined several possible
predictors of perceptions. Most studies of police–citizen relations focus on
demographic variables, which, we find, do not fully exhaust the range of 
determinants of public opinion. Citizen views are also strongly influenced by 
certain nondemographic factors. 

First, personal and vicarious experience of racially biased policing shapes 
perceptions of police bias, net of other factors. For blacks and Hispanics, such 
personal experience significantly increases perceptions of racialized policing in 
all four models. This is not the case for whites (with one exception).
Vicarious experience significantly affects perceived police bias in two of the 
four models for each racial group. A similar pattern is found with respect to 
discrimination in other arenas, such as jobs and housing. One study, for instance, 
found that blacks, Hispanics, and Asians who felt that they had personally 
experienced job discrimination were more likely to perceive job discrimination 
against their minority group generally (Kluegel and Bobo 2001). 

A second key finding is the mass media’s role in shaping perceptions of 
racialized policing. Repeated exposure to media reports on police abuse (i.e.,
excessive force, verbal abuse, corruption) is a strong predictor of the belief that 
police bias exists, is widespread, and is unacceptable. Media effects are extremely 
robust—operating for all three racial groups in all four models, net of other 
factors. People who frequently hear or read about incidents of police misconduct, 
as transmitted by the media, are inclined to conclude that the police engage in 
racial profiling, are prejudiced, and discriminate against minority individuals 
and neighborhoods. Though it is usually overlooked by researchers who study 
public perceptions of the police, the mass media appears to be an important 
determinant of those perceptions. 

As indicated earlier, much of the literature documents race differences but
does not adequately explain them. Our extension of the group-position thesis 
holds that views of social institutions will be influenced by group interests and 
perceived threats. Dominant groups should perceive the police as an institution 
allied with their interests, whereas minorities should be more inclined to view 
the police as contributing to their subordination. These predictions are generally 
supported by our findings. Whites tend to minimize or discount the existence 
of racialized policing and perhaps view charges of police racism as a threat to a 
revered institution. Blacks are inclined to believe that police bias is common, and 
many Hispanics share this view. Both groups are interested in curbing abuses of 
citizens, particularly minority citizens, who are disproportionately the recipients 
of mistreatment. Greater controls on the police may have the effect of advancing 
the group interests of blacks and Hispanics (Weitzer and Tuch 2004b). 

Our findings highlight the role of both micro- and macro-level factors in 
fostering racial differences in evaluations of the police: The greater tendency for 
blacks and Hispanics to perceive bias is largely a function of their disproportionate 
adverse experiences with officers and exposure to media reports of police abuse. 
Views of the police are thus related to racial differences not only in general 
group-position relationships but also in real or perceived group vulnerability to 
abusive practices, which is reinforced by both personal experience and exposure 
to media reports of abuse. 


Notes 


1. In the present survey, Hispanics and blacks overwhelmingly supported an increase, in their 
city, in the number of officers patrolling the streets in police cars (80% for both groups) 
and on foot patrol (69% and 80%, respectively) as well as “more police surveillance of areas 
where street crimes occur frequently” (85% and 88%, respectively). White support for these 
changes was similarly high. 


2. The American Association for Public Opinion Research response rate definitions can be 
viewed at www.aapor.org. 


3. A detailed demographic panel analysis is available at www.knowledgenetworks.com. 


4. We recognize that only one type of vicarious experience is measured by our question. It
does not include other associates, such as friends, coworkers, and neighbors. 


5. Because our media-exposure measure is based on respondents’ self-reports, some caution
is in order in interpreting media effects. Some self-selection may be involved in exposure 
to media coverage of the police, with acutely interested persons being more attentive than 
others. Our media variable is also fairly broad, asking about exposure to reports of police 
abuse anywhere in the country, which may or may not include the respondent's city of 
residence. An alternative measure would ask specifically about media coverage of incidents 
in the respondent's city. 


6. We use the term “racial bias” as a construct referring to the four types of outcomes 
described in this paragraph, while recognizing that two of the questions refer to disparate 
treatment of individuals and neighborhoods, which may or may not reflect outright racial
discrimination. 


7. All the cross-tabulated relationships discussed in this section are statistically significant 
(p<.001). 


8. Hispanics were asked to identify their ancestry as Mexican, Puerto Rican, Cuban, Central 
or South American, Caribbean, or other Hispanic. No significant subgroup differences 
on perceived police bias were found. Because of the small sample sizes for some of these 
subgroups, this finding should be considered tentative. It is also for this reason that our 
primary analyses compare the three major racial groups rather than the Hispanic nationality 
groups. 


9. A previous study (Weitzer and Tuch 2002) reported class effects on attitudes toward racial 
profiling among African Americans, with more educated blacks expressing more disapproval 
of the practice and greater awareness of its existence than less educated blacks. We suspected 
that the absence of class effects among blacks (in model 4, Table 2) was attributable to 
differences in the measures of profiling used in the two studies. To test this, we selected from 
our four-item index of profiling attitudes the two questions that we replicated from our earlier 
study—approval of profiling and assessments of whether the practice is widespread—and fit 
logistic regression models to the data for blacks. We found class effects consistent with our 
earlier findings: Higher-income blacks are more likely than lower-income blacks to disapprove 
of profiling, and more educated blacks are more likely than their less educated counterparts 
to believe that profiling is widespread. 


10. The only exception to this pattern is that a majority of whites believe that racial profiling 
is widespread in the U.S. (but not in their city or neighborhood). Curiously, in an apparent 
contradiction, whites are twice as likely to believe that profiling is widespread as to believe
that blacks or Hispanics are stopped more frequently than whites. 


Abstract


Is neighborhood disadvantage associated with social support? If so, does residential 
stability modify that association? And are there gender- and race-contingent 
patterns? Among a sample of adults aged 65 years and older, neighborhood 
disadvantage is associated positively with received and donated support among 
black women, but only in neighborhoods with higher levels of residential stability. In 
contrast, neighborhood disadvantage is associated negatively with donated support 
among white men and negatively with received support among white women, but 
only under conditions of low residential stability. I discuss the implications of these 
findings for theories about stress and community-level effects on social relationships 
in late life and draw linkages to the broader sociological discourse on social capital 
and collective efficacy. 


The importance of social cohesion and social capital to individual and societal 
well-being is of long-standing interest in sociology (Berkman et al. 2000; Coleman 
1988, 1990; Durkheim 1897; Portes 1998; Putnam 2000). The availability and 
presence of social relationships that are marked by trust and reciprocity are 
prominent features of social capital (Kawachi and Berkman 2000). In their review 
of social support research, House, Umberson, and Landis (1988) called for more 
attention to the social-structural influences on one aspect of social capital, social 
support, claiming that “while there is a substantial theoretical base in sociology 
suggesting that macrosocial features influence social relationships, there is little 
empirical evidence to substantiate the nature of that influence" (310). Given the
generally positive effects of social support for health, researchers have recently 
started to focus more attention on the social impact of neighborhood disadvantage 
(Aneshensel and Sucoff 1996; Sampson 1988, 1991). Much remains unknown, 
however, about the association between neighborhood disadvantage and different 
forms of support. Moreover, although prior studies document that residential 
stability modifies the effects of neighborhood disadvantage (Ross, Reynolds and 
Geis 2000), and others identify the plausibility of gender- and race-contingent 
effects (Belle 1982, 1983, 1991; Stack 1974), there is a lack of systematic analysis 
of those possibilities with respect to both received and donated social support 
(House 1987; Thoits 1995). 

To address those deficits, I examine four questions: (1) What is the effect of 
neighborhood structural disadvantage on received and donated social support? 
(2) Does residential stability modify those associations? (3) Are there gender- and 
race-contingent patterns? and (4) Do individual-level socioeconomic conditions, 
perceptions of neighborhood problems, and residential tenure contribute to 
community-level effects? 


Definition and Significance of Social Support 


The stress process framework identifies social support as an important 
environmental resource because of its distribution across roles and statuses, 
because its absence can diminish well-being, and because it often helps people 
avoid or manage stress (Cutrona and Russell 1990; Pearlin et al. 1981; Weiss 1974). 
Supportive bonds moderate the effects of stress by enhancing other personal 
resources such as self-esteem, confidence, and personal control (Turner and 
Roszell 1994). Pearlin contends that "it is good to have supports during times 
of travail, but it is also good to have them when nothing is needed but a sense 
of place and belongingness" (1985:51). 

An array of interpersonal dimensions such as trust, reliability, and obligation 
signify social capital (Coleman 1990). In the context of community-level stress, 
Putnam's (2000) concept of "bonding social capital" is particularly relevant 
because it underscores the significance of the strong bonds that link individuals 
to friends and relatives. Unlike the processes of “bridging social capital,” in which 
social ties help individuals “get ahead,” bonding social capital helps people “get 
by” under adverse conditions. Researchers in the sociology of mental health have 
long investigated the effects of bonding social capital, although usually under the 
conceptual rubric of “social resources” or, more specifically, “social support.” Most 
of that research examines the perception of received support (Thoits 1995), which 
involves “the clarity or certainty with which the individual experiences being 
loved, valued, and able to count on others should the need arise” (Turner and 
Turner 1999:303).¹ Although there are many forms of support (Barrera 1986), 
the broad consensus is that perceived support is most relevant for well-being 
during late life (Norris and Kaniasty 1996). Such findings have led Krause to 
conclude that "efforts to better understand the social support process will be more 
profitably directed toward the study of perceived support" (2001:273). Moreover, 
despite the fact that support is often transactional (Kahn and Antonucci 1980), 
the provision of support has received less attention (Berkman et al. 2000; Thoits 
1995). Support donors are available when others need help or advice or when they 
are feeling down. The provision of emotional support, in part, involves empathic 
acts, such as listening and understanding the sorrows of others. 

Macrolevel influences on the social fabric of community life have been a 
core theme in sociology, especially in urban sociology (Kasarda and Janowitz 
1974; Wirth 1938). Conversely, proximate microlevel elements—consisting of 
supportive ties—gel over time to create the form and consequences of the social 
climate of the community. Most early research labeled such processes “community 
attachment.” Individual residents are only as attached to the general community 
as they are to specific human relationships in that community. The systemic 
view of community social organization identifies a complex web of social ties 
that entail mutual obligations and expectations (Kasarda and Janowitz 1974; 
Sampson 1988). Reliable sources of support, as well as those to whom support is 
provided, are often, although not exclusively, derived from proximate networks of 
friendship and kinship ties (Wellman 1992; Wellman and Wortley 1990). Many 
individuals maintain close, reciprocal emotional bonds with people who reside 
in proximate geographic space. Because supportive relations are often rooted in 
ecological, institutional, and normative community structures (Sampson 1991), 
the quality of local conditions is relevant, especially with respect to neighborhood 
disadvantage and stability. 


Neighborhood Conditions and Social Support 


In their classic book Juvenile Delinquency and Urban Areas (1942), Shaw 
and McKay identified economic disadvantage as among the most influential 
community-level conditions because of its potential to erode social organization 
and, in turn, foster criminal activity. Since then, studies in the “neighborhood 
effects” literature have applied social disorganization theory, which posits that 
a lack of legitimate avenues for socioeconomic achievement cultivates isolation 
from mainstream institutions and formal organizations. These factors purportedly 
erode social control, cohesion, and collective action (Sampson 1991; Wilson 
1991).² Drawing upon that research, I focus on the social impact of neighborhood 
disadvantage, net of individual-level socioeconomic status (SES), perceived 
neighborhood problems, and residential tenure. 

I frame the analysis around two views: social disorganization versus social 
mobilization. The social disorganization hypothesis asserts that neighborhood 
social and economic disadvantages erode the availability and quality of social 
support. Concentrated disadvantage reflects limited socioeconomic opportunities 
and lower-quality health care, education, leisure, and transportation services 
(LaGrange, Ferraro, and Supancic 1992; LeClere, Rogers, and Peters 1997b; 
Massey 1996; Massey and Denton 1993; Thompson and Krause 1998; Wilson 
1987, 1996). Studies of urban social problems, especially crime, have focused on 
social disorganization and concepts such as “collective efficacy” as an intervening 
mechanism (Browning and Cagney 2003; Bursik 1988; Sampson and Groves 1989; 
Sampson, Raudenbush, and Earls 1997). Although direct evidence about support 
is scarce, Geis and Ross (1998) found that residents in neighborhoods with more 
poverty reported having fewer interactions and exchanging less instrumental 
support with their neighbors. 

Alternatively, the stress process perspective (Pearlin 1999) posits that 
neighborhood disadvantage elevates exposure to social and economic stressors 
(Boardman et al. 2001). According to Wheaton (1985), the presence of adversity 
may encourage individuals to mobilize social resources. As the concept of bonding 
social capital suggests, supportive social relations may help people enhance their 
coping repertoires (Pearlin and Schooler 1978) and to get by under disadvantaged 
conditions (Putnam 2000). There is also a parallel with research on structural 
disadvantage and collective efficacy: Sampson, Raudenbush, and Earls (1997) 
contend that a higher level of collective efficacy reflects a neighborhood's capacity 
and willingness to marshal social resources in order to achieve community-level 
goals. Likewise, the social mobilization thesis proposes that disadvantaged 
conditions mobilize an individual's capacity or desire to cultivate bonding social 
capital in order to cope with and endure adversity. 


RESIDENTIAL STABILITY AS AN EFFECT MODIFIER 


Shaw and McKay (1942) also underscored residential stability as another critical 
neighborhood condition because of its impact on social organization. Measures 
of stability typically involve the percentage of individuals in a census tract who 
have lived in the same household for the past five years (Ross, Reynolds, and Geis 
2000). Like structural disadvantage, residential stability has social implications. 
Wirth claimed that “a transitory habitat does not generate binding traditions and 
sentiments” (1938:17). More recently, Kasarda and Janowitz noted that “since 
the assimilation of newcomers into the social fabric of local communities is 
necessarily a temporal process, residential mobility operates as a barrier to the 
development of extensive friendship and kinship bonds and widespread local 
associational ties” (1974:330). 

Drawing upon those themes, social disorganization theory posits that 
residential stability enhances social integration and the likelihood that residents 
know each other (Sampson 1988, 1991), whereas residential mobility raises the 
likelihood that neighbors feel estranged from each other and weakens the capacity 
for collective action with respect to local issues (Sampson, Raudenbush, and 
Earls 1997). A greater concentration of long-term residents may foster a social 
climate in which people establish familiar, enduring bonds (Freudenberg 1986). 
Some research supports the integrative functions of stability. In a survey of 
residents of England and Wales, Sampson (1988) found that residential stability 
is associated positively with local friendship ties, net of urbanization, density, 
and other community-level controls. Among a sample of adolescents, Aneshensel 
and Sucoff (1996) found that residential stability is associated positively with 
perceived social cohesion. Likewise, among Illinois adults, Ross, Reynolds, and 
Geis (2000) found that residential stability is associated positively with social 
ties to neighbors.³ 

Residential stability allows the roots of bonding social capital to take hold and 
flourish over time—which may be especially important in disadvantaged areas. 
However, some scholars have also identified the potentially deleterious impact of 
residential stability in combination with higher levels of structural disadvantage 
(Anderson 1992; Jargowsky 1997). Drawing upon Wilson's (1996) research 
on urban poverty, the social isolation view suggests that residential stability 
solidifies the deleterious effects of neighborhood disadvantage because it keeps 
residents stuck in a bad situation with little hope of escape (South and Crowder 
1997; Warner and Pierce 1993). Testing this idea among Illinois residents, Ross, 
Reynolds, and Geis (2000) found that residential stability is associated with higher 
levels of distress in areas with greater economic disadvantages. Taken together, 
these ideas and prior findings help frame the “compounded isolation” view, which 
predicts that the negative association between neighborhood disadvantage and 
social support may be stronger at higher levels of residential stability. 

Alternatively, the “adaptive cohesion” view proposes that residential stability 
has a more positive effect on residents in especially disadvantaged areas. A 
higher level of stability and associated benefits could be particularly crucial in 
disadvantaged areas because of the support and bonding social capital that it 
affords in the face of deficits in more formal services (Anderson 1992; Jencks 
1992). Some findings provide indirect evidence that is consistent with those 
predictions. Smith and Jarjoura (1988) document a negative relationship 
between residential stability and violent crime, but only in neighborhoods with 
a higher level of disadvantage. They suggest that stability may enable residents 
in disadvantaged areas to reduce crime. 


GENDER- AND RACE-CONTINGENT EFFECTS 


Prior research also highlights the importance of gender and race. For example, 
women tend to provide more social support to others, especially emotional forms 
of support (Belle 1982, 1983; Burda, Vaux, and Schill 1984; Kessler and McLeod 
1984). With respect to neighborhood ties, Campbell and Lee (1992) found that 
women know more of their neighbors by name and tend to talk or visit with 
them more frequently than men do. Social role constellations and expectations 
about the moral sense of caring for others contribute to gender disparities in 
support provision (Gilligan 1982). Moreover, traditional gender segregation in 
the occupational and family domains has placed women in more nurturing 
work roles that require greater interpersonal skills, particularly nursing and 
teaching jobs (Kilbourne, England, and Beron 1994) and caregiving for their 
children and aging parents (Kessler and McLeod 1984). Because women tend to 
provide intimate and self-disclosing forms of emotional support, both women 
and men may experience more health-promoting effects through supportive 
bonds with women (Rook 1987). These socioemotional abilities, however, may 
have costs. Women are not only more exposed to the stressors of others in their 
network, but they also tend to experience greater distress associated with those 
stressors (House 1987; Turner 1994), especially if they have their own adversity 
to manage (Fischer 1982; Kessler and McLeod 1984). Therefore, it is plausible 
that women will engage in more supportive exchanges, especially in the context 
of community-level adversity. 

Scholars have also suggested that ethnic minority groups have more supportive 
social bonds in order to compensate for socioeconomic deprivation or isolation 
from mainstream institutions and formal organizations (Keith, Kim, and Schafer 
2000; Markides and Black 1996). Stack's (1974) ethnographic analysis of the black 
urban poor found that many single mothers experience a unique constellation 
of intricate and resilient kinship and friendship bonds. Blacks are more likely to 
reside in areas with greater neighborhood disadvantage, which, in turn, implies 
greater exposure to household poverty, female-headed households, and structural 
deficits in educational attainment (Jargowsky 1997; Massey and Denton 1993; 
Wilson 1996). 

Households in disadvantaged areas face stressful challenges that can reverberate 
through a network of obligatory ties (LeClere, Rogers, and Peters 1997a). The 
sources of support that facilitate coping with such challenges often include family, 
fictive kin, and friends; some scholars suggest that these informal supports are 
more prevalent among older blacks relative to their white counterparts (R.J. 
Taylor 1988). Economic disadvantages and the absence of fathers may increase 
the extent to which many black female elders provide critical sources of emotional 
and instrumental support (Dilworth-Anderson and Burton 1999; Gibson and 
Jackson 1987; Ralston 1997; S. Taylor 1982). Relative to whites, blacks tend to 
have larger extended families, interact with family members more frequently, and 
receive more support from extended family (George 1990). Johnson contends 
that black elders are “less likely to have a spouse or child as a supporter” but “can 
usually draw upon relatives, friends, and fictive kin who serve them well in their 
old age” (1995:323). When family support is deficient, black elders are more likely 
to have a caregiver outside the family (Burton et al. 1995) and a wider pool of 
potential sources of assistance (Dancy and Ralston 2002). 

Other research has specifically focused on neighborhood ties. Higher levels 
of discrimination in housing have historically constrained blacks' options for 
residential mobility, resulting in concentrated segregation (Lee, Campbell, and 
Miller 1991; Massey and Denton 1993; South and Deane 1993; White 1987) 
and increasing the likelihood that many black individuals' friends and kin also 
reside in the neighborhood or in nearby areas (Feagin 1970; Martineau 1977). 
Blacks, especially single black mothers, often have a higher level of obligatory 
extended kin within the immediate neighborhood (Hogan, Hao, and Parish 1990). 
Moreover, evidence suggests that blacks tend to interact with their neighbors 
more frequently and in a greater variety of ways and to have longer-standing, 
more intimate relationships with them (Campbell and Lee 1992; Lee, Campbell, 
and Miller 1991; Martineau 1977; Warren 1981). Studies indicate that reciprocity 
in social exchanges within neighborhood networks is particularly strong among 
people of lower socioeconomic status and blacks (Campbell and Lee 1992), 
especially black women (Stack 1974). 


THE RELEVANCE OF NEIGHBORHOOD CONDITIONS AND SOCIAL 
SUPPORT FOR ADULTS IN LATE LIFE 


There is an abundance of research about the effects of neighborhood disadvantage 
on development during childhood and adolescence—but what about the effects 
of community-level conditions on the experience of adults in late life? The very 
young and the very old may each spend more time in neighborhoods, thus 
experiencing greater exposure to ambient hazards. However, such conditions may 
be especially stressful and deleterious to the health of adults in late life (Brown 
1995; Krause 1996; LaGory and Fitzpatrick 1992; Pearlin and Skaff 1996). Many 
elders may encounter increased feelings of vulnerability because of the loss of 
control, increasing impairments, and declining health (Mirowsky 1995; Ross 
and Wu 1996). Community-level disadvantage and the proximate indicators 
of social disorder or physical decay can intensify mistrust, fear, and feelings of 
personal threat (Bazargan 1994; Covington and Taylor 1991; Ross and Mirowsky 
2001; Ross, Mirowsky, and Pribesh 2001; Shields, King and Fulks 2002). The 
discouragement of out-of-doors activities may further reduce physical activities 
and increase seclusion (Bazargan 1994; Covington and Taylor 1991). 

In later life, the loss of core social roles related to work and family life 
diminishes the sources of and opportunities for social support (Schmitz, Russell, 
and Cutrona 1997). Therefore, community-level conditions that might also 
influence the receipt and provision of social support are of critical importance 
for elders. Neighborhood disadvantage often reflects chronic structural strains 
associated with inferior or scarce shopping, medical, transportation, and leisure 
services (LeClere, Rogers, and Peters 1997b). Older residents, for whom such 
resources can be accessed only at a distance, are more likely to be penalized, feeling 
greater abandonment from mainstream social institutions and formal service 
organizations (LaGrange, Ferraro, and Supancic 1992; Rittner and Kirk 1995; 
Robert 1998; Stafford 2001). Limited mobility associated with poorer health in 
late life may amplify the dependence on local environments (Carp 1976; Lawton 
and Simon 1968; Rowles 1978) and increase the importance for elders of the 
immediate neighborhood for social interaction (Logan and Spitze 1994). Evidence 
suggests that informal, supportive bonds—especially in the neighborhood—can 
buffer against the negative psychosocial effects of neighborhood disorder (Ross 
and Jang 2000). The longer-term residence that older age often affords may 
enhance a sense of place and familiarity in late life (Fogel 1992; Rubinstein and 
Parmelee 1992) and the possibility of strong, supportive social bonds (Rosel 
1983). 


SUMMARY OF HYPOTHESES 


Based on the literature and evidence cited above, I formulate and test several 
competing and intersecting hypotheses. The social disorganization hypothesis 
predicts that neighborhood disadvantage is associated negatively with received 
and donated support. In contrast, the social mobilization hypothesis predicts that 
neighborhood disadvantage is associated positively with received and donated 
support. In addition, two hypotheses frame the analysis of residential stability as 
an effect modifier. The compounded isolation view predicts that neighborhood 
disadvantage is associated more negatively with social support at higher levels 
of residential stability. Alternatively, the adaptive cohesion view predicts that 
neighborhood disadvantage is associated more positively with social support 
at higher levels of residential stability. Moreover, I propose gender- and 
race-contingent effects such that the patterns predicted by the social mobilization 
and adaptive cohesion hypotheses are most descriptive of the experience of 
black women. 


Methods 


SAMPLE 


The data in this sample derive from in-person interviews with people aged 65 
years and older residing in the District of Columbia and two adjoining Maryland 
counties, Prince George's and Montgomery. Consistent with the aim of 
investigating status inequality and health disparities, the design called for a socially 
and economically diverse sample. The three locales subsume this diversity. 
Sample selection and recruitment began with the Medicare beneficiary files for 
the three areas. In addition to the names of all people aged 65 years and older 
who are entitled to Medicare, the files provided information about the race and 
gender of each beneficiary. The next step entailed selection from the large pool of 
potential participants. To maximize social and economic diversity, 4,800 names 
were equally divided among the three locales, blacks and whites, women and 
men, creating twelve groups, each containing 400 names. The goal was to enlist 
a sample of 1,200 people living independently, with approximately 100 in each 
of the 12 groups. Roughly 65% of all eligible respondents who were contacted 
(1,741) agreed to participate, yielding a total sample of 1,167 cases. For the present 
analysis, I exclude cases with missing responses on the dependent variable items, 
leaving 1,139 cases. Appendix A compares the age, gender, and race distributions 
of the total sample and the Census 2000 data for the three locales. 


FOCAL MEASURES 


Four items assess donated social support: “there are people you know who depend 
on you when they need help or advice”; “people count on you when they are 
down or blue”; “people seem to tell you things about themselves that they don't 
tell other people”; and “other people count on you to understand what they are 
going through.” Response choices are “strongly agree” (1), “agree” (2), “disagree” 
(3), and “strongly disagree” (4). I reverse-coded and averaged the items to create 
the index; higher scores indicate a higher level of donated support (α = .880). 
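
The scoring procedure described above can be sketched in a few lines. This is a minimal illustration, not the author's code: it assumes the reported reliability coefficient is Cronbach's alpha computed with sample variances, and the function names (`reverse_code`, `scale_score`, `cronbach_alpha`) are hypothetical.

```python
from statistics import variance


def reverse_code(score, low=1, high=4):
    """Flip a Likert response so that 1 <-> 4 and 2 <-> 3."""
    return low + high - score


def scale_score(responses, reverse_items):
    """Average one respondent's items after flipping the items whose
    positions (0-based) are listed in reverse_items."""
    adjusted = [
        reverse_code(r) if i in reverse_items else r
        for i, r in enumerate(responses)
    ]
    return sum(adjusted) / len(adjusted)


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of
    already reverse-coded item scores (assumed reliability measure)."""
    k = len(rows[0])
    item_variances = [variance([row[j] for row in rows]) for j in range(k)]
    total_variance = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# One respondent answering "strongly agree" (1) twice and "agree" (2) twice
# on the four donated-support items, all of which are reverse-coded.
donated = scale_score([1, 1, 2, 2], reverse_items={0, 1, 2, 3})
```

For the received-support index, only the positions of the items marked "reverse-coded" would be passed as `reverse_items`.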

Received social support is assessed with responses to the following items: 
“there is no one who really understands you”; “you have a friend or relative 
whose opinions you trust” (reverse-coded); “you have people around you who 
help you to keep your spirits up” (reverse-coded); “you have at least one friend 
or relative you want to be with when you feel down or discouraged” (reverse-coded); 
“you have at least one friend or relative to whom you could confide 
your deepest secrets” (reverse-coded); “you are important to people you know” 
(reverse-coded); “your well-being matters to people you know” (reverse-coded); 
“there are people who do things they know will please you” (reverse-coded); “what 
you think or feel doesn’t seem to make much difference to anyone.” Response 
choices are “strongly agree” (1), “agree” (2), “disagree” (3), and “strongly disagree” 
(4). I averaged the items such that higher scores indicate higher levels of received 
social support (α = .860). 

The index of neighborhood structural disadvantage is composed of the 
seven items from the 2000 census (see Appendix B). The items are similar to 
measures used in prior research (e.g., Land, McCall, and Cohen 1990; Sampson, 
Raudenbush, and Earls 1997). A principal components factor analysis indicates 
that items load on one construct. To create the index, I standardized and averaged 
the items; higher scores indicate greater disadvantage. In addition, to assess 
residential stability, I use a 2000 census item that measures the percentage of 
people in a tract who have resided in the same location for the past five years. 
Higher scores indicate greater residential stability. 
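
A standardize-and-average composite of this kind can be sketched as follows. The sketch is illustrative only: the two item names are placeholders (the actual seven census items are listed in Appendix B), and the use of the population standard deviation is an assumption, since the text does not say which was used.

```python
from statistics import mean, pstdev


def zscores(values):
    """Standardize a list of tract-level values to mean 0 and SD 1."""
    m = mean(values)
    s = pstdev(values)  # population SD; an assumption, not stated in the text
    return [(v - m) / s for v in values]


def composite_index(tract_items):
    """Average the standardized items for each tract.

    tract_items maps an item name to a list with one value per tract.
    """
    standardized = [zscores(column) for column in tract_items.values()]
    n_tracts = len(standardized[0])
    return [mean(col[i] for col in standardized) for i in range(n_tracts)]


# Hypothetical item names and values for three tracts.
example_items = {
    "pct_below_poverty": [0.10, 0.20, 0.30],
    "pct_unemployed": [0.05, 0.10, 0.15],
}
index = composite_index(example_items)
```

Because each item is standardized before averaging, the composite has a mean of zero by construction, which is consistent with the near-zero mean reported for neighborhood disadvantage in Table 1.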

Gender is coded 1 for women and 0 for men. Race is coded 1 for blacks and 
0 for whites. 


CONTROL MEASURES 


Age is coded in years. Married, widowed, and divorced marital statuses are each 
coded 1, with the never-married group serving as the reference category. Family 
size is a count of the respondent's total number of children, stepchildren, and 
grandchildren. 

A question about educational attainment asks respondents, “Can you tell 
me how far you went in school?” Response choices are “8th grade or less” (1), 
“some high school but did not graduate” (2), “high school graduate or GED” 
(3), “specialized (vocational) training” (4), “some college but no degree earned” 
(5), and “college graduate or more” (6). 

I also included a measure of household income: “Would you please tell me the 
number that gives the best estimate of your total household income before taxes, 
last year (2000). By total household income we mean the total salaries for all of 
the people living in your home plus all other sources of income. Other sources 
of income would include such things as money market funds, social security, 
pensions, real estate, or government entitlements.” Respondents were shown a card 
that contained the following categories: “less than $10,000,” “$10,000–$19,999,” 
“$20,000–$29,999,” “$30,000–$39,999,” “$40,000–$49,999,” “$50,000–$59,999,” 
“$60,000–$69,999,” “$70,000–$79,999,” “$80,000–$89,999,” “$90,000–$99,999,” 
and “$100,000 or more.” I imputed the mean value ($50,000-$59,999) for the 
8.65% of missing cases and included a dummy code to adjust for it in the models 
(1 = missing, 0 = not missing).⁴ 
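
Mean imputation with a missing-data indicator can be sketched as below. This is a simplified illustration with a hypothetical function name: it fills gaps with the unrounded mean of the observed category codes, whereas the text describes assigning the income category that contains the mean.

```python
def impute_with_flag(values):
    """Replace missing values (None) with the observed mean and return
    a parallel 0/1 indicator marking which cases were imputed."""
    observed = [v for v in values if v is not None]
    fill = sum(observed) / len(observed)
    imputed = [fill if v is None else v for v in values]
    missing_flag = [1 if v is None else 0 for v in values]
    return imputed, missing_flag


# Income category codes (1-11) for three hypothetical respondents,
# the second of whom did not report income.
income, income_missing = impute_with_flag([2, None, 4])
```

Entering both the imputed variable and the indicator in a regression keeps the imputed cases in the sample while letting the indicator absorb any mean difference between reporters and non-reporters.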

To measure respondents’ subjective assessment of neighborhood problems, I 
use a modified version of the Ross and Mirowsky (1999) “neighborhood disorder” 
scale. Respondents were asked the degree to which the following statements 
describe what they see and experience in their neighborhood (“the area around 
where you live”): “your neighborhood is noisy”; “there is vandalism”; “there are 
run-down houses or buildings”; “there is trash on the streets”; “people hang 
around on the streets”; “there is crime”; “there is alcohol and drug use”; “there is 
heavy traffic.” The response categories are “not at all” (1), “somewhat” (2), “quite 
a bit” (3), and “very much” (4). I averaged the items to create the index; higher 
scores indicate a greater extent of perceived neighborhood problems (α = .828). 
To assess residential tenure, one item asks respondents the number of years they 
have resided in the same residence. 

The life events index asks respondents whether or not the following events 
happened in the past year to the respondent or to someone close (i.e., friends 
or family): “a serious accident or injury,” “a serious physical or mental illness,” 
“a relative or close friend died,” “became a victim of a violent crime,” “ran into 
serious money problems,” “got into trouble with the law,” “got divorced or 
separated,” “became the victim of a robbery,” “became unemployed.” I summed 
the items to create the life events index; the range is 0 to 9. 


Five items measure economic hardship. Respondents were asked, “Thinking of 
current times, how difficult is it for you to meet the following needs?” The items 
include housing, food, transportation, medical, and clothing expenses. Response 
choices are “not at all difficult” (1), “somewhat difficult” (2), and “very difficult” 
(3). I averaged the items to create the index (α = .812). 

The rationale for including these particular control variables relates to findings 
from an array of studies that suggest, either directly or indirectly, that they may 
be associated with social support, neighborhood disadvantage, residential stability, 
and gender or race (e.g., Kasarda and Janowitz 1974; Pearlin and Skaff 1996; Ross, 
Reynolds, and Geis 2000; Thoits 1995; Turner and Marino 1994). Table 1 reports 
descriptive statistics for all of the variables in the analysis. 


ANALYTICAL STRATEGY 


In the first model, I regress the dependent variable (either donated or received 
support) on neighborhood disadvantage, residential stability, neighborhood 
disadvantage x residential stability, gender, race, and an initial set of basic control 
variables (age, marital status, and family size). This step allows the examination 
of a base model and tests residential stability as an effect modifier. In the second 
model, I test for gender- and race-contingent effects by multiplying gender and 
race by neighborhood disadvantage and include those terms in the model. I also 
examine possible three-way interactions between neighborhood disadvantage, 
gender, and race. However, as the next section describes in detail, I report results 
only for the statistically significant three-way interaction observed with respect 
to received social support. In the third model, I adjust for individual-level 
socioeconomic status, economic hardship, perceived neighborhood problems, 
life events, and residential tenure to examine if any observed community-level 
effects remain net of individual-level characteristics (i.e., individual conditions 
as mediators). 

I use ordinary least squares (OLS) regression techniques. Hierarchical linear 
model techniques tend to be used to analyze multilevel data. In some instances 
of multilevel analyses, traditional OLS regression methods may yield somewhat 
less efficient estimates than hierarchical linear models. However, Robert asserts 
that such techniques “were designed for data sets that have both within-group 
and between-group variation, requiring many cases within each of many groups” 
(1998:24). In my study, the majority of census tracts have fewer than 5 cases, 
7 tracts contain more than 10 cases, and only 1 tract has more than 20 cases. 
According to Duncan, Connell, and Klebanov (1997), using census data as 
indicators of community-level socioeconomic conditions, such as neighborhood 
disadvantage, should not raise too much concern about inefficiency. At most, 
the serial correlations produced by clustering cases in the census tracts can be 
handled with adjustments that yield more robust standard errors. Standard 
OLS regression assumes that observations are uncorrelated. However, because 
some census tracts contain more than one case, I utilize a clustering technique 
(in Stata) that relaxes the assumption that observations are independent and 
allows observations to be correlated within census tracts. Estimates reported in 
Tables 2 and 3 are reweighted by the variance-covariance matrix, which allows 
for correlations across observations within census tracts. 


Table 1. Descriptive Statistics for All Variables 

                                     Mean or Proportion   Std. Dev.   Range 
Focal variables 
  Donated social support                    3.188            .487     1–4 
  Received social support                   3.264            .415     1–4 
  Neighborhood disadvantage                  .001            .831     −1.508 to 3.974 
  Residential stability                      .549            .124     .156 to .825 
  Gender (female = 1)                        .503             –       0–1 
  Race (black = 1)                           .499             –       0–1 
Control variables 
  Age                                      75.586           6.502     65–101 
  Marital statusᵃ 
    Married                                  .530             –       0–1 
    Widowed                                  .290             –       0–1 
    Divorced                                 .117             –       0–1 
  Family size                               7.464           7.302     0–76 
  Years of education                        4.410           1.705     1–6 
  Household income                          5.689           3.136     1–11 
  Perceived neighborhood problems           1.366            .432     1–4 
  Residential tenure                       24.749          14.775     1–90 
  Life events                               1.535           1.326     0–7 
  Economic hardship                         1.102            .263     1–3 
(N = 1,139) 

ᵃ The reference category is never-married respondents. 
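
The estimation approach described in this section (OLS with a disadvantage x stability product term and standard errors that allow for correlation within census tracts) can be sketched with NumPy. This is a hedged illustration rather than the author's Stata code: the function names are hypothetical, mean-centering residential stability before forming the product is an assumption, and the small-sample correction shown is the one commonly applied to cluster-robust variance estimates.

```python
import numpy as np


def design_matrix(disadvantage, stability):
    """Columns: intercept, disadvantage, centered stability, product term."""
    disadvantage = np.asarray(disadvantage, dtype=float)
    stability = np.asarray(stability, dtype=float)
    stability_c = stability - stability.mean()  # centering is an assumption
    return np.column_stack([
        np.ones_like(disadvantage),
        disadvantage,
        stability_c,
        disadvantage * stability_c,
    ])


def ols_cluster_robust(X, y, clusters):
    """OLS coefficients with cluster-robust (sandwich) standard errors."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    clusters = np.asarray(clusters)
    n, k = X.shape

    xtx_inv = np.linalg.inv(X.T @ X)
    beta = xtx_inv @ X.T @ y
    resid = y - X @ beta

    # Sum the outer products of the within-cluster score vectors.
    meat = np.zeros((k, k))
    groups = np.unique(clusters)
    for g in groups:
        in_group = clusters == g
        score = X[in_group].T @ resid[in_group]
        meat += np.outer(score, score)

    # Finite-sample correction: G/(G-1) * (N-1)/(N-K).
    n_groups = len(groups)
    correction = (n_groups / (n_groups - 1)) * ((n - 1) / (n - k))
    cov = correction * xtx_inv @ meat @ xtx_inv
    return beta, np.sqrt(np.diag(cov))
```

When every observation forms its own cluster, the estimator reduces to the familiar heteroskedasticity-robust (HC1) standard errors, which is a convenient check on the implementation.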


Results 


DONATED SOCIAL SUPPORT 


In model 1 of Table 2, the positive and significant neighborhood disadvantage x 
residential stability coefficient (b = .341) indicates that the association between 
neighborhood disadvantage and donated support is positive, but only at higher 
levels of residential stability. In addition, women and blacks report significantly 
higher average levels of donated support. As shown in model 2, there are 
significant gender- and race-contingent effects of neighborhood disadvantage. The 
positive association between neighborhood disadvantage and donated support is
significantly stronger among women and blacks. Simultaneously, neighborhood
disadvantage × residential stability remains significant net of the gender- and
race-contingent effects. Controlling for individual-level socioeconomic status,
economic hardship, subjective assessments of neighborhood problems, life
events, and residential tenure has little effect on the neighborhood disadvantage
× residential stability coefficient, reducing it slightly from .328 to .316 (see model
3). The coefficients for gender- and race-contingent effects of neighborhood
disadvantage also decrease slightly, although they both remain statistically
significant.

Table 2. Donated Support Regressed on Neighborhood Disadvantage,
Residential Stability, Disadvantage × Stability, and Gender- and
Race-Contingent Effects

                                                    (1)                  (2)                  (3)

Focal neighborhood characteristics
Neighborhood disadvantage                           -.011 (.021)         -.134*** (.041)      -.056 (.043)
Residential stability                               [illegible] (.119)   .171 (.119)          .164 (.125)
Neighborhood disadvantage × residential stability   .341* (.147)         .328* (.149)         .316* (.145)

Gender- and race-contingent effects
Gender (female = 1)                                 .109*** (.030)       .087** (.029)        .108*** (.029)
Race (black = 1)                                    [illegible] (.032)   .120*** (.033)       [illegible] (.034)
Neighborhood disadvantage × gender                                       .096** (.036)        .075* (.035)
Neighborhood disadvantage × race                                         .104* (.044)         .089* (.043)

Controls
Age                                                 -.011*** (.002)      -.011*** (.002)      -.010*** (.002)
Marriedᵃ                                            .076 (.060)          .068 (.060)          -.002 (.063)
Widowedᵃ                                            .039 (.058)          .031 (.058)          -.007 (.060)
Divorcedᵃ                                           .038 (.076)          .034 (.075)          .010 (.075)
Family size                                         .003 (.001)          .003 (.001)          .004* (.001)
Years of education                                                                            .011 (.010)
Household income                                                                              .018** (.006)
Perceived neighborhood problems                                                               -.076* (.035)
Residential tenure                                                                            .001 (.001)
Life events                                                                                   .034** (.011)
Economic hardship                                                                             -.069 (.057)

Constant                                            3.042                3.015                3.050
R²                                                  .052                 .063                 .094

Note: Unstandardized regression coefficients are presented, with robust standard errors in parentheses.
ᵃ The reference category is never-married respondents.
* p < .05  ** p < .01  *** p < .001 (2-tailed tests)
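
As a reading aid for Table 2, the significance stars can be checked approximately by dividing a coefficient by its robust standard error. The sketch below does so for a few model 3 entries. The function name and the large-sample normal critical values (1.960, 2.576, 3.291) are assumptions of this illustration rather than the procedure used in the study, and because the printed values are rounded to three decimals the ratios are only approximate.

```python
# Illustrative check of significance stars from coefficient / robust SE.
# Assumption: two-tailed tests with large-sample normal critical values.

def stars(coef: float, se: float) -> str:
    """Return '', '*', '**', or '***' based on the absolute z-ratio."""
    z = abs(coef / se)
    if z >= 3.291:   # p < .001
        return "***"
    if z >= 2.576:   # p < .01
        return "**"
    if z >= 1.960:   # p < .05
        return "*"
    return ""

# Selected model 3 entries from Table 2: (coefficient, robust SE).
model3 = {
    "disadvantage x stability": (0.316, 0.145),
    "disadvantage x gender": (0.075, 0.035),
    "disadvantage x race": (0.089, 0.043),
    "neighborhood disadvantage": (-0.056, 0.043),
}

for name, (b, se) in model3.items():
    print(f"{name}: z = {b / se:.2f} {stars(b, se)}")
```

Under these assumptions the three interaction terms clear the .05 threshold and the main neighborhood disadvantage coefficient does not, which matches the stars printed for model 3.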

To better illustrate the patterns represented in model 3, I plot the regression 
predictions. All variables are centered (actual score minus mean score) so their 
means equal 0; I also solve the equation for married individuals. Solving the 
equation by substituting other values of the control variables will change the 
intercept only slightly but will not influence the size of the slopes. As shown 
in panel A of Figure 1, at high levels of residential stability, there is a positive 
association between neighborhood disadvantage and donated support among 
black women only. Predicted slopes are more positive among black men and white 
women relative to white men. In contrast, panel B indicates that at low levels of 
residential stability, the slope representing the association between neighborhood 
disadvantage and donated support, which was positive in panel A, becomes flat 
among black women. However, among white men, the slope becomes negative. 
It is also noteworthy that the slope reflecting the negative association between 
neighborhood disadvantage and donated support among white men would be 
even more negative (and significant) if I had plotted the predictions in model 
2. However, controlling for individual-level characteristics reduces the negative 
neighborhood disadvantage coefficient from -.134 (model 2) to -.056 (model 3).
Altogether, the observed patterns, especially among black women in areas with
higher levels of residential stability, are consistent with the social mobilization 
and adaptive cohesion views. In contrast, the patterns among white men who 
reside in areas with lower levels of residential stability are more consistent with 
the social disorganization and compounded isolation views. 
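
The group-specific slopes described above follow from the interaction terms in model 3 of Table 2. The sketch below is a minimal illustration, not the author's plotting code. It assumes that gender and race enter as 0/1 dummies, so that the main coefficient is the slope for white men at mean stability, as the text reads it, and that `stability_c` is residential stability centered at its mean in whatever units the regression used, which this section does not report. The function and constant names are hypothetical.

```python
# Conditional slope of neighborhood disadvantage on donated support,
# built from the model 3 coefficients in Table 2.
# Assumptions: female and black are 0/1 dummies; stability_c is mean-centered
# residential stability in the (unreported) units of the regression.

B_DISADVANTAGE = -0.056      # neighborhood disadvantage
B_DIS_X_STABILITY = 0.316    # disadvantage x residential stability
B_DIS_X_GENDER = 0.075       # disadvantage x gender (female = 1)
B_DIS_X_RACE = 0.089         # disadvantage x race (black = 1)

def disadvantage_slope(stability_c: float, female: int, black: int) -> float:
    """Slope of donated support on disadvantage for a given group and stability."""
    return (B_DISADVANTAGE
            + B_DIS_X_STABILITY * stability_c
            + B_DIS_X_GENDER * female
            + B_DIS_X_RACE * black)

# Slopes at mean stability (stability_c = 0) for the four groups.
groups = [("white men", 0, 0), ("white women", 1, 0),
          ("black men", 0, 1), ("black women", 1, 1)]
for label, female, black in groups:
    print(label, round(disadvantage_slope(0.0, female, black), 3))
```

At mean stability this gives -.056 for white men and .108 for black women. Raising `stability_c` shifts every group's slope upward by .316 per unit, which is why the positive slope for black women appears at high stability and flattens at low stability.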

Other peripheral findings deserve brief mention. In all three models, age is 
associated negatively with donated support, a pattern consistent with the idea of 
increased social isolation in late life. In addition, a positive effect of family size 
on donated support emerges only after adjusting for household income. Family 
size is correlated negatively with income, which is associated positively with 
donated support. Those patterns account for the suppression effect of family 
size. Likewise, there is a negative association between perceived neighborhood 
problems and donated support, and a positive association between life events 
and donated support. None of the other control variables are associated with 
donated support.

Figure 1. Neighborhood Disadvantage and Donated Social Support by
Gender and Race
Panel B. Low Residential Stability

Note: Lines are based on the equation in model 3 of Table 2. High residential stability = 69.7 percent
(ninetieth percentile); low residential stability = 36.4 percent (tenth percentile).


RECEIVED SOCIAL SUPPORT


In model 1 of Table 3, the positive and significant neighborhood disadvantage ×
residential stability coefficient (b = .414) indicates that the association between
neighborhood disadvantage and received support is positive, but only at higher
levels of residential stability. Conversely, the negative neighborhood disadvantage
coefficient (b = -.048) indicates that neighborhood disadvantage is associated
negatively with received support when the value of residential stability is at or 
below the mean. Gender and race do not have main associations with received 
support. 
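
The two model 1 coefficients just cited imply a level of residential stability at which the disadvantage slope changes sign. The sketch below works through that arithmetic. It assumes that stability is mean-centered and measured in the units used in the regression, which are not reported in this section; the names are illustrative.

```python
# Model 1 of Table 3: slope of received support on neighborhood disadvantage
# as a function of mean-centered residential stability (units assumed).

B_DISADVANTAGE = -0.048      # slope at mean stability
B_DIS_X_STABILITY = 0.414    # change in that slope per unit of stability

def disadvantage_slope(stability_c: float) -> float:
    return B_DISADVANTAGE + B_DIS_X_STABILITY * stability_c

# Stability value (relative to the mean) at which the slope equals zero.
crossover = -B_DISADVANTAGE / B_DIS_X_STABILITY

print("slope at mean stability:", disadvantage_slope(0.0))
print("slope is zero at stability_c =", round(crossover, 3))
```

The slope is negative at and below the mean and turns positive only once stability exceeds the mean by roughly .116 in those units.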

The patterns become more complex in model 2. Specifically, the positive
coefficient that represents the neighborhood disadvantage × gender × race
interaction (b = .186) indicates that the association between neighborhood
disadvantage and received support depends jointly on gender and race. That
is, the positive association between neighborhood disadvantage and received
support is strongest among black women and is significantly different from
the effect observed for white women. Moreover, adjustments for individual-level
socioeconomic status, economic hardship, subjective assessments of
neighborhood problems, life events, and residential tenure have little effect on the
neighborhood disadvantage × residential stability coefficient, reducing it slightly
from .389 to .386 (see model 3). Likewise, the coefficient representing the three-way
interaction decreases slightly from .186 to .168, although it also remains
statistically significant. In sum, individual-level conditions do not substantially
mediate community-level effects.

Panel A of Figure 2 shows that at high levels of residential stability there is a
positive association between neighborhood disadvantage and received support
among black women only, and the lines representing black and white women
diverge significantly. Conversely, the predictions for black and white men are
almost identical. In contrast, panel B shows that at low levels of residential stability 
the association between neighborhood disadvantage and received support flattens 
among black women. However, among white women, the slope is negative. It is 
also noteworthy that the slope reflecting the association between neighborhood 
disadvantage and received support among white women would be even more 
negative (and marginally significant, p=.055) if I had plotted the predictions 
in model 2. Controlling for individual-level characteristics reduces the negative 
neighborhood disadvantage coefficient from -.086 to .003. In sum, the observed
patterns among black women are consistent with the social mobilization and 
adaptive cohesion views. Conversely, the predictions for white women concur 
with the social disorganization and compounded isolation hypotheses. 

Table 3. Received Support Regressed on Neighborhood Disadvantage,
Residential Stability, Disadvantage × Stability, and Gender- and
Race-Contingent Effects

                                                    (1)

Focal neighborhood characteristics
Neighborhood disadvantage                           -.048** (.018)
Residential stability                               -.050 (.104)
Neighborhood disadvantage × residential stability   .414** (.146)

Gender- and race-contingent effects
Gender (female = 1)                                 .048 (.028)
Race (black = 1)                                    -.003 (.028)
Neighborhood disadvantage × gender
Neighborhood disadvantage × race
Gender × race
Neighborhood disadvantage × gender × race

Controls
Age                                                 -.006** (.002)
Marriedᵃ                                            .155** (.052)
Widowedᵃ                                            .141** (.052)
Divorcedᵃ                                           .098 (.062)
Family size                                         .002 (.001)
Years of education
Household income
Perceived neighborhood problems
Residential tenure
Life events
Economic hardship

Constant                                            3.110
R²                                                  .038

[Columns (2) and (3) were not recovered, apart from the values 3.116 and .054, which follow
the model 1 column in the Constant and R² positions.]

Note: Unstandardized regression coefficients are presented, with robust standard errors in parentheses.
ᵃ The reference category is never-married respondents.
* p < .05  ** p < .01  *** p < .001 (2-tailed tests)

Figure 2. Neighborhood Disadvantage and Received Social Support by
Gender and Race
Panel B. Low Residential Stability

Note: Lines are based on the equation in model 3 of Table 3. High residential stability = 69.7 percent
(ninetieth percentile); low residential stability = 36.4 percent (tenth percentile).

Among peripheral findings, age is associated negatively with received support.
In addition, compared to never-married respondents, married and widowed elders
tend to report a higher level of received support. Family size, education, and
income are associated positively with received support. Several interrelationships
among the controls are also notable. Controlling for education and income
reduces the gap between never-married elders and married and widowed
individuals. Moreover, comparing models 2 and 3, a positive association between
family size and received support emerges only after adjusting for education and
income. Family size is associated negatively with education and income; those
patterns contribute to its suppressed association with received support.
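
The suppression pattern described for family size can be illustrated with a small synthetic example. The numbers below are invented solely to show the mechanism and are not study data: family size is negatively related to income, income raises support, and the positive family-size slope appears only once income is held constant. All variable names are hypothetical.

```python
# Synthetic illustration of a suppression effect (made-up numbers, not study data).
family = [1, 2, 3, 4, 5]
income = [6, 4, 4, 1, 2]   # tends to fall as family size rises
# Support is constructed with a positive family-size effect (1.1) and a
# positive income effect (1.0).
support = [1.1 * f + 1.0 * i for f, i in zip(family, income)]

def centered(values):
    """Subtract the mean from each value."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]

def dot(xs, ys):
    """Sum of element-wise products."""
    return sum(x * y for x, y in zip(xs, ys))

f, i, y = centered(family), centered(income), centered(support)

# Bivariate OLS slope: support regressed on family size alone.
slope_alone = dot(f, y) / dot(f, f)

# Two-predictor OLS slopes from the normal equations.
det = dot(f, f) * dot(i, i) - dot(f, i) ** 2
slope_family = (dot(i, i) * dot(f, y) - dot(f, i) * dot(i, y)) / det
slope_income = (dot(f, f) * dot(i, y) - dot(f, i) * dot(f, y)) / det

print("family size alone:", round(slope_alone, 3))
print("family size, income held constant:", round(slope_family, 3))
print("income, family size held constant:", round(slope_income, 3))
```

In this constructed case the family-size slope is essentially zero when income is omitted, because the negative family-income relationship offsets it, and it recovers its positive value once income enters the equation.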


Discussion 


Social support is not uniformly distributed in society but rather is patterned 
differently across social strata and community contexts. Social disorganization 
and isolation views propose that neighborhood disadvantage should erode the 
presence and quality of social bonds. The findings challenge the uniformity of 
those claims: Neighborhood disadvantage is associated positively with donated and 
received support—but only among black women who reside in areas with greater 
residential stability. To some extent, those patterns concur with the observations of 
Stack (1974) and others about the structure and processes of support among black 
families, especially among black women. However, to my knowledge, this is the 
first systematic documentation of a positive association between neighborhood 
disadvantage and social support. Moreover, the community-level effects hold 
after controlling for black women's individual-level social and economic 
circumstances, perceived neighborhood problems, and residential tenure. The 
persistent neighborhood effects concur with studies that show poverty's effect 
on teenage sexual behavior and educational outcomes, net of individual-level 
conditions (Brooks-Gunn et al. 1993). 

The observations contribute to the general sociological interest in the 
determinants of social capital and the more specific interest of scholars of mental 
health in the distribution of social support—a core resource in the stress process 
model (Pearlin et al. 1981). According to stress process theory, neighborhood 
disadvantage reflects a chronic, ambient hazard that can threaten mental and 
physical health (Aneshensel and Sucoff 1996; Pearlin 1999). In addition, the 
specification of the differential impact of stressful community-level conditions 
across race and gender elaborates on the structural determinants of social support 
(House, Umberson, and Landis 1988). Some observations, however, are more 
consistent with the social mobilization thesis than others. Black women residing 
in conditions of chronic structural strain may develop a higher level of bonding 
social capital (Putnam 2000). Stressors associated with disadvantaged conditions 
may mobilize social resources that help black women and members of their 
proximate network “get by” and survive. That is especially crucial in areas that 
lack access to more formal services. 

The processes described above involve residential stability as an effect modifier 
of neighborhood disadvantage. Wilson’s (1987, 1996) identification of the social
isolation perspective contends that residential stability and high rates of poverty
combine to have the most deleterious impact on individuals. One theme in that 
argument focuses on poor urban blacks' relative isolation from mainstream 
social institutions and formal organizations. In contrast, however, there are 
numerous accounts of the unique constellation of supportive bonds among 
female kin and friends, including neighbors, in black and low-income families 
(Belle 1982; McAdoo 1980; Stack 1974). Although the social capital inherent in 
those informal, supportive, and local bonds may or may not have relevance for 
access to mainstream institutions, it may enhance proximate forms of solidarity, 
foster coping with stress, and ultimately increase survival chances (Burton et al. 
1995; Dilworth-Anderson and Burton 1999; Gibson 1982; Stack 1974). 

These patterns raise issues that are relevant for the social distribution of 
mental health. In particular, some elements of Wheaton's (1985) identification 
of the suppressor model are applicable here. If social support is highest among 
black women who reside in disadvantaged neighborhoods, then we might 
also suspect that their mental health status would be worse were it not for the 
generally protective benefits of support. Simultaneously, however, some forms 
of support provision may exact a toll on mental and physical functioning— 
ideas consistent with the “cost of caring” thesis (Kessler and McLeod 1984).
Involvement in others' troubles may have detrimental effects on one's own well-being
(Aneshensel, Pearlin, and Schuler 1993; Sarason, Pierce, and Sarason 1990),
especially when one is dealing with personally adverse conditions or has fewer 
psychosocial resources (Rook 1984; Schieman and Turner 2001; Turner 1994). 
In the context of community-level strains, supportive exchanges may exacerbate 
the psychosocial effects of one's own personal experience of stress (Belle 1982, 
1991). The provision of support, for instance, may be more strongly associated 
with distress for individuals in disadvantaged neighborhoods—and perhaps 
more so for black women who possess fewer socioeconomic resources. Future 
inquiry might test for potential competing effects of social mobilization and 
cost of caring processes in community context. Moreover, research might assess 
variations in the costs of caring across important social statuses such as race and 
gender, especially among elders. The gender, race, SES, and age composition of 
social networks could influence the nature of received and donated support and 
their impact on stress proliferation. 


SOCIAL SUPPORT AND COLLECTIVE EFFICACY 


The observations might also have relevance for inquiry into the origins, form, and 
consequences of “collective efficacy.” However, the links among bonding social 
capital, social support, and collective efficacy deserve more refined theoretical and 
empirical development. Identifying collective efficacy’s link to social control and 
decrements in community violence, Sampson, Raudenbush, and Earls contend 
that “the willingness of local residents to intervene for the common good depends
in large part on conditions of mutual trust and solidarity among neighbors”
(1997:919). Those researchers delineate two dimensions of collective efficacy:
“informal social control” and “social cohesion and trust.” The latter dimension
comprises several items that, in my view, are similar to social support:
“people around here are willing to help their neighbors,” “this is a close-knit 
neighborhood,” and “people in this neighborhood can be trusted.” Responses 
to those items explicitly reference the general neighborhood and not individuals’ 
personal experience of being loved, valued, cared for, and able to count on 
neighbors. Bonding social capital and support processes contain aspects of trust, 
reliability, and obligation (Putnam 2000; Turner and Turner 1999). Bonding social 
capital links people to friends and relatives—some of whom may also happen 
to be defined as “neighbors.” 

Future research might consider the extent to which survey responses to 
questions about social cohesion and trust in a neighborhood contain traces 
of individuals’ perceptions about the specific support that they get or give to 
neighbors. It seems plausible that neighbors whom one can trust, depend on, 
care for, and respect are also likely to be people one would classify as friends; 
many might also be kin. At some threshold, the line between “neighbors” and 
“friends” probably blurs and one’s social relations with the latter emerge as more 
salient and consequential for health and well-being. Elements of intimate social 
ties that represent friendships function to promote the common good of the 
dyadic bond—and also just happen to be contained within the geographic space 
of the neighborhood. These are the same interpersonal ties that may coalesce to 
form the collective mechanisms of social cohesion and control that intervene on 
behalf of the common good of the community. Simply put, individuals and their 
friendships in the neighborhood are primary driving forces for the common good 
because the proximity and density of socially supportive friendships and kin, by 
their very nature, reflect social solidarity. 

Several items in Sampson, Raudenbush, and Earls’s (1997) measure of 
collective efficacy also imply the lack of trust and interpersonal conflict among 
neighbors. These items include “people in this neighborhood generally don’t 
get along with each other” and “people in this neighborhood do not share the 
same values” (items are reverse-coded). These items appear to assess a more 
general impression of conflict in the neighborhood social climate rather than 
the extent to which an individual mistrusts neighbors or has specific conflicts 
with others. In some respects, those two items mirror elements of Ross and 
Mirowsky’s (1999) operationalization of perceived “social disorder and order” 
in the neighborhood with items such as “I’m always having trouble with my 
neighbors” and “in my neighborhood, people watch out for each other.” It may 
be important to distinguish how perceptions of interpersonal conflict
as a general property of the neighborhood differ from the individual’s
own personal experience of conflict with neighbors. Moreover, the stress process
perspective encourages greater clarity in the distinctions between assessments of
general interpersonal conflict as a stressful neighborhood condition, alongside
structural conditions such as the level of poverty, versus the general level of 
interpersonal trust, mutual obligation, and solidarity as indicators of socially 
supportive relations or bonding social capital and its role as a precursor to 
collective efficacy. 


AGE, THE LIFE COURSE, AND NEIGHBORHOOD EFFECTS 


Given that the observations apply to adults aged 65 and older, speculation 
about the role of aging and the life course in the social impact of neighborhood 
effects is warranted. A burgeoning literature exists with respect to neighborhood 
effects on the functioning of children. Some scholarship accentuates possible 
“social mechanisms,” such as informal social control, that link the quality of 
community life to children’s well-being (Sampson, Morenoff, and Earls 1999). 
One question remains unanswered: What role do elders play in informal social 
control processes? 

Adults in late life may spend more time in neighborhoods, especially if they 
are no longer attached to the work role. In addition, researchers have observed age 
patterns in social bonds. For example, in a representative survey of Illinois
adults, Geis and Ross (1998) found that older adults tend to report more frequent
informal visits and chats with neighbors, as well as more frequent instrumental 
assistance (i.e., lending tools, giving rides, or watching over houses). Likewise, in 
that same survey, Ross, Mirowsky, and Pribesh (2001) found that age is associated 
negatively with mistrust (“felt it was not safe to trust anyone, suspicious, felt sure 
everyone was against you”). Campbell and Lee (1992) found in a survey of
adults in Nashville that age is associated positively with number of neighbors
known by name, length of relationship in years, and feelings of closeness to 
neighbors. In addition, Sampson, Raudenbush, and Earls’s (1997) survey of 
Chicago residents documents a positive association between age and collective 
efficacy. Altogether, these elements likely contain a blend of informational,
instrumental, and emotional support—and those features may foster informal 
social control in the community. 

Most discussions about neighborhood-related social isolation processes focus 
on younger adults and are more relevant for Putnam’s (2000) concept of “bridging 
social capital” (i.e., social ties that help one get ahead). For example, younger
residents in disadvantaged neighborhoods may have fewer informational sources 
of support that foster social network connections to better work opportunities. 
Tigges, Browne, and Green contend that "neighborhoods influence the availability 
of information necessary for success in mainstream society" (1998:56). However, 
bridging social capital (vs. bonding social capital) seems, on balance, less relevant 
in the lives of elders. In particular, elders of all races and social class backgrounds 
may experience declines in physical mobility that constrain social contact and 
elevate the risk of social isolation. The notion that concentrated disadvantage
weakens social bonds highlights additional threats to social well-being. As the
social isolation view implies, elder residents in disadvantaged neighborhoods
may be especially detached from mainstream society because of constriction of
contacts with friends or relatives in other areas inside and outside the community
(Tigges, Browne, and Green 1998; Wilson 1987). Therefore, social relations that 
help elders get by, emotionally and otherwise, are especially critical in late life, 
particularly as individuals lose family members and friends through death and 
experience the physical constraints of impairments of all types (Hogan and 
Eggebeen 1995). 

The patterning of these processes across the life course deserves additional 
study. Social support and disadvantaged conditions probably mean different 
things to people at different points of the life course. Among younger adults, 
disadvantaged economic conditions may take a greater toll on psychosocial 
resources and create more distress because they impede efforts to establish 
economic security for themselves and their families. Older adults, however, may 
be more sensitive to the lack of social cohesion in an area because it poses a 
greater threat to their sense of self, social relationships, and well-being, especially 
if they are encountering increasing physical impairment. 

Age is also an important consideration with respect to residential stability. 
Older adults may feel resigned to higher levels of concentrated disadvantage if 
they have resided for many years in the same place and have established
a life where they are. A sense of resilience may also have accumulated
over time spent in a disadvantaged area. Warner and Pierce contend that
neighborhoods with high disadvantage and residential stability reflect the "last 
resort, where people remain, not because they choose to, but because they have 
no other options" (1993:494). It seems plausible that many individuals under such 
circumstances adapt and adjust, especially adults who survive into late life.


LIMITATIONS 


Potential limitations of this study include the generalizability of the sample, 
selection bias, causal order, and measurement issues. The generalizability of the 
observations is limited. Although the overall gender, age, and racial composition
of the sample of adults over age 65 in the three locales is similar to the census
estimates (see Appendix A), the sample is not necessarily representative of elders
in those areas or other locations. That
limitation, however, does not diminish the internal validity of the patterns. It 
is plausible that members of the sample are somewhat healthier and wealthier 
than others. The extent to which that possibility biases the estimates among the 
focal associations remains unknown. To some degree, the sample may reflect 
what some have called the "successfully aged" (Rowe and Kahn 1998). The 
greater the sample-to-sample variations in economic circumstances, health, 
and social integration, the more these estimates are biased. Individuals who 
reside in neighborhoods with higher levels of structural disadvantage are likely
to possess fewer economic resources. Residents in neighborhoods with fewer
socioeconomic resources are probably less likely to survive into late life or to 
reside in the community. That may be especially true among elder black men. 
Statistical adjustments for individual-level education, income, economic hardship, 
and perceived neighborhood problems may help assuage concern about bias. 

At the other end of the spectrum, residents who have survived in disadvantaged 
areas over longer stretches of time may become somewhat more acclimatized to 
community-level adversity and therefore experience it as less stressful. In most 
instances, undesirable conditions increase the motivation to leave disadvantaged 
neighborhoods. Time provides opportunity for residential mobility. Individuals 
with sufficient economic resources can relocate; the less fortunate have fewer 
choices (Fischer 1977; R.B. Taylor 1996). Moreover, moving may sever social 
ties with neighbors. Conversely, interpersonal conflict and deficient or weak ties 
with neighbors may encourage relocation. Also, structural disadvantage limits 
residential mobility, but general stability may enhance the development of 
stronger social ties. Collectively, these points underscore possible caveats about 
selection and the causal ordering of the focal associations described here. 

The measurement of received and donated social support is also somewhat 
limited because the bulk of the items appear to be tapping emotional support. 
Some wording is suggestive of instrumental or informational support, particularly 
in the donated support item: “there are people you know who depend on you 
when they need help or advice.” The assessment of instrumental or informational 
support as an additional dependent variable might reveal somewhat different 
patterns. Moreover, those forms of support may also be more relevant and 
consequential for the quality of life among elders, especially those residing in 
more disadvantaged neighborhoods. Those limitations, however, do not diminish 
the importance of perceived support that is primarily socioemotional in nature. 
Given the themes of social isolation and abandonment that have been identified 
as common issues for adults in late life, feeling valued, cared for, and a sense that 
others think you matter (and do things to show you that you do matter) might 
be the most important buffer against the health effects of neighborhood and 
personal disadvantage. 


Conclusion 


There is a well-established ecological perspective on crime and psychopathology. 
There is also a growing sense of an ecological perspective on social relationships 
and the “capital” that is derived from them. Most prior research demonstrates 
that community-level disadvantage threatens social organization, weakens links 
to mainstream institutions, erodes network interactions and attachments to 
community life, and undermines social control at the local level. Research has 
established the impact of neighborhood conditions on criminal activity. Given the
bulk of prior evidence about the negative effects of neighborhood disadvantage
on social organization, it is plausible that some forms of social capital might 
suffer as well. However, the results presented here suggest otherwise—especially 
among black women. 

House's (1987) call for studies of the social-structural determinants of 
social support and their potential status-contingent effects has inspired the 
present study. By documenting the differential association of neighborhood 
structural disadvantage by race and gender, as well as the role of residential 
stability as an effect modifier, this study takes small strides toward addressing 
those gaps. However, much more remains unknown about the specific forms 
and consequences of different types of social support processes under adverse 
community conditions and the ways that such effects may vary across the life 
course. 


Notes 


1. Some researchers have drawn attention to the importance of “anticipated support,” or 
the sense that significant others are available and willing to help in the future (Krause 2001; 
Wethington and Kessler 1986). Evidence suggests that anticipated support diminishes the 
deleterious impact of stress and may be a more effective coping resource than actual support 
(Krause 1997a, 1997b). However, the sense of “future” support is likely contingent upon
current and past actual or perceived levels of received support. 


2. Researchers have typically measured neighborhood disadvantage by such indicators 
as the percentages of households below poverty, households receiving public assistance, 
single-parent female-headed households, and overcrowded living conditions (Robert 1998; 
Sampson, Morenoff, and Gannon-Rowley 2002; Sampson and Raudenbush 1999). Moreover, 
prior research has shown that neighborhood disadvantage diminishes physical and mental 
well-being, although the effects are often through individual-level socioeconomic conditions 
or subjective assessments of neighborhood disorder (Aneshensel and Sucoff 1996; Geis and 
Ross 1998; Robert 1998; Ross 2000; Ross and Mirowsky 2001; Ross, Mirowsky, and Pribesh 
2001; Ross, Reynolds, and Geis 2000; but see Browning and Cagney 2003). 


3. Sampson (1988) defines residential stability as “the percentage of residents brought up 
in the area within a 15-minute walk of home” (769). Local friendship ties are defined as 
“the percentage of community residents who reported half or more of their friends living 
within a 15-minute walk of home.” However, respondents who report having only one or 
two friends would score low on that measure—even if those friends are intimate confidants. 
Aneshensel and Sucoff (1996) measured perceived social cohesion with three items: “kids 
know one another,” “adults know one another,” and “people are friendly.” Ross, Reynolds, 
and Geis (2000) measured informal social ties with neighbors as the frequency of visiting 
informally with neighbors, chatting with neighbors, and the extent to which “you and your
neighbors” help each other out.


4. I included the dummy code for missing income (= 1) in the regression analyses along with 
income. However, it was not significant in those models. Although it remains in the final 
models reported here, I exclude those coefficients from the tables for the sake of space. 


5. In a series of separate analyses (not shown), I assessed the impact of an array of other
control variables, including living alone versus living with others, number of people in the
household, home ownership, caregiver status, taking care of children, and the number of 
chronic health conditions. Theoretically speaking, there are suggestions in the literature 
that each of these variables could be associated with the focal independent or dependent 
variables. However, there was little evidence that any of them influenced the overall patterns 
of association documented in Tables 2 and 3. Therefore, I decided to exclude them from the 
models reported here (results available upon request). 


6. In some respects, the items inquire about one's general impressions of the overall social 
climate of the neighborhood. Transference of individual experience in such a way that it 
becomes a description of a property of the neighborhood may be problematic. For instance, 
an individual might have friendly personal relationships with one's own neighbors—but those 
do not necessarily represent the overall neighborhood social climate or perceptions of the 
social relations of neighbors in general. 


Appendix A. Age, Gender, and Racial Comparisons between the Study
Sample and the 2000 Census Estimates 


                 Sample     Sample        Census     Census
                 Age 65+    Age 65+       Age 65+    Age 65+
                 N          Percentage    N          Percentage
White men        295                      56,377
  65-74          136        46.10         30,058     53.31
  75-84          124        42.00         19,228     34.11
  85+             35        11.90          7,091     12.58
Black men        285                      30,675
  65-74          172        60.40         19,487     63.53
  75-84           99        34.70          9,069     29.56
  85+             14         4.90          2,119      6.91
White women      293                      77,505
  65-74          124        42.30         35,403     45.68
  75-84          133        45.40         29,406     37.94
  85+             36        12.30         12,696     16.38
Black women      294                      51,046
  65-74          146        49.70         27,558     53.99
  75-84          108        36.70         17,131     33.56
  85+             40        13.60          6,357     12.45


Note: Census data source: U.S. Census Bureau—2000, SF 1 Data. 


Appendix B. Principal Components Factor Analysis of Neighborhood
Disadvantage Items 


Census Item                                  Factor Loading
Households below poverty line                     .842
Overcrowded households                            .783
Individuals receiving public assistance           .816
Median household income*                         -.805
College education                                -.789
High school education                            -.904
Female-headed households                          .857


Notes: Eigenvalue = 4.815; α = .924.
* Reverse-coded.


Abstract


Humiliation; incarceration; stigma; loss of income, freedom, and respect: most 
research on offending emphasizes these sanctions. Yet classical theorists recognized 
other costs including physical harm. We revive this abandoned insight, arguing 
that danger—the possibility of pain—figures largely in people's decisions to offend. 
Although modern states typically eschew violence, many victims, vigilantes, and 
others assault offenders. This violence is typically more certain, swift, and severe 
than other sanctions, and fear of injury likely deters many potential offenders. 
Yet the possibility of pain may be irrelevant to individuals who boldly believe 
in their unassailability. Consistent with our hypotheses, we find that perceptions 
about danger are significantly associated with involvement in theft, drug selling, 
and prostitution among homeless youth, and that these effects are independent of 
perceptions about a crime's excitement, profit, or other returns. Our results suggest 
that dangers play a key but typically neglected role in the genesis of these crimes. 


In the late eighteenth century Jeremy Bentham argued that danger plays an 
important role in the “hedonistic calculus” people use in making decisions. 
According to Bentham, "danger is nothing but the chance of pain" ([1789] 
1996:144), and as Jaeger and his colleagues have noted, “physical harm may be 
one of the only consequences that (nearly) all social groups and cultures agree 
is undesirable” (2001:88). Thus perceptions about the danger of crime, or the
potential for pain, should influence people's decisions about whether or not to 
break the law. 

At first glance Bentham's hypothesis appears to have little bearing on Western
nations that replaced punishments of the body with other sanctions during the 
early and middle phases of the modern period (Spierenburg 1995). Consistent 
with this shift, most contemporary work on criminal decisions ignores danger and 
focuses on the economic, social, and psychological costs associated with formal 
and informal sanctions. Yet the state's abandonment of physical punishment has 
not eliminated crime's potential for pain for offenders; instead it has increased 
danger's exclusiveness, concentrating it in individuals' and groups' violent 
responses to illegal actions (Black 1993; Brown 1975; Johnston 1996). 

Scholars have noted that danger should discourage offending (Black 1993; 
Gottfredson and Hirschi 1990), but these accounts undertheorize the ways in 
which both fear and fearlessness of physical harm may influence crime. Following 
Bentham, we treat danger as an indicator of potential pain, and focus on how 
the possibility of being harmed by others influences the decision to offend. We 
argue that physical harm is typically swifter, more certain, and more severe than 
other costs of offending, and, thus, that perceptions about danger should have a 
notable effect on the decision to offend. Specifically, we predict that the perception 
that crime is dangerous should discourage offending, whereas the intrepid view 
that offending is relatively safe will encourage it. 

We explore our hypothesis about perceptions of danger by analyzing interview 
and survey data from a sample of homeless youth. People with high offense risks 
may fear a violent response to many types of crime, but we focus on three offenses 
that have a promise of economic returns: theft, drug dealing, and prostitution. 
We examine our respondents' views of the danger associated with these crimes 
and consider how these perceptions are related to offending. We conclude with 
an examination of the issues raised by the deterrence implications of the danger 
in crime. 


The Disutility of Danger 


A "discourse of fear" characterizes many contemporary societies (Altheide 2002; 
Garland 2001). This discourse involves frequent and sensational communications 
that heighten the symbolic awareness—and the expectation—that danger and risk 
are central features of modern life. As a result, people substitute a “perspective 
of fear” for the “fear of something.” This generalized alarm increases people’s 
trepidation about being the victim of a specific crime, as well as their dread of 
crime in general. Moreover, an extensive literature demonstrates that these fears 
strongly influence people’s thoughts, beliefs, and actions. 

Fear may also play a role in the decision to offend, a possibility that has 
received much less attention. As indicated earlier, we hypothesize that crime’s 
potential for danger is likely to frighten people who expect that they will be 
physically injured during or as a consequence of offending. In most Western 
nations, people's violent responses to crime are the key source of this danger.
These reactions may occur in a variety of contexts. Victims may use violence 
to resist offenders, to defend and protect themselves and their property, or in 
retaliatory attacks. This violent “self help” may operate as a form of conflict 
management, punishment, or social control (Black 1993:27). 

Other sources of physical jeopardy involve nonvictims who respond 
aggressively to violations of deeply ingrained sentiments that constitute the 
collective conscience (Durkheim [1893] 1933). Alternatively, people may be 
motivated by individual sympathies, whether these are based in what Smith 
referred to as nonvictims’ “resentment” against offenders, or what Nietzsche
identified as the “desire to punish” (see Garland 1990). People may also view 
offenders as responsible for the deterioration of their community or for harms 
experienced by family, friends, and neighbors. And research indicates that 
regardless of their reasons, some bystanders, residents, business owners, moral 
entrepreneurs, vigilantes, and others outraged by crime respond with violence 
(Jacobs 2000; Sullivan 1989). The police constitute another threat: offenders may 
be the victims of officers who use violence instead of, or in conjunction with,
arrests (Skolnick and Fyfe 1993). 

Other lawbreakers are an additional menace. Rival gang members, competitors, 
and people who prey on offenders (e.g., those who rob drug dealers) may use 
force to establish control in their interactions as they confiscate people's illegal 
earnings or possessions (Jacobs 2000; Sullivan 1989). Co-offenders are another 
source of harm. They may attack collaborators when a crime fails or when 
commitments are broken, as, for example, when dealers assault people who do 
not make the payments required for drugs that they were "fronted" (Johnson 
et al. 1990; Sullivan 1989). Offenders also describe attacks by extortionists who 
use force to access their products or services without having to pay for them 
(Sullivan 1989). 

Three features of people's violent reactions to crime underscore the potential 
importance of perceptions about crime's possible danger. First, individual violence 
is typically more certain than other sanctions once crimes are discovered.
Compared to officials who administer state sanctions, those who respond 
aggressively to illegal acts are less inclined to feel constrained by legal technicalities 
(Black 1980). They are also less likely to be sympathetic to an offender's plea 
of mitigating circumstances such as a disadvantaged background or personal 
hardship, conditions that may protect offenders from other punishments. Second, 
violent reactions to crime typically occur more swiftly than other consequences. 
The long delays in legal sanctioning contrast sharply with the immediacy of 
"private" violent responses that often occur even before an offense is completed. 
Third, violent reactions to crime are often harsher than punishments inflicted 
by others (Black 1993). Current laws typically forbid direct physical punishment 
for all but the most serious transgressions, whereas people who respond to crime 
with violence are less reluctant to inflict harm, regardless of the type of offense. 
Less constrained by notions of fairness, they are unlikely to feel bound to ensure
that “the punishment fits the crime,” and their sanctions range from temporary 
wounds to lasting injuries and death. Permanent disabilities and fatal violence 
stand in stark opposition to the more temporary pains associated with most 
formal and informal sanctions. 

Gottfredson and Hirschi (1990:6) note that the danger of a violent response
should deter a “prudent” person from offending. Yet people vary in their 
perceptions of the likelihood of harm from an activity or situation (Simpson 
1996), and some people may believe that they possess attributes that allow 
them to reduce the possibility of harm. Lyng (1990; 1993) claims that a sense of 
invincibility is common among voluntary risk takers, including offenders (also 
see Jacobs 2000). High-risk activities such as skydiving, mountain climbing, and 
offending involve considerable unpredictability, a potential for chaos, and a clear 
threat to one’s physical or mental well-being. According to Lyng (1990), people 
who engage in high-risk activities are aware of the potential danger but conclude 
that they possess unique abilities that allow them to circumvent it. They know 
that others are hurt and sometimes killed while climbing mountains, skydiving, 
or offending, but they boldly claim “survival skills” that they think protect them 
from harm. Adopting a “self-serving bias” (the perception that one is somehow 
better than others), these individuals maintain that injured people mistakenly 
assumed that they had these abilities, whereas they were actually poseurs who 
“did not know what they were doing” or did not possess the “right stuff” (Lyng 
1990). People who believe that the certainty, celerity, and severity of crime's
danger do not apply to them are unlikely to be discouraged by the possibility 
of harm; thus it is their fearlessness, rather than their fear, that influences their 
decision to offend. Indeed, there is likely something attractive and empowering 
in the perception that one is immune to danger. 

Our hypothesis—that perceptions of crime's potential danger influence 
offending—draws on insights into offending and other risky activities offered in 
various theories in the "rational action paradigm" (Jaeger et al. 2001). Theories 
in this paradigm assume that people have relatively stable preferences (or tastes) 
for various outcomes and can assign a measure of satisfaction to these outcomes. 
People's subjective expected utility functions are the product of their assessment 
of the likelihood of an outcome being realized and the satisfaction it will provide, 
together with their approach to the risk associated with obtaining the outcome 
and to time discounting. Rational actions reflect a decision in which people choose
an action based on ex ante expectations that it will result in the best outcome. In 
other words, people act rationally when they optimize, selecting an action (from 
the set of feasible actions) that will maximize their expected utility. 
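
The optimization described above can be illustrated with a minimal sketch. It assumes expected utility is a simple probability-weighted sum of outcome utilities; it leaves out the risk attitudes and time discounting mentioned in the text, and every action name, probability, and utility value below is hypothetical rather than drawn from the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Outcome:
    probability: float  # subjective likelihood that the outcome is realized
    utility: float      # satisfaction it provides (negative for costs such as pain)


def expected_utility(outcomes):
    """Probability-weighted sum of utilities for one action."""
    return sum(o.probability * o.utility for o in outcomes)


def choose_action(feasible_actions):
    """Return the name of the action with the highest expected utility."""
    return max(feasible_actions, key=lambda name: expected_utility(feasible_actions[name]))


# Hypothetical perceptions of a person who expects a violent response to be likely.
fearful = {
    "offend": [Outcome(0.6, 10.0), Outcome(0.3, -40.0)],  # profit, physical harm
    "abstain": [Outcome(1.0, 0.0)],
}

# Same payoffs, but harm is perceived as very unlikely ("it will not happen to me").
fearless = {
    "offend": [Outcome(0.6, 10.0), Outcome(0.05, -40.0)],
    "abstain": [Outcome(1.0, 0.0)],
}

fearful_choice = choose_action(fearful)
fearless_choice = choose_action(fearless)
```

Under these illustrative numbers, only the perceived probability of harm differs between the two cases, which is enough to reverse the preferred action.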

The rational action paradigm has spawned several theories (e.g., expected 
utility, public choice, decision, and social choice), and theorists differ in their 
use of "thick" versus "thin" approaches. Thick or “soft” perspectives extend the 
rational choice approach by introducing psychological, cultural, and structural 
factors that provide a more detailed perspective and consider a wider range of
factors than expected utility theory (Gruber 2001; Lichbach 2003). We assume
that we can usefully view offending as a risky decision executed under conditions 
of uncertainty (Bueno de Mesquita and Cohen 1995; Eide 1994; McCarthy 2002; 
Nagin and Paternoster 1991, 1993; Paternoster 1989; Schmidt and Witte 1984); 
but we recognize that a variety of factors influence people's preferences and 
perceptions of an outcome’s utility and likelihood, and that these change as people 
acquire additional information, encounter new situations, and change related 
perceptions. Our hypotheses about perceptions of danger are consistent with a 
parsimonious version of rational choice that assumes relatively unambiguous 
preferences; that is, we assume that people usually prefer safety to harm. 


Rediscovering Danger 


Our emphasis on danger contrasts with most contemporary discussions on the 
perceptions of crime's costs. Most work on this topic has concentrated on one 
of four areas: formal sanctions such as arrest, incarceration, and other state 
penalties; informal sanctions such as social stigma and rejection by significant 
others; commitment to normative values and beliefs that the legal system is just 
and moral; and the guilt or shame that norm violations and sanctions may elicit 
(Becker 1976; Eide 1994; Grasmick and Bursik 1990; Meier and Johnson 1977; 
Nagin and Paternoster 1991, 1993; Paternoster 1989; Schmidt and Witte 1984; 
Tittle 1980; Williams and Hawkins 1986; Zimring and Hawkins 1973). Perceptions 
about these costs likely have the greatest influence on individuals who have a 
considerable stake in society, who are embedded in the networks of people who 
support the law, and whose criminal experiences are limited to a small number 
of common misdemeanors or petty offenses (Gottfredson and Hirschi 1990; 
Stafford and Warr 1993; also see Decker, Wright, and Logie 1993). 

Formal and informal sanctions are likely less important for people responsible 
for most serious street crimes, for marginalized individuals, for those who 
have considerable connections with offenders, and for people who have 
successfully offended in the past (i.e., avoided sanctions). Economic and social 
marginalization minimize a person's ties to normative society and can encourage 
the view that crime is a legitimate means for meeting one's needs (Hagan 1992). 
Meanwhile, criminal associates validate offending and are less likely to see crime 
or arrest as a source of stigma (Hagan and McCarthy 1997; Stafford and Warr 
1993). As Nagin notes: “a criminal record cannot be socially . . . isolating if it is 
commonplace” (1998:4). Finally, most offenders likely realize, as a result of their 
own and others’ experiences, that few crimes result in an arrest (Horney and 
Marshall 1992; Piliavin et al. 1986). 

Research on people who are most likely to offend supports the argument 
that formal and informal sanctions play only modest roles in their decisions. 
In a study on the offending decisions of adolescent high-school dropouts, drug
users, and adult offenders (Piliavin et al. 1986), estimates of the likelihood of 
formal (e.g., arrest and incarceration) and informal (i.e., loss of respect from 
friends or neighbors) sanctions were shown to have little influence. Research on 
burglars (Decker, Wright, and Logie 1993) reports that a majority were willing to 
commit a crime when the probability of arrest was 50% and when the possible
penalty was five years’ imprisonment. Similarly, Foglia's (1997) analysis indicates 
that perceptions of the likelihood of legal sanctions and negative responses from 
friends had little influence on inner-city adolescents' involvement in crime. 
Furthermore, several studies suggest that some, but not all, offenders substantially 
underestimate the likelihood of these sanctions (Shover 1996). 

Other research emphasizes that some offenders are concerned about the 
danger of a violent reaction to their crimes. Offenders worry more about being 
injured by their victims than about apprehension by the police (Wright and 
Rossi 1986) and many develop protective strategies to manage the threat of harm 
(Katz 1988; Shover 1996). Nonetheless, their rates of assault and violent death
far exceed those for nonoffenders (Sampson and Lauritsen 1990; Tremblay and 
Paré 2003). 

In sum, physical harm may represent one of crime's key costs. The expectation 
that crime will result in physical harm likely discourages people from choosing 
it, whereas the audacious view that one possesses a set of skills that makes one 
immune to crime's danger likely encourages it. Indeed, we predict that the 
all but universal desire to avoid pain means that the perception that crime is 
dangerous will influence offending above and beyond the effects of perceptions 
about crime's other costs. 


Studying Danger: Data, Variables, and Methods 


Although some studies, particularly ethnographies (Shover 1996; Sullivan 1989), 
report that crime's danger plays a key role in the decision to offend, current 
research does not provide a systematic evaluation of the association between 
danger and offending. We address this oversight with an exploratory analysis 
of data from a study of street youth. A team of researchers collected the data in 
1992 in Toronto and Vancouver, Canada, from respondents they encountered in 
service agencies (e.g., shelters and counseling centers) and street locations where 
the homeless often congregate (e.g., parks and street corners). Over a two-week 
period, researchers "saturated" the streets and social service agencies, speaking 
with all the homeless youth they encountered and determining their eligibility 
for the study (e.g., age appropriateness). The resulting sample includes male and 
female youth who live on the street, in shelters, in temporary housing with friends, 
or in hotels. We interviewed this diverse group of runaways, throwaways, and the 
homeless in offices provided by agencies, and the youth completed self-report 
questionnaires. Four hundred and eighty-two youth participated in this part of
the study. We scheduled second and third wave interviews with our respondents at 
one-month intervals; 78% of our respondents completed a second-wave interview,
but only 53% finished a third interview (see Hagan and McCarthy 1997). 

The items we use to measure our key independent variables, the perceived 
danger of various crimes, were gathered only during the first round of interviews. 
Previous research had shown considerable variability in people’s perceptions 
about the risks of crime over time (Minor and Harry 1982; Piliavin et al. 1986; 
Saltzman et al. 1982), a pattern that we too find for the perceptual variables for 
which we have multiple measures. For example, between their first and second 
interviews (approximately one month apart), about two-thirds of the youth in 
our study changed their assessments of the likelihood of arrest for each of the 
three crimes we monitored (theft, drug selling, and prostitution), and about 
one-quarter revised them by three or more points on a ten-point scale. This 
variability is consistent with the premise that people’s views change as they gather 
new information.

The weak over-time correlations suggest that the most suitable approach to 
modeling perceptions in our data is to use more immediate, rather than distant, 
measures. It has been noted that "causally relevant perceptions are those more 
proximate to crime . . . distal measures of perceived risk may be irrelevant to
behavior" (Piliavin et al. 1986:116). Indeed, the rational choice approach assumes 
that the effects of perceptions will more likely be instantaneous than lagged 
(Grasmick and Bursik 1990). The nature of perceptions and the characteristics 
of our data encourage a focus on cross-sectional analysis of the first-interview 
data; nevertheless, we turn to our longitudinal data at the conclusion of our 
results. Meanwhile, our cross-sectional data offer several advantages over those 
used in other studies: a greater number of measures of crime's perceived costs 
and benefits, a more diverse set of control variables, and greater variation in 
involvement in crime. 

The independent variables in our models are divided into two categories: a 
group of perceptual measures of offending's potential costs and rewards, and a set 
of control variables (see Table 1). Our perceptual variables include crime-specific 
measures about three of crime's possible costs (danger, arrest, legal concerns) 
and two of its potential benefits (excitement, profitability). We measure our 
key variable, perceptions of crime's danger, with respondents' ratings of various 
activities. We gave the youth in our study a list of behaviors that included 
recreational activities typically seen as having a high risk of physical danger (e.g., 
skydiving, mountain climbing), risky behaviors common among the homeless 
(e.g., hitchhiking, sleeping in abandoned cars or buildings) and minor and more 
serious crimes. Respondents rated each behavior using a ten-point scale in which 
1 represented “very safe” and 10 indicated “very dangerous.” We focus here on
respondents' responses to questions about the danger associated with committing
three types of crime: theft, drug selling, and prostitution. As noted earlier, 
danger involves issues of certainty, celerity, and severity, and individuals may
vary in their perceptions of each of these dimensions. In this initial exploration 
we have used a global measure of danger and assume that subsequent research 
will investigate more precise measures, if we find purchase with our present, 
more general approach. 

A second cost variable measures perceptions of the likelihood of formal 
sanctions for each offense. To indicate their views about the individual likelihood 
of arrest for committing the three crimes we study, respondents selected values 
from a ten-point scale with 1 equal to “no chance” and 10 corresponding 
to “certainty.” To help us to determine the distinctiveness of the two sets of
perceptions, the respondents answered questions about formal sanctions in their 
interviews before completing self-report items about danger. 

Our third set of cost variables concerns perceptions about the unacceptability 
of particular crimes. Specifically, we asked our respondents about the extent 
to which they agreed or disagreed with statements about the wrongness of 
taking others’ property, drug use, and prostitution. The conceptual meanings 
of these measures are a source of debate that is incidental to this article: they 
could represent the level of commitment to the normative order, moral beliefs 
that would encourage guilt if contravened, or support for crime-legitimizing 
definitions. They may be specific perceptions that are relatively mutable, or more 
deeply held and relatively rigid core beliefs. These issues notwithstanding, we 
assume that the view that a criminal act is wrong should discourage offending. 
People who believe that an offense is unacceptable may recognize that committing 
it would result in considerable guilt and are therefore motivated to follow the 
law and avoid this cost.

Our data do not contain perceptual measures of the likelihood of informal 
sanctions or their associated costs; however, such estimates have shown 
nonsignificant effects on crime in studies of high-risk populations (Foglia 1997; 
Piliavin et al. 1986). Moreover, these perceptions likely play only a minor role 
in the offending decisions of the youth in our sample. For example, 80% of our
respondents reported that the majority of their street friends had been arrested, 
and 75% said that 50% or more of their friends had been jailed. As well, 56% of 
the nonoffenders in our study said that the possibility of friends’ disapproval had 
no effect on their decision not to offend (only 27% said that concern that their 
friends would disapprove “a fair bit” or “a lot” influenced this decision). Thus it
appears unlikely that many homeless youth confront or fear stigmatization and 
rejection by their peers if they offend or are arrested. 

Although we focus on the potential costs of crime, we also include items 
that measure two of crime’s potential gains: financial and psychological returns. 
We gauged perceptions about the monetary rewards of offending by asking 
respondents to use a ten-point scale to estimate the likelihood of making “lots of 
money" from theft, drug selling, and prostitution (1 = no chance, 10 = certainty). 
We also asked respondents to compare perceived financial returns from illegal 
and legal activities and estimate whether they could make more money from
crime or from a "straight" job (Piliavin et al. 1986). To measure the "thrill" that 
offending can provide (cf. Cusson 1983; Katz 1988), respondents used a ten-point 
scale (1 = very unexciting, 10 = very exciting) to gauge the excitement of the
same activities they had previously rated for danger. 

Three factors encouraged us to introduce measures of crime's potential returns 
into our analysis. First, rewards are central to cost-benefit approaches. Second, 
research suggests that people, particularly youth, typically overestimate the
probability of obtaining the benefits from risky behavior while underestimating 
the likelihood of realizing its costs (Gruber 2001; Millstein 1993). Third, studies 
of offenders indicate that they tend to focus more on crime's potential gains than 
on its possible costs (see Shover 1996: Chapter 6). 

Our equations contain several control variables, the most important of which 
are controls for offending before leaving home. Offenders who successfully avoid 
harm may have heightened feelings of invincibility and a greater likelihood of 
viewing high-risk situations as routine (Jacobs 2000:124). Perceptions of crime's 
harm may thus simply be a consequence of offending, rather than an independent 
contributor to offending. We address this possibility by including measures of 
at-home theft and drug selling in our models of street theft and drug selling. We 
do not have a measure of selling sex while living at home and instead use a single 
item that combines involvement in theft and drug selling at home in our model 
of street prostitution. Our at-home theft scale combines information about the 
frequency of committing four types of theft: stealing goods valued between $10 
and $50; stealing objects worth more than $50 (both categories exclude stealing 
food, clothes, and tobacco products), burglarizing a home or business, and stealing 
objects from an automobile. Our measures of drug selling distinguish between 
selling marijuana or hash and dealing crack, cocaine, or hallucinogens. 

Our other control variables fall into two groups. The first set of items focuses 
on family background: parental unemployment, maternal drug addiction, and
familial criminality. The second concerns the respondent and includes measures 
for the following: age, gender, race, arrest while living at home, unemployment 
since leaving home and in the six months prior to the survey, situational adversity 
on the street (i.e., lack of food and adequate shelter), offense and arrest histories
of friends (the proportion who had committed various types of crimes or had 
been arrested), illegal opportunities, and attitude toward risk-taking. These 
control variables represent the key factors emphasized in current research on 
risk taking (Bromiley and Curley 1992; Gruber 2001; Slovic 2000) and offending, 
particularly for homeless youth (Hagan and McCarthy 1997). Although some of 
our measures are correlated, there is little collinearity among our independent 
variables: diagnostics for the regression analyses reported below indicate that the 
square root of the largest variance inflation factor score is 1.4.
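
To make the collinearity diagnostic concrete, the following sketch works through the arithmetic behind the reported figure. It assumes the standard definition of the variance inflation factor, VIF = 1 / (1 - R²), where R² comes from regressing one independent variable on all the others; the function names are illustrative, and only the value 1.4 comes from the text.

```python
def vif_from_r_squared(r_squared):
    """Variance inflation factor implied by an auxiliary-regression R-squared."""
    return 1.0 / (1.0 - r_squared)


def r_squared_from_vif(vif):
    """Auxiliary-regression R-squared implied by a variance inflation factor."""
    return 1.0 - 1.0 / vif


largest_sqrt_vif = 1.4                      # value reported in the text
largest_vif = largest_sqrt_vif ** 2         # about 1.96
implied_r_squared = r_squared_from_vif(largest_vif)  # about 0.49
```

Read this way, the most collinear predictor shares roughly half of its variance with the other independent variables, and its standard error is about 1.4 times what it would be if the predictors were uncorrelated.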

The measures for two of our dependent variables—frequency of committing 
theft and drug selling since leaving home—mirror measures for offending before 
leaving home. We use a single item to measure our third dependent variable,
number of times the respondent sold sex since leaving home. Approximately 
63% of our respondents had committed a theft since becoming homeless, and
58% had sold drugs, but only 21% had worked in the sex trade. Given these
variables’ distributions, we use tobit regression to estimate models for theft and 
drug selling; we dichotomized prostitution and used logit regression to estimate 
a model for that offense. There is some evidence of heteroskedasticity in our
data, so we used Huber's (1981) formula to obtain robust standard
errors for our results. Consistent with other studies of offending (e.g., Sampson 
and Laub 1993), we used tests of significance to draw attention to important 
effects even though our sample was not selected with a random design. We 
have followed Mohr in using tests of significance to help assess “whether or not
a certain relationship or other quantity is worth further thought—whether it 
might repay additional research effort” (1990:8).
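
As a rough illustration of the two estimators named above, the sketch below writes out the log-likelihood of a standard tobit model and the response probability of a logit model. It assumes left-censoring at zero (the text does not state the censoring point or the software used), takes the linear predictor as given rather than estimating it, and omits the robust standard errors; all function names are hypothetical.

```python
import math


def normal_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def tobit_log_likelihood(y, xb, sigma, lower=0.0):
    """Log-likelihood of a tobit model left-censored at `lower`.

    y     : observed outcomes (e.g., offense frequencies)
    xb    : linear predictor for each observation
    sigma : standard deviation of the latent error term
    """
    total = 0.0
    for y_i, xb_i in zip(y, xb):
        if y_i <= lower:
            # Censored case: probability that the latent value is at or below the limit.
            total += math.log(normal_cdf((lower - xb_i) / sigma))
        else:
            # Uncensored case: normal density of the observed value.
            total += math.log(normal_pdf((y_i - xb_i) / sigma) / sigma)
    return total


def logit_probability(xb):
    """Probability of a positive response in a logit model for a dichotomized outcome."""
    return 1.0 / (1.0 + math.exp(-xb))
```

The tobit form suits outcomes such as offense counts that pile up at zero, while the logit form applies once an outcome has been reduced to a yes-or-no indicator.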


Results 


TALKING ABOUT DANGER 


As expected, interviews with our respondents provided many accounts of the 
physical hazards of offending. Stories about victim retribution were common, 
frequently in cases of breaking/entering and theft. As made evident in a statement 
by one youth, retaliation can be relatively spontaneous: 


[I'd] crawl in through a window, crawl along the house on my knees and 
go and unhook the VCR and TV . . . [In one place] I got hit in the leg 
with a baseball bat and in the back once, and my buddy got cracked in 
the head.


Others reported attacks by victims who planned their revenge: 


I robbed this place five times and what happened was, the guy got smart 
and he slept in the store, you know. And he [caught me and] beat the 
crap out of me. 


Reflecting on the danger of burglary, another youth noted that people are not 
the only potential threat: 


You got to find out if there is a dog . . . You got to make sure there's
something in the house that’s valuable enough, just in case you do get bit,
you know. If you do get bit and you are only going in there for a damn
VCR, it’s pretty stupid.


Several respondents described interactions in which they or other street youth 
who were suspected of a crime or who had an outstanding warrant for their arrest 
were assaulted by police officers. For example, one youth spoke of an incident 
when the police stopped him and a group of street friends: 

And the cops come and say, “What are you doing here?” We're just joking 
around with each other and whatever, right, just having a good time. We 
weren't being loud or anything like this. I don't know. They grabbed my 
friend and they beat him up and handcuffed him and told us to leave. 


Another youth recounted his fear of being assaulted by police for selling drugs: 


I got a lot of businessmen . . . I'd go down there at lunch hours . . . and sell 
hash to them. It was cool. I liked doing it that way better, because those 
type of people are less likely to get caught with something that I sold them 
and are less likely to pin it on me. [Selling] on the street, a cop gets you, 
takes you down to Cherry Beach [a police station], and beats you up. 


Conversations about drug dealing revealed other sources of harm. One 
respondent's account reflects a common situation in which limited economic 
resources encourage people to find someone who will “front” them drugs in 
order to start selling: 

I got in a hole. You got to go and get fronted some [drugs] to get yourself 
out of it and I found [that] I kept digging myself a bit deeper into this hole 
. . . [The people] wanted their money and I didn't pay my bill . . . I just 
kind of disappeared . . . If they find me it could be pretty bad . . . If I had 
[fronted] hundreds of dollars, I'd probably want to hurt somebody too. 


Other youth recognized that some drug purchasers' desperation adds to the 
danger of selling. “If somebody's really dying for a hit,” one explained, “you get 
stabbed, shot, beaten up for your drugs.” Another account indicates that these 
are reasonable fears: 

I got shot in the leg. Right there, there’s one scar. And, uh, I’ve got two 
scars on my other leg. I’ve got one there, and there’s the other one there. 
Someone tried to mug me [when I was] selling hash. I got shot, you know. 
[Another time] I got shot in the butt . . . I got so scared. 

Drug dealing is not the only crime that involves hostile customers. Several 
youth who sell sex reported having encountered clients who attacked them. 
Financial gain is a common motive among these assailants: 

The first night, I made $80, but the dick robbed me. [He] put a gun to my 
head and took the money I made. I was really scared. It was horrible. 

Some customers have even more sinister objectives: 

I got tied up and had a rope tied around my neck and had this guy telling 
me he’s gonna cut me up, but I managed to escape from him. I think that’s 
about the scariest time I’ve ever had. 

Offenders could also encounter danger from the activities of other offenders, albeit 
indirectly. One youth who worked in the drug economy offered an example: 

I’ve done the odd [drug] “couriering.” I stopped doing that after I opened 
. . . ah, I went down to this guy’s basement to pick up the stuff and he 
opens up the door and I see some AK47s. I’m like whoa, and I was like 
12 or 13 at the time, and I’m looking at this and I’m going, I got to get 
out of this drug business, this is ridiculous. 


In other situations offenders may be a more direct source of crime’s danger. 
Competitors may be hostile to those who charge less than the going rate for an 
illegal product or service: 


A lot of them [youth working on the street] get beat up if the word gets 
around [that they are selling below the norm] . . . there are warnings 
[but] if they tell you to “fuck off,” well, then there could be some physical 
abuse. 


This respondent added, however, that some offenders fabricate incidents to 
encourage attacks on others: 


[Some of the time] it’s just troublemakers out there that say, “Oh, this 
person is charging that,” and the kid will get beat up for nothing. But 
they’re just jealous because the kid is doing well. 


Another mentioned that offending may also encourage coercion by more powerful 
offenders: 


I worked independently . . . maybe about two weeks . . . trying to survive 
... and along come these people and the next thing you know, I'm forced 
and I’m told, “You have to do this,” you know. “You're going to do this, 
else we're gonna hurt you” . . . A couple a times it didn't work out and 
I got beaten up. 


Individuals repelled by specific crimes also pose a threat. Several youth we 
spoke with adamantly condemned particular offenses and, as demonstrated 
in the following conversation, reported attacking individuals involved in these 
activities. 


Respondent: I beat up pushers. I beat up a guy, uh, not too severely, I kind 
of gave him a little bit of warning that if he wants to sell that stuff he 
goes downtown and sells it to big guys. 


Interviewer: What about the police arresting you? 


Respondent: What's he [the dealer] gonna say? “I was trying to push drugs 
in front of the school and he beat me up” 


IMMUNITY AND SELF-IMAGE 


Although most of our respondents appeared to be aware of some, if not all, of 
crime's possible danger, they differed in their reactions. At one extreme were 
individuals who concluded that offending would likely result in their being 
harmed. One former drug dealer's comment shows that some youth realize that 
the possibility of injury during offending outweighs any potential returns: 

And like the worst feeling you can get in the entire world is to always be 
like watching over, like looking over your shoulder, watching your back 
all the time . . . You've got to be watching for robbers, right, and you've 
got to be watching for cops, and no matter what, after awhile, you're just 
kind of like, you know, always watching your back. 


At the other extreme are those who conclude they are immune to the perils of 
crime. Remarks from these respondents suggest that although they were aware 
of the many sources of danger, they thought they possessed the skills, knowledge, 
and understanding to recognize situations or individuals that were potentially 
dangerous: 


I kind of know how to judge somebody, or I feel I know how to judge 
somebody . . . It's just like vibes that you get from that person . . . If they 
look nervous themselves, if they don't look directly at you, or if their eyes 
keep shifting all the time, um, it's hard to explain. It's just like a sixth sense 
... something that you develop over time. 


Some also claimed a sensitivity or instinct to foresee potential danger: 


I'm a selective person . . . Usually you can tell by their opening question . . . 
just the way they present themselves . . . You just know, it's a gut feeling. 
. . . It's just like, no this is not right. 


Others spoke of their physical abilities and argued they could defend themselves 
against attacks, as here in the words of a young woman: 


I know how to handle myself in any kind of situation. I took a lot of self- 
defense classes. You better talk to me, instead of putting your hands on 
me, 'cause once you put your hands on me, I'm gonna try and hurt you 
. . . [and] I've beaten up a couple of people. 


Another regular response reflects the confidence that youth have in their ability 
to escape from danger if it arises: 


Interviewer: Were you worried that you were going to get beaten up or 
anything like that when working [in the sex trade]? 


Respondent: No, 'cause you can just say no. It's easy enough to jump out 
of the car or whatever, so I wasn't too worried about it. 


Overall, our interviews suggest that the possibility that offending would 
result in injury deterred many of our respondents from offending. In contrast, 
confidence about their ability to avoid harm appears to have increased other 
respondents' willingness to consider crime as an option. 


PERCEPTIONS AND THEIR CONSEQUENCES 


Our survey data permit a more systematic examination of the relationships 
suggested by our interviews. We begin the quantitative analysis presented here 
with an examination of the factors that influence perceptions about crime's 
danger. Although that question is not a focus of this article, this preliminary 
analysis allows us to explore how prior offending influences the perception of 
danger (see Table 2). The results from OLS analyses indicate that perceptions of 
the dangers of stealing and selling drugs are significantly reduced by previous 
involvement in these crimes (b = -.081, S.E. = .021 and b = -.158, S.E. = .032; 
the effect of prior offending on the perceived danger of prostitution is not 
significant). However, these effects are relatively modest, and collectively our 
independent variables explain only a small part of the variation in perceptions 
(between 5% and 10%). These results suggest that perceptions about crime’s 
danger are determined only partially by prior criminal experiences. They 
therefore support our investigation of perceptions’ independent effects on 
offending. 


Table 2. OLS Regression of Background Variables' Effects on Perceptions 
of Crime's Danger 

                              Theft             Drug Selling      Prostitution 
                              b        S.E.     b        S.E.     b        S.E. 
Independent variables 
Age                           .083     .047     95353    .046     -.040    .042 
Gender                        -.290    .297     -.725*   .251     -.796*   .225 
Race                          .050     .247     2297,    .242     -.007    .218 
Family disruption             -.167    .249     -.317    .244     -.179    .419 
Parental unemployment         -.137    .168     -.018    .164     .260     .145 
Maternal drug addiction       .025     .161     -.063    .158     -.123    .141 
Family member arrested        -.013    .128     -.322    .126     -.166    Jil 
Offended at homeᵃ             -.081*   .021     -.158*   .032     -.008    .014 
Arrested at home              -.034    .073     .044     .071     .052     .065 
Constant                      4.934    1.127    5.925    1.096    10.220   .980 
R²                            .052              .102              .049 

(n = 482) 
ᵃ Crime-specific (i.e., measures of theft, selling drugs, and a combination of the two in the prostitution 
model) 
*p < .05 (two-tailed) 

Our second set of quantitative findings concerns the bivariate correlations 
between perceptual variables that measure various costs and benefits associated 
with offending. As indicated in Table 3, all but 9 of the 120 correlations are 
less than .3 and none are larger than .4. The weak relationships among our 
perceptual variables suggest five points; three concern our measures of danger, and 
two apply to items that measure other perceptions. First, perceptions of a crime's 
danger differ from assessments about the likelihood of arrest. The correlations 
between perceptions about the danger of theft, drug selling, and prostitution and 
perceptions of the likelihood of arrest for these acts reveal sufficient independent 
variation to establish reliably the separate effects of each construct (Sampson and 
Laub 1993:197). Second, the view that a particular crime is dangerous does not 
translate into a similar fear for other offenses. Third, the conclusion that a crime 
is dangerous does not correspond with the perception that it is exciting; indeed, 
the opposite pattern appears, suggesting that the excitement associated with 
offending may lie in the perception that one is immune to the danger associated 
with crime. Fourth, assessment of whether crime in general is more lucrative 
than legal employment is distinct from perceptions about a specific type of crime's 
profitability. Fifth, the various perceptions about offending represent independent 
forces; combining them in scales may thus be inappropriate (Paternoster 1986). 
Overall, these patterns support our position that the danger associated with 
offending represents a unique cost. They also highlight the need to consider the 
independent effects of various perceptions on crime. 

We extend our focus in Table 4, presenting tobit results that express the 
relationships of perceptual and other variables with theft and drug selling. Our 
results support our theoretical position: even with the introduction of several 
controls including prior offending, involvement in theft (b = -.445, S.E. = .198) and 
in drug dealing (b = -.357, S.E. = .160) significantly decreases as the perceptions 
that these activities are dangerous increase. Individuals who had rated theft and 
drug selling as very dangerous (10 on a scale of 10) have predicted scores for these 
offenses (1.342 and 1.630, respectively, with the values of all other variables set 
to their means) that are about three or four times smaller than the scores (5.344 
and 4.851) for those who rated the offense very safe (1 out of 10). 
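
The figures in this paragraph can be checked with a few lines of arithmetic. The following Python sketch is not part of the original analysis: it takes the coefficients, standard errors, and predicted scores exactly as reported in the text, treats b/S.E. as approximately standard normal (an assumption that is reasonable for n = 482), and uses a helper name, `z_and_p`, that is introduced here purely for illustration.

```python
import math

def z_and_p(b, se):
    """Return the ratio b/se and its two-tailed p-value under a normal approximation."""
    z = b / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-tailed area beyond |z|
    return z, p

# Perceived-danger coefficients and standard errors as reported in the text.
reported = {"theft": (-0.445, 0.198), "drug selling": (-0.357, 0.160)}
tests = {offense: z_and_p(b, se) for offense, (b, se) in reported.items()}

# Predicted scores as reported in the text: (rated very safe, rated very dangerous).
predicted = {"theft": (5.344, 1.342), "drug selling": (4.851, 1.630)}
ratios = {offense: safe / dangerous for offense, (safe, dangerous) in predicted.items()}

for offense in reported:
    z, p = tests[offense]
    print(f"{offense}: z = {z:.2f}, p = {p:.3f}, safe/dangerous ratio = {ratios[offense]:.2f}")
```

Under these assumptions both coefficients fall below the .05 threshold, and the ratios of predicted scores come out near 4 for theft and near 3 for drug selling, which is the sense in which the scores are "about three or four times smaller."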

Involvement in theft and in drug selling is further 
discouraged by perceptions of the unacceptability of the particular crime 
(b = -1.175, S.E. = .384; b = -.792, S.E. = .292). In contrast, neither offense was 
notably influenced by perceptions of the likelihood of arrest. Some, but not all, 
of crime’s potential benefits also play an important role. Theft and drug selling 
increase with perceptions that they are exciting (b=.364, S.E.=.171; b=.614, 
S.E.=.129) and with the perception that crime in general is more lucrative than 
legal (“straight”) work (b=.772, S.E.=.307; b=.707, S.E.=.237). Net of these 
more general perceptions, perceptions about the amount of money that theft 
and drug selling can likely generate have nonsignificant effects on offending. The 
contrasting effects of these competing measures of crime’s potential economic 
returns suggest that offenders do not expect to “get rich” from stealing or selling 
drugs but do anticipate that crime pays better than legal employment (McCarthy 
and Hagan 2001). In other words, offenders appear to adopt a relative, rather 
than an absolute, lens when contemplating crime’s financial returns (Decker, 
Wright, and Logie 1993). 

Several of our control variables also show significant effects on offending. 
As expected, offending at home increases involvement in street theft and drug 
dealing. Additional effects indicate that involvement in either crime escalates 
with the proportion of friends involved in that particular offense and with the 
frequency of offending opportunities. Lack of food and adequate shelter while 
homeless and length of time living on the street also contribute to theft and 
drug selling, relationships consistent with the claim that situational adversity 
plays an important role in the genesis of crime (Hagan and McCarthy 1997). 
The remaining significant effects are specific to theft: stealing is less common 
among older youth and is more frequent among males. 


Table 4. Tobit Regression Coefficients of Perceptual and Other Variables’ 
Effects on Theft and Drug Selling 

                                            Theft                Drug Selling 
                                            b          S.E.      b          S.E. 
Age                                         -.539*     .188      .074       .162 
Gender                                      5.560*     1.027     1.206      .802 
Race                                        -1.586     1.077     -.308      .790 
Family disruption                           1.271      .950      1.367      .754 
Parental unemployment                       -.066      .699      -.364      .494 
Maternal drug addiction                     -.625      .636      -.057      .490 
Family member arrested                      .188       .499      -.014      .367 
Offended at homeᵃ                           .998*      .098      .481*      .081 
Arrested at home                            .426       .309      -.008      .224 
Percentage of friends who offendᵃ           10374      .016      .056*      .014 
Percentage of friends arrested              -.005      .016      -.020      .014 
Adversity                                   .564*      .181      2972/5     .136 
Months on the street                        10727      .025      .043*      .020 
Unemployment                                .078       .633      .505       .483 
Illegal opportunities                       .797*      .378      1.670*     .289 
Perception that 
  Activity is dangerousᵃ                    -.445*     .198      -.357*     .160 
  Activity will lead to arrestᵃ             -.079      .183      -.054      .161 
  Activity is wrong/should be illegalᵃ      -1.175*    .384      -.792*     .292 
  Activity is excitingᵃ                     .364*      .171      .614*      .129 
  Activity is lucrativeᵃ                    -.094      .194      .025       .451 
  Crime is more lucrative than legal work   .772*      .307      .707*      .237 
Enjoy taking risks                          .890       .474      -.067      .365 
Constant                                    3.218      6.527     -10.461    4.963 
χ²                                          252.35               425.22 
Df                                          22                   22 
-2 × log likelihood                         2392.269             2074.800 

(n = 482) 
ᵃ Crime-specific 
*p < .05 (two-tailed) 

Table 5 reports results from our logit model of prostitution. Mirroring 
the trend reported above, our key variable, the perception that prostitution is 
dangerous, is negatively and significantly associated with involvement in the sex 
trade (b = -.130, S.E. = .064). Thus the odds of participating in prostitution drop 
by about 13% for every point increase in the perception of this crime's danger. 
For respondents who rated prostitution very dangerous (10 on a scale of 10), 
the predicted probability of offending (.116) is less than half the score (.298) for 
respondents who rated this offense very safe (1 out of 10). 


Table 5. Logit Regression Coefficients of Perceptual and Other Variables’ 
Effects on Prostitution 

                                            Prostitution 
                                            b          S.E. 
Age                                         .044       .063 
Gender                                      -.958*     .347 
Race                                        .164       .327 
Family disruption                           .028       .306 
Parental unemployment                       .005       .219 
Maternal drug addiction                     -.400*     .184 
Family member arrested                      .122       .142 
Offended at home                            .001       .021 
Arrested at home                            -.124      .108 
Percentage of friends in sex trade          .033*      .005 
Percentage of friends arrested              -.008      .005 
Adversity                                   .040       .052 
Months on the street                        -.001      .008 
Unemployment                                .214       .208 
Illegal opportunities                       .396*      .116 
Perception that 
  Prostitution is dangerous                 -.130*     .064 
  Prostitution will lead to arrest          -.038      .054 
  Prostitution should be illegal            -.221*     .106 
  Prostitution is exciting                  up         .060 
  Prostitution is lucrative                 -.052      .057 
  Crime is more lucrative than legal work   .100       .105 
Enjoy taking risks                          -.207      .151 
Constant                                    .012       1.939 
χ²                                          90.33 
Df                                          22 
-2 × log likelihood                         354.200 

(n = 482) 
*p < .05 (two-tailed) 
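
The odds and predicted-probability figures for the prostitution model follow from the logit coefficient by simple arithmetic. The Python sketch below is not part of the original analysis: it uses only the reported coefficient (-.130) and the reported predicted probability at a danger rating of 1 (.298), assumes that all other variables are held fixed, and introduces the helper name `shift_probability` for illustration.

```python
import math

b_danger = -0.130   # reported logit coefficient for perceived danger
p_safe = 0.298      # reported predicted probability at a danger rating of 1

# One-point change in perceived danger multiplies the odds by exp(b).
odds_ratio = math.exp(b_danger)
pct_drop_per_point = 100 * (1 - odds_ratio)

def shift_probability(p, b, delta):
    """Move a probability p by delta units of a predictor with logit coefficient b."""
    odds = p / (1 - p)
    new_odds = odds * math.exp(b * delta)
    return new_odds / (1 + new_odds)

# Moving from a rating of 1 to a rating of 10 is a nine-point change.
p_dangerous = shift_probability(p_safe, b_danger, 10 - 1)

print(f"odds ratio per point: {odds_ratio:.3f}")
print(f"percent drop in odds per point: {pct_drop_per_point:.1f}")
print(f"predicted probability at rating 10: {p_dangerous:.3f}")
```

The exact odds ratio, exp(-.130), is roughly .88, a drop of a little over 12% per point; the reported "about 13%" matches the linear approximation 100 × |b|. Shifting the probability of .298 by nine points reproduces the reported value of .116.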

Involvement in prostitution is also negatively related to the view that 
the sex trade should be illegal (b=-.221, S.E.=.106). All other perceptual 
measures, including the likelihood of arrest, have little influence on prostitution. 
Furthermore, only a few control variables have a direct effect on prostitution. 
Involvement in the sex trade is more common among females and among those 
whose mothers had drug problems; it is positively associated with the proportion 
of friends involved in this activity and with illegal opportunities. 

We reestimate the above equations to pursue further implications of these 
patterns. In these new equations, we replace our danger measures with those 
used in the other equations (e.g., replacing the perception of theft’s danger first 
with the assessment of drug selling’s danger and second with the perception of 
prostitution’s danger). We do not present a table of these results because they 
indicate little overall crossover between crime types (see also Paternoster 1989; 
cf. Montmarquette, Nerlove, and Forest 1985). Indeed, the only significant 
crossover involving measures of danger indicates that theft decreases with the 
view that drug selling is hazardous (b = -.709, S.E. = .214). 

Extending our analysis, we explored whether perceptions about the danger in 
crime interact with other variables in our models (see also Grasmick and Bursik 
1990). Interactions included our measures of danger and all of our perceptual 
variables, as well as any control variables that had a significant effect on any of 
the three crimes that we have examined. Only one of these interaction terms 
showed a significant effect: theft was positively influenced by the interaction 
between the belief that it is not dangerous and the proportion of friends involved 
in stealing (b=.014, S.E.=.007). Interaction effects are difficult to detect in survey 
research, particularly in studies that have samples that are in the hundreds rather 
than the thousands (McClelland and Judd 1993). Nonetheless, the paucity of 
significant interaction effects in our analysis suggests that perceived danger’s 
strongest effects are direct. 

We conclude this section on our analyses with a report on our longitudinal 
data. Our hypotheses about the effects of crime’s perceived danger and the 
changing nature of perceptions suggest that the effects of our first-interview 
perceptual measures should decrease over time and work primarily through their 
effect on our measures of first wave offending and the harm, or lack thereof, that 
accompanies it. As noted earlier, we assume that people use their experiences (as 
well as the experiences of others and other sources of information) to update 
their perceptions about crime’s costs and rewards. We cannot directly demonstrate 
updating of perceptions about danger because we do not have second wave 
measures. However, the change in our other perceptual measures suggests that, like 
participants in other surveys (Matsueda, Kreager, and Huizinga 2003; Pogarsky 
et al. 2004), our respondents update their views about crime’s other risks. 

With this caveat in mind, we examined the effects of perceptions about danger 
on second-interview measures of offending (see Table 6). Our second-interview 
measures parallel those we used to measure crime in the first interviews, but 
these are limited to offenses committed during the twelve days before the second 
interview. We used negative binomial regression for the count data shown here 
(which reflect the number of days on which the respondent offended). Our 
results indicate that our three first-interview measures of danger are significantly 
associated with second-interview offending at the bivariate level (i.e., theft, b = 
-.224, S.E. = .080; drug selling, b = -.161, S.E. = .056; prostitution, b = -.262, S.E. 
= .077). But these effects are dramatically reduced and become nonsignificant 
when we control for first-interview offending (i.e., theft, b = -.010, S.E. = .095; 
drug selling, b = -.044, S.E. = .057; prostitution, b = -.080, S.E. = .048). The effects 
of danger also decrease to nonsignificance if we introduce our other perceptual 
variables and our first-interview measures of street experiences (e.g., adversity, 
illegal opportunities, or deviant networks). 
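
To make the size of this attenuation concrete, the Python sketch below (an illustration, not part of the original analysis) converts the reported negative binomial coefficients into incidence rate ratios and computes how much each coefficient shrinks once first-interview offending is controlled. The coefficient values are those given in the text; the function names `rate_ratio` and `attenuation_pct` are introduced here for illustration.

```python
import math

# (bivariate b, b after controlling for first-interview offending), as reported in the text.
coefs = {
    "theft": (-0.224, -0.010),
    "drug selling": (-0.161, -0.044),
    "prostitution": (-0.262, -0.080),
}

def rate_ratio(b):
    """Incidence rate ratio implied by a negative binomial coefficient."""
    return math.exp(b)

def attenuation_pct(b_before, b_after):
    """Percentage by which a coefficient shrinks toward zero after adding controls."""
    return 100 * (1 - b_after / b_before)

summary = {
    offense: (rate_ratio(b0), rate_ratio(b1), attenuation_pct(b0, b1))
    for offense, (b0, b1) in coefs.items()
}

for offense, (irr0, irr1, shrink) in summary.items():
    print(f"{offense}: rate ratio {irr0:.2f} -> {irr1:.2f}, coefficient reduced by {shrink:.1f}%")
```

On these figures, a one-point increase in perceived danger is associated with roughly 15% to 23% fewer offending days at the bivariate level, and each coefficient loses more than two-thirds of its magnitude once first-interview offending enters the model.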

Although we cannot determine the precise causal connections, our cross- 
sectional and longitudinal findings suggest that perceptions about danger may 
play an important, and perhaps a very immediate, role in offending decisions. The 
mechanisms by which this process occurs clearly require further study. 


Discussion 


The decline in Western nations' use of direct physical pain to punish lawbreakers 
has not eliminated crime’s potential for harm, nor all potential offenders’ fear 
of it. Instead, people’s apprehensions may focus more on the danger of others 
who respond to crimes with severe, perhaps fatal, retaliatory violence. Our 
cross-sectional analyses demonstrate that perceptions about crime’s danger are 
significantly and negatively related to offending: that is, crime decreases with 
the view that an offense is dangerous. This pattern occurs across the three types 
of crimes examined—theft, drug selling, and prostitution—and persists with 
controls for other perceptual variables and commonly identified correlates of 
offending. These perceptions are also associated with offending in the period 
before our second round of interviews and may have influenced subsequent 
offending primarily through their relationships with offending prior to the first 
interview. These findings suggest that perceptions about crime’s danger may 
play an important role in the etiology of offending. Subsequent research should 
thus explore the effects of both distal and more proximate perceptual measures 
on offending, as well as the relationships between views about crime and its 
consequences. 

This article adds to a growing literature that demonstrates that decisions by 
youth to engage in risky activities are influenced by perceived costs and benefits, 
not solely by emotional decision making that is devoid of rationality (see Gruber 
2001). Youth who fear and youth who are fearless in the face of crime’s potential 
harm both appear to use these perceptions in making decisions about offending. 
These findings are consistent with diverse explanations from sociology, economics, 
and psychology. Thus they should not be used to champion the perspective of 
one discipline over another but should be used to build more comprehensive 
theories of risk taking and decision making. 

Theorists and researchers need to broaden their thinking in other ways. 
Sociologists have convincingly demonstrated that the potential for informal 
sanctions has a greater effect on criminal decisions than does the threat of 
formal sanctions or other factors highlighted in conventional economic analyses 
of crime. Future work on criminal decision making must consider these distal 
factors in concert with more proximate ones. Sanction theory’s selective and heavy 
emphasis on legal consequences suggests an image of offenders who base their 
decisions to offend on weighty deliberations about the likelihood and severity 
of legal punishments, stigma, and ostracism. In contrast, our findings indicate 
that potential offenders may be more powerfully influenced by more immediate 
consequences of crime, that is, by their perceptions of the probability that they will 
be harmed by others. By comparison, the influences of the law and community 
seem remote and abstract. 


Table 6. Negative Binomial Regression Coefficients of First-Interview 
Perceptions and Offending on Second-Interview Theft, Drug 
Selling, and Prostitution 

                                        Second-Interview Offending 
                                        Theft             Drug Selling      Prostitution 
                                        b        S.E.     b        S.E.     b        S.E. 
First interview 
Perception that offense is dangerous    -.224*   .080     -.161*   .056     -.262*   .077 
Constant                                .494     pala     1.252    BIT      WATT     2592 
χ²/Df                                   7.81/1            8.28/1            11.47/1 
-2 × log likelihood                     487.864           831.824           441.362 

Perception that offense is dangerous    -.010    .093     -.044    .057     -.080    .048 
Offending                               .134*    .026     120528985026      2072728  .008 
Constant                                -2.058   .683     -2.479   .499     -2.392   .576 
χ²                                      34.15             148.03            87.56 
Df                                      2                 2                 2 
-2 × log likelihood                     460.074           764.048           359.872 

(n = 376) 
*p < .05 (two-tailed) 

Our research also suggests a need to move beyond factors that discourage 
crime among those who have a considerable stake in normative society and little 
criminal experience. We must examine more closely the views that influence 
people on the margins of society and who are more familiar with offending and 
punishment. Although the possibility of harm appears to play an important 
role in these people’s decisions about offending, perceptions other than those 
studied here undoubtedly affect their choices as well. For example, research has 
indicated that offending is related to perceptions about a criminal occupation’s 
prestige (Matsueda et al. 1992). Other views, such as the conclusion that crime 
will lead to the acquisition of new social criminal capital (i.e., establishing new 
connections with other offenders), or its human counterpart (i.e., a conduit to 
acquiring new skills) may also encourage people to choose offending over other 
alternatives (McCarthy and Hagan 2001). Similarly, offending may be affected 
by the perception that particular crimes, or types of crime, may undermine or 
destroy social criminal capital, or may prevent the acquisition of new sources of 
it. Future research would be enhanced by greater attention to the consequences 
of these and other perceptions. 

We must also explore in greater detail people's perceptions of the danger of 
crime. Subsequent research should investigate the similarities and differences 
between perceptions about crime's danger and views about its other costs. For 
example, deterrence research suggests that estimates of the likelihood of arrest 
are often independent of, and have different consequences for, offending than do 
conclusions about arrest's negative consequences. A similar pattern may occur 
for perceptions about danger. Future research should explore how perceptions 
about danger's certainty, severity, and celerity independently and jointly influence 
the decision to offend. 

We also know little about the processes that encourage people to see crime as 
dangerous, or those that foster the view that they are able to surmount crime’s 
potential for harm. Previous research suggests several possibilities. People vary 
in their access to information about crime's potential harms (Pogarsky, Piquero, 
and Paternoster 2004; Stafford and Warr 1993), and incomplete information 
may compromise their assessment of a potential victim's compliance or resistance 
(Anderson 1999; Gambetta 1993). Alternatively, individuals who adopt the 
“gambler’s fallacy” (Pogarsky and Piquero 2003) may assume that crime-related 
harm is a rare event whose occurrence (either for themselves or others) lowers the 
likelihood of danger for future crimes; thus they may systematically underestimate 
the likelihood of their own victimization. Another approach hypothesizes that 
people may prefer to avoid harm and believe an act is dangerous yet commit 
it when they encounter a social interaction that triggers strong emotions (e.g., 
embarrassment) that “override” their preference to avoid danger. In other words, 
emergent preferences and preference switching may explain people’s decisions 
to commit a dangerous act (Fontdevila 2002). Katz (1988) has eloquently 
described a “transformative” process that individuals may experience in the 
immediate foreground or situational dynamics of a crime. As part of this process, 
people redefine themselves in relation to others, the law, and their capabilities 
to offend. Perhaps individuals who see themselves as immune to crime’s harm 
have undergone such a transformation of self-image but experienced the 
awakening over a longer period. Subsequent research should investigate the 
various explanations suggested here in order to clarify why some individuals 
view offending as inherently risky while others are convinced of their ability to 
evade crime’s potential for harm. 

Our findings suggest that the use of violence against offenders could reduce 
crime, but the extent of such reduction remains opaque. Increasing crime’s harm, 
and the knowledge of it, may have limited effects on offenders who believe they are 
immune to harmful consequences; these people may have “stickier” perceptions 
that require considerable contradiction before they change. Moreover, if these 
offenders believe that most crime victims will use violence, they may be more 
likely to use preemptive violence, thereby increasing victim costs. 

Crime victims may incur other costs if they rely on violence or its threat to 
deter offenders. Game theory research indicates that relying on individuals to 
prevent and respond to crime can discourage police activity, thereby creating 
greater opportunities for offending (Cressman, Morrison, and Wen 1998). If 
Elias and others are right (see Garland 1990), most people would also receive 
little lasting satisfaction from resorting to violence, particularly if it is excessive 
or used against offenders who commit nonviolent crimes. Garland (1990:243) 
notes that most modern state-based punishment is "institutionally ordered and 
discursively represented" in ways that disguise the pains in its practices. Individual 
violent responses to crime could negate this protective veil. They could also result 
in individual remorse and guilt that may be more costly than the losses incurred 
(e.g., financial, minor physical injuries, or regret at not having intervened). 

Mead's (1918) discussion of punitive justice points to further, macro-level 
consequences of punishments based on an "attitude of hostility" that may underlie 
individual violent responses to crime. These effects include support for the 
narrow beliefs that crime is caused exclusively by individual characteristics and a 
corresponding decrease in concerns for the social conditions that contribute to it. 
Sanctions communicate a society's views on crime and punishment, but they also 
convey assumptions about individual rights and citizenship (Duff 1996). Treating 
people as citizens requires that punishment create prudential incentives to obey 
the law, but it must supplement and not replace rational, moral persuasion. As 
Bentham reminds us (in the gender-specific language of the time), “it has been 
too frequently forgotten, that the delinquent is a member of the community, as 
well as any other individual . . . His welfare is proportionately the welfare of the 
community—his suffering the suffering of the community” (quoted in Zimring 
and Hawkins 1973:42). Thus, although perceptions about danger may inform
the decision to offend, a sanction system that resorts to fear of pain is not a
panacea for crime. 


Notes 


1. Jackman (2002) summarizes the complexities of defining violence, aggression, and harm 
and notes that people often choose to experience physical harm as a means to an end (e.g., 
plastic surgery). We simplify definitional issues and restrict our focus to the use of physical 
force against others that results in, or has a high likelihood of resulting in, injury or death. 


2. For example, consider incident data from the U.S. Victimization Survey from 1992 
through 1999 (U.S. Department of Justice 2001). Victims who confronted offenders chose to 
use a weapon or some other form of violence against the offender over calling the police in 
about 25% more of the incidents (2,988 compared with 2,374). 


3. Lyng's (1990:872) emphasis on invulnerability resembles Claster’s (1967) argument that 
delinquents’ lower estimates of the probability of arrest reflect a belief in their “magical
arrest immunity.” Yet, unlike Claster, Lyng does not attribute participation in crime to innate 
needs or personality traits but argues that the motivation originates in people’s alienation 
and their opposing desires for spontaneity and control. 


4. Genetics may also play a role. For example, a longer version of the D4DR dopamine 
receptor gene may increase the attractiveness of risk taking (Pinker 2002:48). 


5. There is considerable debate about how to approach time discounting (O’Donoghue 
and Rabin 2001): some analysts treat it as part of a person’s preferences; others assume 
that people’s discounting factor is typically “excessively myopic”; a competing orientation 
assumes that people’s discounting changes across decisions. We assume the latter. 


6. A detailed discussion of thick versus thin rational choice theories is beyond the limits of 
this article (see Lichbach 2003 for a summary). 


7. Other contributing factors could include individual propensity to offend as well as the 
possibility that offenders are present-oriented and thus unable or unlikely to consider future 
costs (Gottfredson and Hirschi 1990). 


8. Bouffard (2003) finds that many college students also list physical injuries as an 
important consequence of offending when asked to generate a list of crime’s potential 
negative consequences.


9. Comparisons between sample dropouts and those who returned for a second interview 
reveal significant differences for 5 of the 39 variables in our study. The latter are older, show 
higher scores for the perceptions that drug selling will result in an arrest and that drug use 
should be legal, and show lower scores for the view that prostitution is lucrative and for 
involvement in street theft. 


10. The average correlation between first-interview and second-interview measures for 
perceptions about the likelihood of arrest for the three types of crime we consider is .485. 
The average correlation between first-interview and third-interview measures for the 
likelihood of arrest is .078. 


11. Weak correlations over time may also reflect unreliable measures or a basic instability in 
a perception (i.e., a perception that varies almost randomly over time for any individual). 
We assume but cannot empirically demonstrate that updating explains more of the 
variation in attitudes over time than do a lack of reliability or constant variability. 


12. Panel designs also do not solve the causal order problem because of the instability of 
perceptions and the variation in effects of proximate versus distal views (see Grasmick and 
Bursik 1990). 


13. We assume that respondents interpreted danger as the possibility of physical harm. This 
accords with dictionary definitions (e.g., OED), common use (Simpson 1996), and studies 
on risk (Slovic 2000). 


14. People's perceptions of the danger of the sex trade may also reflect fears of contracting 
sexually transmitted diseases, particularly HIV. 


15. Our data do not contain estimates of perceptions of other formal sanctions such as 
incarceration, but Piliavin and colleagues' analysis (1986) suggests that estimates of arrest 
contribute more to perceptions of formal risk than do estimates of imprisonment. 


16. We could not locate any studies that explore the differences between perceptions 
and beliefs about offending, but research on environmental beliefs and perceptions
demonstrates their distinctiveness (see O'Connor, Bord, and Fisher 1999). 


17. The low correlations between these measures (e.g., r = .047 between views about theft
and prostitution) suggest that individuals have varying degrees of support for different 
norms or laws. 


18. Reports from nonoffenders indicate that other factors varied considerably in their 
importance. Only 20% reported that concerns about employment influenced their decision
"a lot," whereas 55% said that the immorality of crime had a major influence.


19. We do not have measures of violent victimization related to offending at home or on the 
street. 


20. Our measures of prior offending and the availability of illegal opportunities to offend 
should also act as reasonable proxies for variables emphasized in self-control theory 
(Gottfredson and Hirschi 1990). 


21. Perceptions about crime's potential costs and benefits likely influence the decision to 
commit a specific illegal act or criminal event (Cornish and Clarke 1986), as well as general 
involvement in particular types of crime. We focus on the latter. 


22. Sixty percent of males had committed a theft, 69% had sold drugs, and 17% had sold
sex; the proportions for females are 53%, 49%, and 29%. The theft and drug selling means 
(ordinal variables) are significantly higher for males (6.0397 and 7.104) than for females 
(4.290 and 3.723), but the reverse pattern occurs for the means for prostitution (6.085 
versus 11.251, interval variable). 


23. The results for a tobit model of prostitution are similar to those for the logit analysis. 
Tobit models of logged theft and drug selling also produce results similar to those reported 
here.


24. Models of perceptions about arrest reveal a similar pattern. Prior theft and drug selling 
significantly decrease estimations of the probability of arrest for these crimes (b = -.058,
S.E. = .021, and b = -.141, S.E. = .031), but our models explain only a small proportion of
the variance (17% and 6%). 


25. Reliability analyses indicate that combining the items by type (e.g., perceptions about
danger) or by crime (e.g., all items related to theft) produces low Cronbach’s alphas (e.g., 
.397 for theft items and .351 for danger measures). Factor analyses of the items also reveal 
their distinctiveness. For example, three of the five theft items have loadings of less than .35 
in a factor analysis of theft items (as do two items in a factor analysis limited to perceptions 
about danger, arrest, and morality). Correlated attitudinal items that fall within this range 
typically produce an “indeterminant factor structure” (Robinson, Shaver, and Wrightsman 
1999:11). 


26. A chi-square test for a factor analysis of the three danger and three arrest items indicates 
that a one-factor model is inadequate. Moreover, the three measures of perceptions about 
danger have notably larger associations with offending (theft, r = -.270; drug selling, r =
-.327; prostitution, r = -.215) than do the three measures about arrest (theft, r = -.082; drug
selling, r = -.149; prostitution, r = -.051).


27. Indeed, a measure that combines the two danger measures with the highest correlation 
(theft and drug selling) has a small, nonsignificant effect in the multivariate regression 
analysis of drug selling we present below and provides no substantial improvement in 
our analysis of theft. Jacobs (2000:31) also reports that several of the drug robbers he
interviewed said that they were too scared to shoplift. 


28. Questions about other activities (e.g., mountain climbing) reveal similar patterns: 
the correlations between perceptions that these activities are exciting and dangerous are 
negative and significant. 


29. Analyses that replace independent measures of home offending and arrest with measures 
that combine the two find that being a “sanctioned offender” significantly increases street 
theft (b = 4.907, S.E. = 1.175) and drug dealing (b = 3.758, S.E. = 1.062), results consistent
with the process identified in theories that emphasize labeling, criminal stigma, and 
defiance. Meanwhile, successful offending (i.e., a “nonsanctioned” offender) is positively 
related to subsequent offending, but the net effect is weaker for theft than drug selling and 
significant only for the latter (b = 2.219, S.E. = 1.134, and b = 2.611, S.E. = 1.133). 


30. There are two significant effects: theft is related to the view that drug selling is exciting (b 
= .350, S.E. = .175), and drug selling is negatively related to the view that theft is profitable
(b 537259. E. 2145 


31. We also explored the possibility that some individuals may simply perceive many 
activities, criminal and noncriminal, as dangerous, suggesting that types of people rather 
than perceptions per se matter most. Factor analysis of our perceptual measures of the 
danger of noncriminal activities reveals three dimensions: (1) mountain climbing and 
skydiving; (2) hitchhiking, taking drugs, and sleeping in public places; and (3) having sex. 
Analyses that control for the first and third factors reproduce the substantive effects for 
perceptions of crime's danger reported in the text (i.e., the point estimates and level of 
significance decrease only slightly). Although these effects decrease more substantially in 
analyses that control for the second factor, they nonetheless remain significant with a one- 
tailed test. 


32. Two effects are significant at the .05 level with a one-tailed test. The view that theft is safe 
(i.e., danger reverse-coded) interacts with adversity (b = .109, S.E. = .060) and excitement (b 
7S ESO) 


33. Tobit results mirror those for the negative binomials. 


34. In the theft equation, controls for both perceptual and street variables are required 
to reduce danger's effect to nonsignificance, whereas either set of controls is sufficient to 
reduce danger's effect to nonsignificance in the equations for drug selling and prostitution. 


35. Meanwhile, as Sampson and Laub remind us, issues of causal order can never be totally 
resolved. Instead, we should heed Hirschi and Selvin's advice: “it is ... wrong to let some 
uncertainties about causal order preclude causal inferences" (quoted in Sampson and Laub 
1993:39). 


Abstract


This paper examines the relationship between nonmarital cohabitation and routine 
exchanges of support between American adults aged 19-30 (N = 3,809) and their 
aging parents, using data drawn from the first wave of the National Survey of 
Families and Households. Cohabiting young adults were found to be significantly 
less likely to be exchanging support with their parents than their married or single, 
noncohabiting counterparts. Cohabiting young adults were also significantly less 
likely than married young adults to nominate their parents as persons to turn to in 
an emergency. Analysis of variability in exchanges of support with parents among 
cohabiting young adults revealed the quality of the relationship with the parent
to be the most potent predictor of exchanges.


Cohabitation and Exchanges of Support 


Nonmarital cohabitation has rapidly become a common feature of family life in 
American society. In 1977 about 1.1 million Americans were estimated to be in a 
cohabiting relationship (Casper and Cohen 2000). By the end of the century, the 
number had increased to 7.6 million (Fields and Casper 2001). Put another way, 
by 1995 nearly 45% of adults under the age of 45 had ever cohabited (Bumpass
and Lu 2000). This sharp increase has meant that our scholarly understanding 
of these phenomena is quite uneven. The demographic parameters of this social 
change have received the most attention. We know who is most likely to cohabit, 
what the timing, duration, and stability of these unions are, and how these unions
affect subsequent marital stability. However, our knowledge about the implications 
of cohabitation for a variety of dimensions of family life beyond marital stability 
remains fragmentary. 

This paper explores the relationship between cohabitation and intergenerational 
exchanges of support during young adulthood. This topic has been virtually 
ignored. This is somewhat surprising, given that nonmarital cohabitation is quite
popular during young adulthood and that intergenerational exchanges of support 
are crucial in assisting young adults to navigate key life transitions such as moving 
from schooling to work, buying a home, and transitioning into parenthood. We 
know that when individuals get married, exchanges of support with parents 
increase. However, marriage is becoming less common among young adults as 
a way to organize intimate relationships and often appears to be replaced by 
cohabitation. This change raises an important question: Can cohabiting couples 
count on the same kinds of support from their parents? For many married 
couples, the transition to parenthood prompts parental support. When cohabiting 
couples have a baby, do their parents respond in the same way? For many young 
adults, parental support is best characterized as a safety net. Contact and exchanges 
may be quite infrequent when there are few needs. However, when an emergency 
arises, parental support is forthcoming. Do cohabiting couples perceive a parental 
safety net in the same way as young adults (whether single or married) who are not
cohabiting? Finally, for currently cohabiting couples, what might be the factors 
associated with ties to parents? 


Background 


A number of studies show that exchanges of support between aging parents 
and their children are most prevalent when adult children are in their twenties 
(Eggebeen and Hogan 1990; Silverstein and Bengtson 1997; Soldo and Hill 
1993). This is not surprising given that young adulthood is marked by a number 
of transitions, the majority of which are best navigated with assistance from 
parents (Eggebeen and Hogan 1990; Hogan, Eggebeen, and Clogg 1993; Stack 
1972). Parents also have more resources to share with adult children. Middle- 
class parents of young adults are likely to be at a point in their lives (mid-life) in 
which they have discretionary time and money to devote to helping their children. 
Working-class parents may not have the discretionary income, but they can still 
be a strategic source of child care, temporary shelter, or social and emotional 
support (Burton and Bengtson 1985). There is considerable evidence that both 
adult children and parents expect that support should be forthcoming (Hogan 
and Eggebeen 1995). Finally, for aging parents, adult children are a significant, 
if not central, source of routine household assistance, emotional support, and 
care in times of need (Davey and Eggebeen 1998; Eggebeen 1992; Eggebeen and 
Davey 1998). 

Despite the centrality of both cohabitation and parental support to the lives 
of young adults, previous work on these subjects has been limited to examining 
financial support. Rindfuss and VandenHeuvel (1990), using data from the
National Longitudinal Study (NLS) class of 1972, found that cohabitors were more
likely than married couples, but less likely than single young adults, to depend 
on parents for more than half their financial support. In contrast, Hao (1996),
using data from the National Survey of Families and Households (NSFH), found 
that cohabiting couples were less likely than married couples to have received 
any financial transfers from parents over the previous five years. 

There are several reasons to expect that the full array of exchanges of 
support between parents and children, not just financial transfers, will differ by 
union status. First, and most obvious, it may be that cohabiting adult children 
have a more distant or strained relationship with parents than married young 
adults. The rapid increase in cohabitation makes it likely that there are sharp 
generational differences in opinions on this phenomenon. Adult children, sensing 
that their parents may not approve of their relationship, may minimize contact. 
Alternatively, generational differences on the moral legitimacy of cohabitation 
may lead to conflict that is not quickly or easily resolved, leading to distant or 
severed ties initiated by parents. Thus, cohabiting adult children are at greater risk 
of strained relationships with parents, and these distant or strained relationships 
may dampen exchanges of support and help relative to married young adults. 

Another possibility builds on the fact that cohabiting relationships are 
notoriously unstable (Bumpass and Lu 1999; Bumpass, Sweet, and Cherlin 1991). 
Parents, sensing that this relationship may not last, may be reluctant to invest time
and resources until it shows some evidence of stability or permanence—possibly 
by being transformed into a marital relationship. As a result, cohabiting young 
adults may be much less likely than their married counterparts to receive various 
types of support from parents. Assuming that cohabitation is less institutionalized 
than marriage (Nock 1995), there may be fewer expectations for support on the 
part of young adults who are cohabiting, or parents may feel fewer obligations to 
offer support to their cohabiting children. Finally, it is well documented that those 
who cohabit are more liberal in their social and political views, less religious, more 
likely to have egalitarian gender role ideologies, and more tolerant of alternative 
family forms (Clarkberg et al. 1995; Lye and Waldron 1997; Thornton et al. 
1992). In short, there are a number of plausible reasons for expecting significant 
differences in patterns of support between parents and adult children, depending 
on their cohabitation status. All of the arguments predict that there will be less 
support between cohabiting adult children and parents relative to married adult 
children, although different arguments emphasize different mechanisms. 

A closely related but distinct question is what factors contribute to closer ties
between cohabiting children and their parents. There are several possibilities.
First, the nature of the cohabiting relationship itself may influence parent- 
child relationships. When the cohabiting relationship is “marriage-like” in its 
characteristics, it may be more likely that parents accept and accommodate 
themselves to it. The longer a cohabiting relationship lasts, for example, the 
more likely it is that parents and children will establish and maintain ties. 
In a related vein, the quality and stability of the cohabiting relationship may 
influence intergenerational ties. Adult children who express unhappiness with 
the cohabiting relationship or pessimism about its long-term prospects may
be inclined to withdraw from parents. Parents who feel uncertain about the 
prospects of their child's cohabiting relationship may also limit investment in that 
relationship. Finally, cohabiting relationships that involve grandchildren are likely 
to spawn closer links between parents and children. Another possibility is that 
when it is certain that marriage is just around the corner, both parents and adult 
children may begin to strengthen their exchange relationships, possibly because 
they begin to buy into the cultural prescriptions about how parents treat their 
married children or because their relationships become closer as a result of shared 
activities surrounding the planning and execution of the marriage ceremony. 

A third possible mechanism is that the strength of the intergenerational tie 
between cohabiting adult children and their parents may be influenced by how 
much the adult child values close family relationships. Apart from the nature 
of their particular family ties, adult children may feel that parents and children 
should be close, and they act on this belief by forming more active exchange 
relationships with parents. 

Finally, of course, intergenerational support is strongly influenced by the 
quality of the relationship between aging parents and their cohabiting adult 
children. It seems reasonable to expect that, just as for married or single adult 
children, satisfying, intimate ties between cohabiting children and their parents 
will be strongly associated with a greater propensity to exchange support. 


Data and Methods 


The data for this study are drawn from the first wave of the 1987/88 National 
Survey of Families and Households (NSFH; Sweet, Bumpass, and Call 1988). This 
national survey is ideally suited to addressing the above questions. The survey is 
nationally representative, involving interviews with 13,008 respondents. It also 
contains oversamples of single-parent families, stepfamilies, and—of interest for
the purposes of this study—recently married couples and cohabiting couples. 
The true value of the survey is that respondents were asked questions that cover 
a wide variety of issues about American family life, including a number of 
questions about intergenerational ties among family members. The analytical 
sample consists of 3,809 young adult respondents (aged 19-30) who have at 
least one living parent (biological or step) and report on a variety of exchanges 
of support with their parents. 

The respondents in the NSFH answered a large number of questions about 
exchanges of support, permitting support flows between young adults and their 
biological or stepparents to be measured in a variety of ways. Specific exchanges 
are classified using six indicators, each scored as “1,” meaning yes, or “0,” meaning 
no. Exchanges of advice or emotional support are recorded for respondents who 
indicated giving (or receiving) advice, encouragement, or moral or emotional 
support to (or from) aging parents during the past month. Household assistance
is recorded if the respondent reported giving or receiving transportation, repairs 
to the home or car, or other kinds of work around the house during the past 
month. Finally, respondents are classified as giving or receiving financial support 
if they reported giving (or receiving) a parental gift or loan worth $200 or more 
during the previous five years (excluding any help with a mortgage). Overall giving 
and receiving is indexed by two variables. Reception of support is measured by 
summing the receiving variables to form a scale with a range of 0 to 3. Giving 
of support is formed the same way. Respondents were also asked to whom they 
would turn for advice, a loan, or help in a crisis or emergency. Young adults who 
designated a parent as the person they would turn to for help on any of the three 
questions are coded as having parents available as a safety net. A limitation of 
these data is that for respondents who are married or cohabiting, we do not have 
information on the exchanges with their partner's or spouse's parents. What this 
means is that we cannot draw conclusions about how relationship status affects 
the total amount of support that couples are receiving. However, this limitation 
is less serious than it seems at first glance, given that parents generally are the 
most significant exchange relationship within support networks (Hogan and 
Eggebeen 1995; Hogan and Spencer 1993; Rossi and Rossi 1990). 
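
The coding rules just described can be illustrated with a short sketch. It is not the authors' code: the field names below are hypothetical and do not correspond to actual NSFH variable names, and the sketch assumes each indicator has already been reduced to a yes/no report.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Respondent:
    # Hypothetical field names; True means the respondent reported the exchange.
    gave_advice: bool
    gave_household_help: bool
    gave_money: bool              # gift or loan worth $200 or more, previous five years
    received_advice: bool
    received_household_help: bool
    received_money: bool
    # Person named for each of the three questions: advice, a loan, emergency help.
    turn_to: Tuple[str, str, str]


def giving_scale(r: Respondent) -> int:
    """Sum the three giving indicators (each 0 or 1) into a 0-3 scale."""
    return int(r.gave_advice) + int(r.gave_household_help) + int(r.gave_money)


def receiving_scale(r: Respondent) -> int:
    """Sum the three receiving indicators (each 0 or 1) into a 0-3 scale."""
    return (int(r.received_advice) + int(r.received_household_help)
            + int(r.received_money))


def has_safety_net(r: Respondent) -> int:
    """Return 1 if a parent is named on any of the three questions, else 0."""
    return int(any(person == "parent" for person in r.turn_to))


# Illustrative respondent (made-up values, not survey data).
example = Respondent(True, False, True, True, True, False,
                     ("friend", "parent", "spouse"))
print(giving_scale(example), receiving_scale(example), has_safety_net(example))
```

Keeping the six indicators separate from the two summed scales mirrors the analysis, which models both the summary measures and the specific types of exchange.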

The key independent variable in the first part of the analysis is the current 
relationship status of the young adult. On the basis of their current union status, 
young adults are classified as married or cohabiting. Although they were not 
the focus of these analyses, we also include young adults who are not currently 
married or cohabiting (single). To take into account the effects of marital 
history, we also include a measure of whether the respondent has ever previously 
cohabited and a measure of whether the current marriage is a remarriage. 

The second half of the paper examines patterns of exchanges with parents 
among cohabiting young adults. In addition to the set of control variables 
described below, we sequentially include a series of six variables that allow us 
to test the four possible predictors of exchanges of support among cohabiting 
young adults and their parents. Three variables tap whether the cohabiting union 
is marriage-like in that it has elements of stability or permanence: the length (in 
months) of the current union, whether there are dependent children present, and 
the respondents' assessments of whether they think the cohabiting relationship 
will last. A question on whether the respondent plans to convert the cohabiting 
relationship into a marriage is used to indicate whether this union can best be 
seen as the last stage of the courtship process. The extent to which the respondents 
think close family relationships are valuable is tapped by a question asking for a 
rating of the importance of intergenerational family ties. Finally, the respondents'
ratings of the quality of their relationship with their parents are included.

Figure 1. Percent of Young Adults Giving and Receiving Support from
Parents, by Partnership Status


Also included are a number of indicators that previous work on
intergenerational relationships has shown to be important. These include a
number of other characteristics of the respondent, such as age, gender, race
(black, Hispanic, other), presence in the home of children under the age of 18, a
measure of self-rated health, a shortened form of the Center for Epidemiological 
Study-Depression Scale, years of completed education, total of all the individual 
incomes of everyone in the household (this variable is logged because it is highly 
skewed), and number of siblings. Also included in the multivariate models are 
measures of the parent or parents, and several indicators of the nature of the 
relationship between the respondents and their parents. Parent characteristics 
include a rating of parent's health (of the parent in the worst health if both are 
alive), number of years of formal education of the most educated parent, and 
a set of dummy variables indicating whether both parents are alive, mother 
only, or father only. We also included several variables that tap the nature of 
the relationship between the young adult and his or her parents. These include 
a variable measuring the young adult's subjective assessment of the quality of
the relationship. The rating scale ranges from one (very poor) to seven (excellent). (If
both parents are alive, we used the higher rating.) Distance to the nearest living
parent is in miles (logged to take account of the highly skewed distribution of 
this variable). Also included is a measure of the level of contact with the most 
contacted parent (the highest rating among visits, telephone calls, and letters 
was used). The scale for this variable ranges from one (not at all) to six (several 
times a week).
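
Two of these transformations, logging a skewed variable and taking the most favorable rating across parents, can be sketched as follows. The function names are hypothetical, and the use of log(1 + x) to handle zero values is an assumption; the text does not say how zeros were treated.

```python
import math
from typing import Optional


def log_skewed(value: float) -> float:
    """Log-transform a non-negative, highly skewed variable (income, miles).

    Assumption: log(1 + x) is used so that a value of zero stays defined.
    """
    return math.log1p(value)


def highest_rating(mother: Optional[int], father: Optional[int]) -> Optional[int]:
    """Return the higher of the available ratings; None if neither is available."""
    ratings = [r for r in (mother, father) if r is not None]
    return max(ratings) if ratings else None
```

The same highest-rating rule would apply to the contact measure, where the highest rating among visits, telephone calls, and letters is used.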

With two exceptions, missing data on eleven independent variables do not
exceed 6% on any one variable, although these missing data are fairly randomly
distributed across respondents. However, 30% of the cases have missing data
on total household income, and 11% of respondents have missing data on the
depression scale. In order to retain as many cases as possible in the analytical 
sample, we added controls for missing data on key variables (income, parent’s 
education, health status, depression, number of siblings, parent’s health, contact, 
and relationship quality). To simplify presentation, the coefficients for these 
variables are not shown in Table 2. 
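
The text does not spell out how the missing-data controls were constructed. One common approach, shown here purely as an assumption, is dummy-variable adjustment: missing values are replaced with a constant (here the mean of the observed values) and a 0/1 flag marking the imputed cases enters the model as an additional control. The function name is hypothetical.

```python
from typing import List, Optional, Tuple


def add_missing_flag(values: List[Optional[float]]) -> Tuple[List[float], List[int]]:
    """Fill missing values with the observed mean and return a 0/1 missing flag."""
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("no observed values to compute a fill value from")
    fill = sum(observed) / len(observed)
    filled = [fill if v is None else v for v in values]
    flags = [1 if v is None else 0 for v in values]
    return filled, flags


# Made-up incomes for three respondents; the second is missing.
income, income_missing = add_missing_flag([10.0, None, 20.0])
print(income, income_missing)
```

Under this approach every case stays in the analytical sample, and the flag's coefficient absorbs any average difference between cases with and without reported values.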


Results 


Figure 1 displays the variations in giving, receiving, and expecting help between 
young adults and their parents by marital status of the young adults. There are 
clear differences, which chi-square tests show as statistically significant, with 
married young adults more likely than cohabiting young adults to be giving and 
receiving support and much more likely to expect support from their parents 
in a crisis. Single young adults show less dramatic differences. They are slightly 
more likely than cohabitors to be giving support, have approximately the same 
likelihood of receiving support, but are much less likely to expect support from 
parents in a crisis. The question is: Do these differences remain after taking into 
account characteristics of the young adults, their parents, and various aspects of 
their relationship? 

Table 1 contains the ordinary least squares (OLS) and logistic regression 
models of three summary measures and three specific types of support. The 
overall patterns in these results suggest that the observed differences in Figure 1 
are fairly robust. Even when characteristics of the adult children, their parents, 
and the nature of their relationship are taken into account, married adult children 
are more likely than currently cohabiting adult children to be in an exchange 
relationship with their parents. Married young adults are significantly more likely 
than cohabiting young adults to be receiving assistance from and giving assistance 
to their parents. They are also more likely to nominate their parents as persons 
they would turn to in an emergency. These summary measures obscure some 
variations among specific types of exchanges. Giving and receiving assistance with 
household chores is particularly affected by cohabitation status. Receiving advice 
is modestly associated with being married. However, giving advice and exchanges 
of money show no differences by cohabitation or marital status. 

Single young adults differ only slightly from their cohabiting counterparts.
Singles are nominally more likely to be receiving help from parents, with most of 
this help in the form of work around the house. There are no differences between 
single adults and cohabiting adults in giving help of any type to parents or in 
expectations of support in times of need.


Table 2. Factors Associated with Variations in Exchanges of Support
among Cohabiting Young Adults and Their Parents

                                                  Giving        Receiving     Potential
                                                  Support       Support       Help

Do you plan to marry?                             [illegible]   -.01          [illegible]*
Quality of relationship with parent               [illegible]   [illegible]   [illegible]
Importance of intergenerational family ties       -.01          .05           .06
Will you and your partner eventually separate?    -.05          -.03          -.03
Length of current union                           -.004**       -.004**       .001
Presence of dependent child                       -.13          -.14          -.03

Notes: Columns one and two are OLS regression models. Column three is a logistic regression model.
Other variables in the models are: age, gender, race, education, income, distance, mother only alive, father
only alive.

* p < .05  ** p < .01

One of the explanations offered as to why we should expect fewer exchanges 
of support between cohabiting young adults and their parents is that cohabitors 
may be more likely to have a distant or strained relationship with their parents. 
However, the differences in exchanges with parents between cohabitors and 
married young adults remained even with controls for contact with parents and 
a measure of relationship quality. These results leave open the possibility of other 
explanations for the observed differences in exchanges. 

Table 2 contains models testing the four explanations of variations in 
intergenerational ties with parents among cohabitors. First, it was argued that 
the nature of the cohabiting relationship might be influential in that more 
marriage-like or stable cohabiting unions are more likely to be characterized by 
closer ties between generations. The evidence does not support this. The presence 
of children (one possible indicator of a cohabiting union being marriage-like) 
is not related to giving support to parents or receiving support from parents. 
Nor is presence of children associated with turning to parents in times of need. 
Respondent evaluations of the potential for eventual breakup—an indicator 
of union stability—are unrelated to ties to parents. Finally, the length of the 
cohabiting union, which we predicted to be positively related to exchanges with 
parents, is actually negatively related to giving and receiving support. 

A second explanation is that young adults in cohabiting relationships who 
appear to be in the last stage of the courtship process are more likely to have close 
ties with their parents. There is evidence that the likelihood of giving support is 
higher for cohabitors who have plans to marry. They also are slightly more likely 
to turn to parents in an emergency. There is also some evidence for the third 
explanation: that ties with parents may be influenced by how much the adult 
child values close family relationships. Adult children who are of the opinion that 
intergenerational family ties are important are also more likely to be receiving 
support from their parents. 

Finally, it was suggested that ties between cohabiting young adults and their 
parents are influenced by the quality of their relationship. There is strong evidence 
for this. The more positively adult children evaluate their relationships with their 
parents, the more likely they are to participate in exchange relationships and to 
indicate they would turn to their parents in an emergency. 


Discussion and Conclusions 


Young adulthood is a period of life characterized by a number of turning points 
and transitions. It is generally assumed that support from family members, most 
notably parents, can be crucial for assisting young adults through these turbulent 
times. Surprisingly, however, little empirical attention has been given to what 
factors promote or inhibit supportive intergenerational relationships during 
this life period. The purpose of this analysis was to explore the relationship of 
nonmarital cohabitation—an increasingly popular life style choice among young 
adults—and intergenerational support. 

The results clearly show that cohabiting young adults are significantly less likely 
than married young adults to be in exchange relationships with their parents. Most 
notably, cohabiting young adults are much less likely than their married or single 
compatriots to be giving and receiving assistance with household tasks. They are 
marginally less likely to be giving any type of support to their parents. Finally, 
they are less likely than married young adults to nominate their parents as persons 
they would turn to for emergency support. Significantly, these results stand even 
in the context of adjustments for a variety of characteristics of the young adults, 
their parents, and the nature of the relationship between them. There is also good 
reason to view these differences as a conservative estimate because of the focus 
on exchanges of support between nonresidential adult children and their parents. 
If coresidence were included as a form of intergenerational support, it is likely 
that the differences in support exchanges with parents among cohabiting young 
adults and those who are married or single would be even larger. Coresidence as 
a form of support by parents is very common in young adulthood, is used mostly 
by those who are single, and is probably rarely used by those who cohabit. 

Although these analyses were not set up to test explicitly why cohabiting 
young adults have weaker ties to their parents, the findings offer insights about 
the explanations described in the introduction. First, differences between 
cohabitors and noncohabitors in their relationships with parents cannot be the only 
explanation of the gap in exchanges, given that neither level of contact nor a measure 
of the quality of the relationship significantly changed the effect of cohabitation. 
Selectivity into cohabitation remains a possible explanation. However, a crude 
proxy for selective differences—a measure of previous cohabiting experiences— 
did little to change the observed differences. 

A possible explanation is that cohabitation remains largely uninstitutionalized. 
It seems reasonable to suppose that, faced with uncertainty about what their role 
as parents should be in the lives of their cohabiting adult children, parents retreat. 
This retreat is consistent with the observed pattern in these data, which show that the 
starkest difference between married and cohabiting children is in the receipt 
of support from parents. Similarly, adult children, who face their own uncertainty 
about the permanence of their cohabiting relationship, may make few, if any, 
demands on parents. They may also be less likely to engage in the behaviors of 
married couples that create and strengthen the ties between them and parents, 
such as visiting on vacations, spending holidays together, or involving parents in 
family events and rituals. The resulting social distance is likely to create barriers 
to flows of routine kinds of assistance. 

So which cohabiting couples are likely to be most involved in exchanges of 
routine support with their parents? We had speculated that young adults 
in cohabiting unions that were more marriage-like, cohabiting unions that 
were in the last stage of courtship, cohabiting unions where the young adult 
valued close family ties, and cohabiting unions where the young adult had 
a positive relationship with the parent would all be more likely to be in an 
exchange relationship with their parents. The evidence suggests that quality of 
the relationship between cohabiting adult children and their parents is the most 
important predictor of exchanges of support. 

An unexpected finding was that the length of the cohabiting union is 
negatively related to exchanges of support. We had argued that the more stable 
the cohabiting union, the more likely there would be exchanges because parents 
might be more accepting of the relationship or might feel less uncertainty about 
its stability and begin to invest in it. However, one could also argue that cohabiting 
unions that remain stable without being converted into marriage or ending 
within a short period of time might represent a different kind of relationship 
(and different kinds of young adults). These young adults might have lower levels 
of exchanges with parents because their choice of this alternative lifestyle might 
create significant strains in their relationships with their parents. 

These results need to be viewed as tentative for a number of reasons. 
Explanations of why these differences exist have not been adequately addressed. 
Significantly, it is not clear whether cohabitation per se leads to fewer exchanges 
or whether young adults already more isolated from parents are more likely to 
be cohabiting. Nor have these analyses established the magnitude of the effects 
of differences in exchanges. Statistically significant differences do not necessarily 
translate into practically significant differences. The focus of this analysis is on 
routine kinds of exchanges and a measure of hypothetical support. Missing, 
however, is whether these differences in relationship status would play out when 
support is needed for an actual crisis. There is evidence that family members 
respond with help much more forthrightly in a crisis in the lives of aging 
parents (e.g., a sudden illness, death in the family, or a financial crisis) than to 
routine needs (Eggebeen and Davey 1998). While these analyses have established 
the overall relationship between cohabitation and exchanges, the prospect of 
significant subgroup variations remains. Whether this cohabitation effect exists 
for individuals living in poverty or among minority groups remains to be seen. 
Finally, it must be kept in mind that biological or stepparents are not the only 
sources of potential support. Fewer exchanges of support between cohabiting 
young adults and their parents do not mean that they are at risk of not obtaining 
needed help. A lack of exchanges of support with parents may well be offset by 
more extensive ties with partners' parents, friends, neighbors, or other relatives. 

Cohabitation is likely to remain a significant feature of young adulthood either 
as a part of the courtship process leading to marriage or, for some, an alternative 
to traditional marriage. Young adults are also likely to remain in need of the 
support, resources, and care of parents as they move through their twenties and 
into middle age. Understanding how choices about managing intimate unions 
affect and are affected by the larger family context should be a significant part 
of the research agenda of family scholars. 
Parental Status and Differential Investment in 
Sons and Daughters: Trivers-Willard Revisited* 


ROSEMARY L. HOPCROFT, University of North Carolina at Charlotte 


Abstract 


According to the Trivers-Willard (T-W) hypothesis there is an interaction between 
individual status and investment in offspring such that high-status individuals 
invest more in boys, and low-status individuals invest more in girls. Parental 
investment may be both physiological and affect the sex ratio of offspring, or it may 
be psychological and affect resource allocation to offspring. I test both components 
of the hypothesis using cumulative U.S. General Social Survey (GSS) data and find 
results that support T-W. Using years of education attained as an indirect measure 
of parental investment, I find that sons of high-status fathers attain more education 
than daughters and that the daughters of low-status fathers attain more education 
than sons. Supporting the sex ratio component of T-W, I find that high-status men 
have more sons. 


Sociological studies of status attainment have demonstrated the importance of 
parental investment in the educational, financial, and cultural capital necessary 
for status attainment in contemporary American society (Blau and Duncan 1967; 
Hauser and Featherman 1977; Hauser and Mossel 1985; Steelman and Powell 
1991; Bianchi and Robinson 1997; Keister 2000; Conley 2001; Treiman 2001). 
Parents and families are also important in shaping personality characteristics that 
are crucial to educational and occupational attainment, such as career motivations 
and educational aspirations (Jacobs, Karen, and McClelland 1991; Grabowski 
et al. 2001; Smith 1991; Teachman and Paasch 1998; Bowles and Gintis 2002; 
Breen and Goldthorpe 2002; Buchmann and Dalton 2002; Crosnoe, Mistry, and 
Elder 2002). The family, as one of the primary socializers of children, is also a 
source of the differential sex roles that influence both career and educational 
aspirations (Correll 2001, 2004) and therefore the occupational outcomes of 
men and women. 

These family processes are social processes. However, there is evidence that 
these family processes interact with the biological predispositions specific to 
each individual in the family. Behavioral genetics research suggests that genetic 
predispositions are important shapers of personality characteristics, including 
work styles and aspirations (Dunn and Plomin 1990). That is, at least some 
of the shaping of personality characteristics and career aspirations that occurs 
within the family is a reaction to each child as an individual. Jencks et al. (1979) 
noted that IQ, which is also responsible in part for educational attainment and 
status attainment, is about 50% inherited (see also Kerckhoff, Raudenbush, and 
Glennie 2001). Thus, each child's actual educational and occupational attainment 
is in part a function of the capabilities and personality of the child (Bond and 
Saunders 1999; Bowles and Gintis 2002; Breen and Goldthorpe 2002; Chen and 
Kaplan 2003; Conley 2004). 

There is also evidence that these family processes interact with the biological 
relatedness of family members to each other. How effectively parents invest in 
children is also related to whether or not the family is intact and the presence of 
nonbiologically related parents and/or children in the family. For example, Biblarz 
and Raftery (1999) have presented the counter-intuitive finding that stepfamilies 
are less able to promote the occupational mobility of children than female-headed 
families. Case, Lin, and McLanahan (2001) show that, all else being equal, children 
in stepfamilies attain less education than children in intact families. 

One issue on which there are conflicting results is whether parents preferentially 
invest in the education of sons or daughters. In the contemporary U.S., unlike 
in many other societies, there is some evidence that girls receive more support 
for higher education than boys (Behrman, Pollack, and Taubman 1986). Some 
scholars have found that the presence of sisters, not brothers, hurts a woman's 
chances for educational attainment (Butcher and Case 1994), although others 
have not replicated this finding (Kaestner 1997). Girls are now the majority of 
college undergraduates, although they remain outnumbered by males in some 
professional and graduate schools (Jacobs 1996). This outcome is not what might 
be expected given traditional sex roles in the United States, which suggest that 
parents will invest more in the education of boys than in the education of girls. 
Consistent with traditional sex roles, other scholars have found that the number 
of brothers poses a greater obstacle than does the number of sisters to parents' 
financial support of their children's college education (Steelman and Powell 1989). 
Similarly, Conley (2000) found that the actual educational attainment of women 
is hurt more by the presence of brothers, while the educational attainment of men 
is hurt more by the presence of sisters. Steelman and Powell (1991) also found 
that parents were more willing to go into debt to support their child's college 
education if their child was a boy than if the child was a girl. Other scholars find 
no effect of sex of child or sex composition of siblings on educational attainment 
(Jacobs 1996; Kaestner 1997) or on parental expectations for higher education 
(Rosen and Aneshensel 1978). 

The reason for these mixed and sometimes conflicting results may be an 
interaction between the child's sex, parental status, and amount of parental 
investment. In fact, there is a long-standing hypothesis from evolutionary biology 
that predicts just such an interaction. This hypothesis, known as the Trivers- 
Willard hypothesis, predicts that high-status individuals will invest more in sons, 
and low-status individuals will invest more in daughters (Trivers 1972; Trivers and 
Willard 1973). This hypothesis was originally tested with species other than our 
own, where the majority of parental investment in offspring is strictly biological 
and limited to the gestation and nursing of young (e.g., Clutton-Brock, Albon, 
and Guiness 1984). Here the hypothesis takes a sex ratio form, with investment 
limited to the actual physiological investment of mothers in the production 
of either a son or daughter. In this form, the hypothesis suggests that mothers 
in good condition are more likely to give birth to sons, and mothers in poor 
condition are more likely to give birth to daughters. 

Important for our purposes here, the Trivers-Willard hypothesis is not just 
applicable to the sex ratio at birth, but also to investment in offspring after birth. 
In humans, of course, these investments are extensive and include not only 
nurturing of children while they are young, but also investments in education 
and the social and cultural development of children. Trivers and Willard themselves 
explicitly extended their hypothesis to humans, writing: “the model can be applied 
to humans differentiated on a socioeconomic scale, as long as the reproductive 
success of a male at the upper end of the scale exceeds his sister’s, while that of 
a female at the lower end of the scale exceeds her brother’s” (Trivers and Willard 
1973). Reproductive success is how evolutionary biologists refer to the number 
of surviving offspring—the sine qua non of evolution. 

The logic of the Trivers-Willard hypothesis relies on a sex difference in the 
variance in reproductive success. This sex difference is found in the vast majority 
of sexually reproducing species, including humans. That is, highly reproductively 
successful men have many more children than highly reproductively successful 
women. On the other hand, less successful men are often shut out of reproduction 
altogether either through early death, disability, or destitution, while females are 
more rarely shut out. Another way of looking at this difference between male and 
female reproductive success is in terms of risk—males are a riskier reproductive 
bet than females. Like an investment in a reproductive stock market, males can 
give high reproductive returns, but they can also give none. Like an investment in 
a reproductive bank, females cannot give such high returns but are very unlikely 
to give none at all. 
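
The risk argument can be made concrete with a small numerical sketch. The two offspring distributions below are hypothetical, chosen only so that sons and daughters have the same mean number of offspring while differing in spread; none of the numbers come from the data analyzed in this article.

```python
import math

# Hypothetical probability distributions over number of surviving offspring.
# These values are illustrative assumptions, not estimates from any data set.
daughters = {0: 0.05, 2: 0.70, 3: 0.20, 4: 0.05}
sons = {0: 0.40, 2: 0.20, 4: 0.30, 6: 0.10}


def mean(dist):
    """Expected number of offspring under a discrete distribution."""
    return sum(k * p for k, p in dist.items())


def variance(dist):
    """Variance of the number of offspring under a discrete distribution."""
    m = mean(dist)
    return sum(p * (k - m) ** 2 for k, p in dist.items())


# Print mean, variance, and the probability of leaving no offspring.
for label, dist in (("daughters", daughters), ("sons", sons)):
    print(label, round(mean(dist), 2), round(variance(dist), 2), dist[0])
```

In this sketch both sexes average 2.2 offspring, but sons have a much larger variance and a far higher chance of leaving none, which is the sense in which males are the riskier reproductive bet.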

According to Trivers-Willard, given this sex difference in reproductive 
variance, it will be in the reproductive interests of individuals to fine tune their 
investments in progeny by sex—in good conditions where external risks are lower, 
go for the riskier option (males); in poor conditions, go for the less risky option 
(females). The analogy with financial investments holds—in good conditions (a 
“bull” market), go for risky investments (stocks); in poor conditions (a “bear” 
market), go for the less risky options of bonds and bank accounts. Just as the 
latter maximizes investment income, the former maximizes reproductive success. 
Here the analogy ends, however. Financial investment is guided by a conscious 
strategy to maximize returns; reproductive investment need not be. It is simply 
that in animal and human evolution, those behavioral strategies that have in the 
past favored reproductive success have been selected and have proliferated in the 
population; those strategies that did not favor reproductive success have literally died 
out. The Trivers-Willard hypothesis says nothing about conscious motivations, 
which may or may not be present. 

A key prerequisite for the logic of Trivers-Willard is the sex difference 
in variance of reproductive success (number of surviving offspring). This 
prerequisite appears problematic in contemporary industrialized societies where, 
given artificial contraception, the reproductive success of males and females is low 
and often appears very similar. That is, high-status males do not appear to 
outreproduce high-status females. Nevertheless, we remain physiological products 
of our evolutionary history, and we may have physiological adaptations to an 
evolutionary environment where high-status males did outreproduce high-status 
females. Furthermore, evolutionary psychology (Tooby and Cosmides 1990, 1992; 
Maryanski and Turner 1992; Cosmides, Tooby and Barkow 1992:5; Buss 1999; 
Cziko 2000) would suggest that we also have psychological adaptations to the 
environment of evolutionary adaptiveness (EEA). The EEA is the Pleistocene in 
Africa, when early humans lived as hunter-gatherers in societies of relatively small 
groups of people (80-100) with little economic stratification. 

Was there a sex difference in the variance in reproductive success in the 
EEA? We will never know for sure, but studies of contemporary hunter-gatherer, 
nomadic pastoral, and other simple societies show status differentials among 
men, and that typically high-status males are the most reproductively successful 
individuals in the group in terms of producing the largest number of offspring 
(Lévi-Strauss 1955:370; Chagnon, Flinn, and Melancon 1979; Irons 1979; Hill 
1984; Turke and Betzig 1985; Betzig 1993; Casmir and Rao 1995). Status in 
such groups was typically conferred on individuals for their hunting, fighting, 
leadership, healing, or religious skills. Lévi-Strauss (1955) notes that for the 
headman the privilege of having multiple wives was his only compensation for 
the burdens of leadership. Of course, if high-status men in such societies obtain 
more than one wife, then low-status men may not obtain a wife at all, creating a 
higher variance in reproductive success for men than for women. This evidence 
suggests that in the similar environment of Pleistocene hunter-gatherers, the 
sex difference in reproductive variance that is the basis of the Trivers-Willard 
hypothesis did exist.¹ 

Given this situation in the evolutionary environment, by the logic of Trivers- 
Willard, there may exist evolved psychological and physiological mechanisms 
that prompt high-status parents to invest more in sons and low-status parents 
to invest more in daughters, regardless of any contemporary sex differences in 
reproductive success. Given such evolved mechanisms, we can expect high-status 
parents to invest more in sons, and low-status parents to invest more in daughters, 
even if males are not actually more reproductively successful in the contemporary 
environment. Investment of physiological resources will be manifest in the sex 
ratio at birth; investment of material resources will be manifest in differential 
resource allocation (e.g., providing education) by sex. Once again, this differential 
investment may or may not be conscious. Certainly, the evolutionary reason for 
the differential investment will not be conscious. No one is expected consciously 
to say to themselves: I am going to invest more in this sex child rather than 
the opposite sex child because of a difference in expected reproductive success. 
Furthermore, this differential investment in children by status of parents may 
mean that, on average, boys and girls in the society are invested in equally, and 
so overall there is no relationship between the sex of the child and parental 
investment received. 

Application of Trivers-Willard to humans has been controversial, for both 
theoretical and empirical reasons. On a theoretical basis, many claim that 
hypotheses from evolutionary biology are simply not applicable to culture- 
bearing, modern humans (Gould 1997; Rose and Rose 2000; Marks 2002). 
Sociologists have frequently rejected any introduction of biology into sociology 
as deterministic, reductionist, or worse, ideologically biased (e.g., Lorber 1993). 
However, as previously noted, more and more sociological studies, especially in the 
areas of family and demography, have demonstrated the interaction of social with 
biological processes. Certainly, a hypothesis from evolutionary biology should not 
be rejected simply because it is from biology. 

Second, on an empirical basis, previous tests of both the resource allocation 
version and the sex ratio version of the Trivers-Willard hypothesis with human 
data have given mixed results. A great deal of evidence consistent with the 
resource allocation version of Trivers-Willard has been found among preindustrial 
populations or populations in less developed countries (e.g., in Kenya, Cronk 
1989, 1993, 2000; Borgerhoff Mulder 1987, 1988, 1998; among gypsies in 
Hungary, Bereczkei and Dunbar 1997; in Poland, Koziel and Ulijaszek 2001). Yet 
evidence in industrialized countries is more conflicting. There is some evidence 
of differential resource allocation to infants by parents in industrialized societies 
(Abernathy and Yip 1990; Gaulin and Robbins 1991). Smith, Kish, and Crawford 
(1987) found that wealthy Canadians left larger bequests to their sons than their 
daughters, while the reverse was true for poorer Canadians, but this was not 
replicated in California (Judge and Hrdy 1992). Two recent prominent studies 
of contemporary American society found no Trivers-Willard effect (Freese and 
Powell 1999; Keller, Neese, and Hofferth 2001). Only Kanazawa (2001a) found 
some evidence of this effect in terms of time spent with adolescents aged 10-15; 
also Cox (2003) found evidence of this effect in his (limited) analysis of child 
education by parental wealth. 

I suggest, however, that the measures of parental investment used in all these 
previous tests of the resource allocation version of the Trivers-Willard hypothesis 
in contemporary industrialized societies are flawed. All of these studies use 
self-reports of direct, proximate measures of investment, typically at relatively 
young ages. For example, Freese and Powell (1999) examine parental investment 
in adolescents in the form of measures such as saving money for college, use of 
private school, having educational objects in the home, talking to children about 
school, monitoring of children’s behavior, while Keller, Neese, and Hofferth (2001) 
look at even more direct measures of investment in babies and small children 
including hours per week spent with children, self-reported warmth, and 
months breastfed (see also Gaulin and Robbins 1991). These measures are all 
based on self-reports, which themselves may sometimes be biased given our 
culture’s egalitarian values (Keller, Neese, and Hofferth 2001:356; for an example 
of biased self-reports, see Cronk 1991). However, it is also the case that in a wealthy 
society such as the U.S., large sex differences in these kinds of investments at 
these early stages are unlikely. That is, there are abundant resources available for 
parents to be able to make these early investments in all children regardless of 
sex. Evolutionary theory suggests that in such a situation of unusual abundance 
all children, irrespective of sex, will be highly valued and invested in by their 
parents (Hrdy 2000). 

Moreover, as status attainment research in sociology has clearly shown, status 
attainment in American society requires much more than just investments in 
children as babies and adolescents; it requires ongoing investments in very lengthy 
periods of education and occupational training. High-status positions in American 
society require extensive training and education periods, in fact longer periods 
than most other contemporaneous advanced industrial societies. (For example, 
in many countries a medical degree is an undergraduate degree, not a graduate 
degree as it is in the U.S.) It is provision of support over such long periods of 
time that is the scarce resource in American society, not the provision of basic 
resources and support while children are young. I suggest, therefore, that the focus 
of studies testing the resource allocation version of Trivers-Willard should be sex 
differences in parental provision of access to extensive training and education and 
encouragement of this process. In this study, I use years of education attained as 
an indirect but behavioral measure of parental investment. 


Hypothesis 1: Among high-status families, boys will be more likely to 
be supported in further education and therefore attain more years of 
education than girls. Among low-status families, girls will be more likely 
to be supported in further education and therefore will attain more years 
of education than boys. 
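
Hypothesis 1 amounts to a crossover interaction between parental status and the child's sex. A minimal sketch of that logic, using invented cell values rather than GSS data, compares the son-daughter gap in years of education across two status groups; the difference between the two gaps plays the role of the interaction term in a regression of education on sex, status, and their product. Splitting status into "high" and "low" is an assumption made purely for illustration.

```python
from statistics import mean

# Invented years of education by (father's status, respondent's sex).
# Illustrative values only; they are not drawn from the GSS.
years = {
    ("high", "son"): [16, 17, 15, 16],
    ("high", "daughter"): [15, 16, 15, 14],
    ("low", "son"): [11, 12, 10, 11],
    ("low", "daughter"): [12, 13, 12, 11],
}

# Mean years of education in each status-by-sex cell.
cell = {key: mean(values) for key, values in years.items()}

# Son-minus-daughter gap within each status group.
gap_high = cell[("high", "son")] - cell[("high", "daughter")]
gap_low = cell[("low", "son")] - cell[("low", "daughter")]

# Difference between the gaps: the status-by-sex interaction.
interaction = gap_high - gap_low

# Overall son-minus-daughter gap, ignoring status.
all_sons = years[("high", "son")] + years[("low", "son")]
all_daughters = years[("high", "daughter")] + years[("low", "daughter")]
overall_gap = mean(all_sons) - mean(all_daughters)

print(gap_high, gap_low, interaction, overall_gap)
```

With these made-up numbers the gap favors sons in the high-status group and daughters in the low-status group, while the overall son-daughter gap is zero. Offsetting effects of this kind can hide any main effect of the child's sex.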


Given the correlation between class standing and racial classification in 
American society, hypothesis 2 also follows: 


Hypothesis 2: Among African American families, girls will be more likely 
to be supported in further education and therefore attain more years of 
education than boys. 


For the sex ratio hypothesis, once again the empirical evidence is conflicting. 
Studies that have supported the Trivers-Willard sex ratio hypothesis include those 
reported by Mueller (1993), Betzig and Weber (1995), Chacon-Puignau and 
Jaffe (1996), Gibson and Mace (2003), and Catalano (2003); those which have 
not supported the hypothesis include Essock-Vitale (1984), Mueller (1993—for 
nineteenth- and early twentieth-century British elite), Marleau and Saucier 
(2000), and Ellis and Bonin (2002). The mechanism typically proposed for the 
effect of status on sex ratio is the condition of the mother: high-status mothers 
are more likely to be in good condition than other mothers, and mothers in 
good condition are better able to carry a male fetus to term (Catalano 2003). 
High-status women are therefore more likely to have boys among their offspring. 
However, to my knowledge there have been no tests of the sex ratio version of the 
hypothesis using representative data from the general U.S. population. All previous 
tests (with both positive and negative findings) have been on convenience samples. 
For example, Ellis and Bonin (2002) used convenience samples of college students; 
Marleau and Saucier (2000) used convenience samples of pregnant women in 
Montreal, Canada. In this study I am able to test the sex ratio hypothesis with 
information from a representative sample of Americans. 

Hypothesis 3: The sex ratio of offspring will vary by status, with higher-status 
individuals having more boys among their biological offspring than 
lower-status individuals. 
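
One simple way to examine a prediction like Hypothesis 3 is to compare the share of sons among the children of higher- and lower-status parents. The sketch below uses a pooled two-proportion z test with invented counts; it is offered only as an illustration of the comparison, not as the procedure used in this article, and the function name and numbers are hypothetical.

```python
import math


def two_proportion_z(sons_a, births_a, sons_b, births_b):
    """Return (z, two-sided p) for the difference in the share of sons."""
    p_a = sons_a / births_a
    p_b = sons_b / births_b
    # Pooled share of sons under the null hypothesis of no difference.
    pooled = (sons_a + sons_b) / (births_a + births_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / births_a + 1 / births_b))
    z = (p_a - p_b) / se
    # Two-sided tail probability of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts: sons among children of higher- and lower-status parents.
z, p_value = two_proportion_z(540, 1000, 500, 1000)
print(round(z, 2), round(p_value, 3))
```

A positive z indicates a higher share of sons in the first group. Whether the difference is judged reliable depends on the chosen significance threshold and on sample size.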


Data and Methods 


To test these hypotheses, I use the cumulative data from the 1972-2000 General 
Social Surveys (GSS) conducted at the National Opinion Research Center at the 
University of Chicago (Davis and Smith 1998). Each survey is an independently 
drawn, multistage probability sample of noninstitutionalized, English-speaking 
persons aged 18 or over living in the United States. These surveys are widely 
available, reputable surveys of the American population. 
TEST OF RESOURCE ALLOCATION HYPOTHESIS 


Dependent Variable—Measure of Parental Investment 


To measure parental investment, I use respondents’ actual educational attainment 
in terms of years of education (actual number, range 0-20 years). As a measure 
of investment, educational attainment is admittedly crude and does not get at 
any difference in quality or type of education attained (which also differs by 
sex). Furthermore, educational attainment is not a direct measure of parental 
investment. However, it may be considered an indirect measure of parental 
investment, in that parental investment greatly increases the likelihood of attaining 
high levels of education. Given that investment in children’s education is perhaps 
the primary investment parents make in their children in contemporary U.S. 
society, years of education attained will reflect parental investment better than 
any other indirect measure. Such investment may be in the form of financial 
support, including gifts of money, food, and goods; or it may be in the form of 
emotional support, including encouragement and moral support. Clearly, not all 
individuals who attain higher levels of education do so with such parental support, 
but many do. Furthermore, those with parental support are more likely to achieve 
high levels of educational attainment than those without parental support. Thus, 
years of education attained may be considered an indirect measure of parental 
investment, if not a perfect one. 


Independent Variable—Measures of Parental Status 


Previous studies testing Trivers-Willard have used parental education and income 
as measures of parental status. These are reasonable, if incomplete, measures of 
social status. Here, I use four measures of the status of the father's (or substitute 
father's) occupation when the respondent was 16 years old: two measures of 
socioeconomic status (SEI)—SEI 70 and SEI 80—and two measures of occupational 
prestige—Prestige 70 and Prestige 80. There are two of each type of measure 
simply because occupation variables from earlier surveys were coded with one 
set of census codes (1960-70 codes), and occupation variables from later surveys 
were coded with a second set of census codes (1980). There is a slight overlap of 
not more than 10% between the samples using the 1970 codes and those using 
the 1980 codes. 

The SEI measures were computed from measures of education, income, and 
prestige in a manner similar to that employed by Duncan (Blau and Duncan 
1967). The first SEI measure (SEI 70) applies to respondents whose fathers' 
occupations were coded according to the 1970 U.S. census codes, and the second 
(SEI 80) to respondents whose fathers' occupations were coded according to 
the 1980 census codes. These measures of 
status have been used in a variety of sociological studies of status attainment 
(e.g., Blau and Duncan 1967; Hauser and Featherman 1977; Kerckhoff et al. 1989; 
Rytina 2000). Occupational prestige measures social standing of the occupation. 
The measures of occupational prestige here were taken from ratings systems 
developed at NORC in 1963-65, and updated in 1989. 

These measures of father’s occupational status are preferable to simply income 
or education because they measure social standing, which is arguably a better 
measure of parental status than simply income or education alone. Father’s 
status is used rather than mother’s status because father’s occupational status 
has traditionally been higher than mother’s occupational status and thus more 
important in determining the status of the family. Although it may be argued 
that male occupational status is no longer as important in determining the status 
of the family as it once was, the continuing importance of male occupational 
status is suggested by the finding that male occupational status (but not female 
occupational status) continues to be an important factor in marriage formation 
(Oppenheimer 2000). 


Control and Other Independent Variables 


Given that years of education will be influenced by age of respondent, age is an 
important control variable and is measured as respondent’s age (in years). 

Two additional control variables are included because they may be expected 
to decrease and augment, respectively, the resources available to each child net 
of parental status. The first variable, number brothers and sisters (actual number, 
including step and adopted siblings), is an important control variable because 
number of siblings can be expected to dilute the level of parental resources 
available for each child. Previous studies have shown that the number of siblings a 
person has decreases their level of educational attainment net of other factors (e.g., 
Steelman and Powell 1989; Downey 1995). The second variable is family income 
when the respondent was 16 (coded 1 = far below average to 5 = far above average). 
It is an important control variable because it is a measure of resources available 
to each child, separate from the measures captured by the variable occupational 
status of father. Measures of family income have also regularly been shown to 
have a positive effect on educational attainment (Steelman and Powell 1991). 

In the analyses which included all cases, a weight variable, weighting, was 
used to compensate for those surveys where African Americans were 
oversampled. Sex of respondent is measured as a dummy variable, Sex (Male = 1). 


TESTS OF SEX RATIO HYPOTHESIS 


I test the sex ratio hypothesis in two ways. First, given that the GSS is based 
on probability samples of the American population, if high-status individuals 
are indeed more likely to have male offspring, then male respondents should 
disproportionately have high-status parents. Once again, parental status is 
measured using the occupational status of the father, so male respondents 
should be more likely to have high-status fathers. To test this, I correlated sex of 
respondent with both the occupational prestige (Prestige 70 and Prestige 80) and 
socioeconomic status (SEI 70 and SEI 80) of their fathers. 
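
The first test can be illustrated with a minimal sketch of a Pearson correlation between a 0/1 sex dummy and a father's status score. The function name and the six toy records are hypothetical and are not GSS data; the sketch also ignores the survey weights used in the actual analyses. 

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


# Hypothetical toy records (not GSS data): sex dummy (Male = 1) and
# father's occupational prestige score for six respondents.
male = [1, 0, 1, 0, 1, 0]
father_prestige = [60, 35, 52, 40, 48, 44]

r = pearson_r(male, father_prestige)
print(f"correlation between sex and father's prestige: {r:.3f}")
```

With a 0/1 variable on one side, this is the point-biserial correlation: a positive value means that male respondents in the sample have, on average, higher-status fathers. 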

Second, in 1994, respondents were asked detailed questions about each of up 
to nine children, including their sex and whether they were biological, adopted, 
or stepchildren. Using these variables, for each respondent I calculated the total 
number of biological children of each sex, and then calculated the percentage of 
biological children who were boys (percent boys). I then used percent boys as the 
dependent variable in OLS regression analyses. Respondent status is measured in 
three ways: the respondent's own occupational prestige (Prestige 80), socioeconomic 
status (SEI 80), and intelligence (number of words correct). This last variable is the 
number of words the respondent knew on a short vocabulary test and is highly 
correlated with measured IQ (Vining 1986), which can be regarded as another 
measure of respondent social status. 
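
The construction of percent boys described above can be sketched as follows. The record layout (a list of sex and relation pairs per respondent) and the function name are assumptions made for illustration; they are not the GSS variable coding. 

```python
def percent_boys(children):
    """Percentage of a respondent's biological children who are boys.

    `children` is a list of (sex, relation) pairs, e.g. ("male", "biological").
    Adopted children and stepchildren are excluded from the calculation.
    Returns None when the respondent reports no biological children.
    """
    biological = [sex for sex, relation in children if relation == "biological"]
    if not biological:
        return None
    boys = sum(1 for sex in biological if sex == "male")
    return 100.0 * boys / len(biological)


# Hypothetical respondent with two biological children and one stepchild.
example = [("male", "biological"), ("female", "biological"), ("male", "step")]
print(percent_boys(example))
```

Respondents for whom the function returns None have no defined value on the dependent variable and would drop out of a regression on percent boys. 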


Results 


TESTS OF RESOURCE ALLOCATION HYPOTHESES 


The results of the test of hypothesis 1 using years of education attained as the 
dependent variable are given in Table 1. This analysis includes all cases because the 
logic of Trivers-Willard pertains to status relative to all others in the society. 

There are three models for each of the four measures of father's occupational 
status (SEI 70, SEI 80, Prestige 70, and Prestige 80). Model 1 contains as predictors 
respondent sex, father's occupational status, the interaction between sex and 
father's status, and respondent age; Model 2 adds number of siblings; Model 3 
adds family income when respondent was aged 16. 
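
To make the role of the interaction term concrete, the sketch below fits a deliberately simplified version of Model 1 that contains only sex, father's status, and their product (age, siblings, family income, and survey weights are omitted). With those three predictors alone, the coefficients can be recovered exactly from separate simple regressions for women and men, which is what the code does. All names and numbers are hypothetical toy values, not estimates from the GSS. 

```python
def simple_ols(xs, ys):
    """Intercept and slope of a one-predictor least-squares fit."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("predictor has no variance")
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sxx
    return mean_y - slope * mean_x, slope


def interaction_coefficients(rows):
    """Coefficients of education = a + b1*male + b2*status + b3*male*status.

    `rows` holds (male, father_status, years_education) triples. In this
    fully interacted model, women give (a, b2), and men give
    (a + b1, b2 + b3), so two group regressions recover all four terms.
    """
    women = [(s, e) for m, s, e in rows if m == 0]
    men = [(s, e) for m, s, e in rows if m == 1]
    if len(women) < 2 or len(men) < 2:
        raise ValueError("need at least two women and two men")
    a_f, b_f = simple_ols(*zip(*women))
    a_m, b_m = simple_ols(*zip(*men))
    return {
        "intercept": a_f,
        "sex": a_m - a_f,            # gap between sons and daughters at status 0
        "status": b_f,               # return to father's status for daughters
        "sex_x_status": b_m - b_f,   # extra return to father's status for sons
    }


# Hypothetical toy data: daughters gain 0.05 years per status point,
# sons gain 0.08, but sons start one year lower at status 0.
toy_rows = [
    (0, 20, 11.0), (0, 40, 12.0), (0, 60, 13.0),
    (1, 20, 10.6), (1, 40, 12.2), (1, 60, 13.8),
]
coefs = interaction_coefficients(toy_rows)
print(coefs)
```

In this toy example a negative coefficient on sex combined with a positive interaction means that sons fall behind daughters at low father's status and overtake them at high status, which is the pattern Trivers-Willard predicts. 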

Results are consistent across all years and all measures of status of father's 
occupation. Father's occupational status had strong positive effects on educational 
attainment across all models. Across all models except one, there was a significant 
interaction between sex and father's status in the positive direction predicted 
by Trivers-Willard. This positive coefficient indicates that an increase in father's 
status produces a significantly greater increase in the education of sons than 
in the education of daughters, just as Trivers-Willard predicts (see Figure 1). 
Although sex had a significant effect on educational attainment in only 3 out 
of the 12 models, its effect was consistently negative (with one exception which 
was essentially 0). These findings suggest that for respondents whose fathers 
have zero prestige, all else being equal, boys attain fewer years of education than 
girls—once again just as Trivers-Willard predicts. Age had a negative effect on 
years of education across all models. Number of siblings had a negative effect 
on number of years of schooling across all models, while family income had a 
positive effect on years of schooling across all models. 

[Figure 1. Years of Education by Father's Socioeconomic Status (1980); x-axis: Father's SEI (1980)] 

The significant coefficients on the interaction terms were not due to 
heteroskedasticity in the data (especially greater variance on the dependent 
variable for males than for females). Tests for heteroskedasticity (plotting 
unstandardized predicted y values against studentized residuals) showed that 
heteroskedasticity was not extreme and was within reasonable levels for regression 
analysis (Fox 1991: 52). Nor are the results due to differences in family structure 
by socioeconomic class. That is, it could be argued that lower-status families 
are more likely to be characterized by divorce, and this possibly could hurt the 
educational prospects of boys more than girls. Controlling for family structure 
by only including survey respondents whose families were intact at age 16 yielded 
identical results (analysis not shown). 

Table 2 gives the results of an analysis identical to that of Table 1 but restricted to African 
American respondents only. Once again, results are fairly consistent across all 
measures of status of father's occupation. Sex was only a significant predictor 
of years of education attained in three models, and in a negative direction. 
Given the interaction term, these results should be interpreted as showing that 
sons of hypothetical fathers with zero status or prestige obtain less education 
than daughters of hypothetical fathers with zero status or prestige. Father's 
occupational status had strong positive effects on educational attainment across 
all models. The interaction between sex and father's status was significant in only 
one model (using father's 1970 occupational prestige as the measure of father's 
status), and the coefficient was negative. This finding suggests that in the earlier 
years of the survey among African Americans, an increase in father's prestige 
produced a significantly greater increase in the education of daughters than in 
the education of sons. This effect is opposite to that found for the entire sample, 
although it is consistent with other evidence of the educational attainment of 
African American women (Cohen and Nee 2000). It is also consistent with Cronk's 
(1989; 1993; 2000) findings of preferential investment in daughters among the 
Mukogodo of Kenya, another group with low socioeconomic status. However, 
this significant negative interaction was not found using any of the other measures 
of father's status. It was also not found in any of the surveys from more recent 
years, which may be due to recent improvements 
in the upward mobility opportunities of African Americans. That is, the logic 
of Trivers-Willard suggests that when prospects for upward-status mobility are 
good, parents will invest in boys; when prospects for upward-status mobility 
are poor, parents will invest in girls. If African Americans with means are no 
longer preferentially investing in girls, it may be because possibilities for upward 
mobility have improved for African Americans in American society as a whole. 
Once again, age and number of siblings had a negative effect on number of years 
of schooling across all models, while family income had a positive effect on years 
of schooling across all models. 

Table 3. Correlations between Respondent Sex and Occupational Status of Father 

                          SEI 70        SEI 80     Prestige 70   Prestige 80 
All cases 
  Sex                     [illegible]   .034**     [illegible]   [illegible] 
  N                       22,126        15,363     22,424        15,544 
African Americans only 
  Sex                     .024          .087**     .012          .088** 
  N                       2,460         1,582      2,499         1,608 

In sum, both tests gave substantial support for both hypotheses 1 and 2. 
Among high-status families, boys attain more years of education than girls. 
Among low-status families, girls attain more years of education than boys. This 
was particularly true for African American families, which in the data from earlier 
years sponsored the education of girls much more effectively than that of boys. 
TEST OF SEX RATIO HYPOTHESIS 


In the first test of the sex ratio hypothesis, I correlated respondent sex with 
father's occupational status. Results are given in Table 3. Across all measures of 
father's occupational status, there is a positive, significant correlation between 
respondent sex and occupational status of respondent father. This finding 
suggests, if indeed the General Social Surveys are probability samples of the U.S. 
population, that high-status fathers are more likely to have male offspring. The 
results for African Americans are notable in light of the previous findings: 
father's occupational status was correlated with sex of respondent, but only for 
the 1980 measures of father's occupational prestige and socioeconomic status 
(Prestige 80 and SEI 80). This result suggests that African American males with high occupational 
status have only recently become more likely to have male offspring, perhaps 
because of relatively recent improvements in the general social status of African 
Americans. That is, in the earlier General Social Surveys, African American 
respondents' fathers with high occupational status would probably have had lower 
social status (and income) than non-African Americans with equivalent occupations. 
In later General Social Surveys, such fathers may 
have been afforded more real status and income, hence the change in the effect 
on sex ratio of offspring. 

Table 4 gives the results of the second test of the sex ratio hypothesis. In this 
test, percentage boys (of all biological children of the respondent) is regressed on 
the intelligence, SEI, and occupational prestige of the respondent. Results show 
that intelligence, SEI, and occupational prestige are significant, positive predictors 
of percentage boys for men only, not women (see also Figures 2, 3, and 4). This 
finding suggests that higher-status males are more likely to have boys than other 
males. The findings for women are the opposite, perhaps because these measures 
of status are better measures of male status in American society than female 
status. Also, women who have taken time off from a career to have children are 
less likely to be in high-status occupations. If these women are somewhat more 
likely to be (or have been) married to higher-status men, then this would account 
for the finding of a negative relationship between female occupational status and 
percentage boys among offspring. In fact, some of these women may have lower 
occupational status because they married high-status men, given the tradition of 
wives supporting their husbands' careers over their own.² 

In sum, these two tests give substantial support for hypothesis 3. The sex 
ratio of offspring varies by status, with higher-status fathers having more boys 
among their biological offspring than lower-status fathers. This has only recently 
occurred for African American respondents, perhaps indicative of their recently 
improved status as a group within American society. 
Table 4. 1994 GSS Regression of Percentage Boys among Biological 
Children on Respondent's Occupational Status 

                                      Model 1            Model 2       Model 3 
Sex (1 = Male)                        -21.315*           -23.031***    -31.474*** 
                                      (8.912)            (3.907)       (9.412) 
Intelligence (number words correct)   -2.126* 
                                      (.870) 
Respondent's SEI 80                                      -.195** 
                                                         (.079) 
Respondent's Prestige 80                                               -.227+ 
                                                                       (.142) 
Interaction status by sex             2.391[illegible]   [illegible]   [illegible] 
                                      (1.390)            (.121)        (.201) 
R²                                    .012               .014          .024 
N                                     685                1034          705 

*p < .05  **p < .01  ***p < .001 


Discussion and Conclusion 


Evidence is mounting that biological factors play important roles in the process 
of investment in children within the family. In this article, I have shown how a 
well-known hypothesis from evolutionary biology can help explain differential 
investment in offspring by sex. This is the Trivers-Willard hypothesis, which 
suggests that high-status parents will invest more in boys, and low-status parents 
will invest more in girls. This investment may be physiological and affect sex 
ratios or financial and affect the allocation of parental resources. The analyses 
in this article lend support to the Trivers-Willard hypothesis in both its sex ratio 
and resource allocation forms. 

The test of the resource allocation version of the Trivers-Willard hypothesis 
showed that when looking at educational attainment—arguably the greatest 
investment that American parents make in their children—results conformed 
to the predictions of Trivers-Willard.³ There is an interaction between father's 
status and child’s sex such that on average the educational benefit of having 
a high-status father is greater for boys than for girls. This result suggests that 
high-status fathers are more likely to invest in the education of sons than in 
the education of daughters. These results were significant and consistent across 
28 years of General Social Surveys. Among African Americans, a group that 
has faced obstacles to upward status mobility in American society, this finding 
was reversed for the earlier samples. That is, in the early years of the survey, on 
average the educational benefit of having a high-prestige father was greater for 
girls than for boys. 

[Figure 2. Percentage Boys among Biological Offspring, by Intelligence] 

These results suggest that there are class biases in investment in offspring in 
the contemporary U.S., with higher-status parents investing more in boys, and 
lower-status parents investing more in girls. Given that years of education is an 
indirect measure of parental investment that does not take into account factors 
such as quality or type of education, which also vary by gender, this may just be 
the tip of the iceberg of differential parental investment in offspring. Whether 
such differential parental investment is a conscious strategy or not is unclear, and 
arguably, unimportant. Previous results based on self-report data suggest that 
parents are not conscious of differentially investing in their offspring, at least 
at young ages. The primary drawback of this study—using years of education 
attained as the measure of parental investment—is in this respect also its greatest 
strength, as years of education attained is a measure of past behavior unlikely 
to be subject to reporting biases. However, the indirect nature of this measure 
of parental investment suggests that future analyses should examine more direct 
measures of parental investment in children's higher education such as parental 
provision of tangibles like money and goods and also parental provision of 
intangibles such as moral support and encouragement for higher education. It 
is in determining who will receive our love and support that evolutionary theory 
can be most useful to sociologists (e.g., Daly and Wilson 1998). 

Second, the analyses reported here also support the sex ratio version of the 
Trivers-Willard hypothesis. The results strongly suggest that in American society 
today, high-status males are more likely to have boys among their biological 
offspring. This finding suggests that, as the medical community has learned 
already, socially constructed factors such as social status and associated stress 
(or lack of it) can affect physiological processes. The hypothesized mechanism 
here is through the condition of the mother: the wives of high-status men may 
be assumed to be in better condition (on average) than the wives of lower-status 
men. The condition of the mother in turn affects her ability to carry a more 
fragile male fetus to term, and hence influences her probability of having a boy 
or girl. 

For the sociology of the contemporary U.S., this article has shown how an 
evolutionary hypothesis (the Trivers-Willard hypothesis) can help both explain 
an important social process (educational attainment) and uncover social patterns 
previously undiscovered (the relationship between father's status and sex ratio of 
biological children). The interaction of sex of offspring with parental status on 
education attained was unclear from studies that looked only at the relationship 
between sex and education attained or the relationship between parental status 
and education attained. I argue that the contradictory results obtained so far are 
likely because of an interaction between sex and parental status on educational 
attainment, as well as differential sex ratios by parental status. For example, these 
findings can help explain the effect of sex composition of sibship on individual 
educational attainment. Conley (2000) finds that the presence of opposite sex 
siblings hurts the educational attainment of individuals. This may be in part 
because, in high-status families, there are proportionately more boys and also 
more investment in boys; and in low-status families, there are proportionately 
more girls and also more investment in girls. So, a boy with sisters is more likely to 
be from a lower-status family and receive less education; and a girl with brothers 
is more likely to be from a higher-status family and receive less education. 

Similarly, status attainment research in the U.S. will benefit from attention to 
how the interaction of child's sex and father's status affects the education attained 
by the child. In general, attention to how sex differences in all kinds of investment 
in children differ by socioeconomic class of family of origin is likely to be a 
fruitful avenue of research for sociologists. For example, it is likely that families 
will encourage and support boys and girls in different career and educational 
aspirations, but how they do this will depend on class background. 

It is important to note that these results are for U.S. society only and will be 
somewhat different in other societies with different educational institutions and 
means of status attainment. In U.S. society, Trivers-Willard type processes were 
revealed in years of education attained, but this is likely not to be the case in other 
societies where it is the type of education attained, rather than years of education 
attained, that is the more important predictor of later social status (Kerckhoff 
1995; Buchmann and Dalton 2002). Furthermore, considerations of social 
history also shape how Trivers-Willard type processes take place. For example, 
stratification by race has a history unique to the United States and is reflected in 
the pattern of results over time reported here for African Americans. 

The larger point here is that the usefulness of any evolutionary hypothesis 
depends on an understanding of specific social and cultural contexts, which, 
together with evolved predispositions, produce observed social behaviors. This 
point is often lost in the debate over the relevance of evolutionary theory to 
the social sciences and in critics’ accusation that the evolutionary theorists are 
practicing biological determinism and reductionism. Yet without a specific 
social context, almost all evolutionary hypotheses are of limited value. Humans 
are above all social beings and have evolved to thrive in a social setting (Boyer 
2001:122). It seems that given their understanding of social and cultural contexts, 
sociologists are particularly well positioned to use evolutionary hypotheses to 
help explain social processes, as others have suggested (van den Berghe 1979; 
Nielsen 1994; Whitmeyer 1997; Lopreato and Crippen 1999; Sanderson 2001; 
Kanazawa 2001b). 


Notes 


1. Given this association between status and reproductive success in the evolutionary 
environment, individuals, especially males, may have evolved predispositions to compete 
for status, even if the original evolved reason for this predisposition (greater reproductive 
success) no longer holds. Evidence suggests that contemporary males on average do continue 
to compete for status and strive for dominance in a variety of arenas (Pratto 1996). Studies 
also suggest that females prefer high-status males (Langhorne and Secord 1955; Buss and 
Barnes 1986; Buss and Schmitt 1993) and that high-status males have greater frequency of 
sexual relations and more sexual partners than lower-status males (Pérusse 1993; Kanazawa 
2003). 


2. There is some evidence that first children are more likely to be male than later-borns 
(James 1971). However, this does not account for this finding, as when number of children 
is controlled the positive effect of status on percentage boys among biological offspring 
persists. 


3. Cox (2003) found a similar result in a more limited analysis of education by parental 
wealth. 
Abstract 


This article presents the results of new data collection in Mexico about the 
relationship between child well-being and social networks. Two research questions 
guide the analysis. First, under what conditions do networks generate greater (lesser) 
support? Second, what kinds of networks are associated with healthier children? We 
explore the health status effects of several dimensions of social networks, including 
network size, kinship roles, interaction (proximity, contact, and coresidence), and 
provision of financial and emotional support. Our key findings suggest that networks 
containing more extended kin and coresident ties offer greater support resources to 
mothers with young children, especially among the poorest households. We also find 
that network structures characterized by more social support and greater interaction 
with extended, rather than immediate, kin help sustain healthier children. Together 
these findings indicate the advantages of examining specific role relationships in 
network research among economically marginalized families and attest to the 
importance of social networks founded on principles of reciprocity, confianza, and 
compadrazgo to the well-being of Mexican families. 


The idea that social integration affects health is not new. For at least a century, 
scientific research has focused on the question of how social relationships link to 
health. Contemporary studies have documented that social ties may both improve 
and worsen health. The apparent confusion stems from research practices that 
are insensitive to the sociocultural contexts in which networks are analyzed, and, 
until recently, from a tendency in health research to conflate social networks and 
social support. Although these latter concepts are distinct, they have been treated 
as unidimensional in prior studies (Glass et al. 1997; Seeman 1996). 

House, Umberson, and Landis (1988) outlined several conceptual directions 
that would improve understanding of the structures and processes through which 
social relationships affect health. Among these were the existence and quantity of 
social relationships and the supportive resources derived from networks. Since 
then, studies have paid particular attention to the social support that networks 
provide because it is one way in which resources are transferred (see Hurlbert, 
Haines, and Beggs 2000; Wellman and Gulia 1999; Wellman and Wortley 1990).¹ 
Wellman and Wortley's (1990) landmark study of community ties and social 
support in Toronto is an important example. The study showed that networks 
in East York, Toronto, consisted both of immediate kin, whose ties are dense and 
broadly supportive, and of friends, neighbors, and others, whose ties are loosely 
knit and offer specialized support. Together kin and nonkin offered intimacy and 
social support resources to community members, helping them combat physical 
and emotional stressors encountered in everyday life. 

Our analysis extends these and other findings on social networks, social 
support, and health from research in sociology and epidemiology. Building on 
extant research about how social relationships influence elder health and longevity, 
we seek to broaden our understanding of how household social relationships 
influence child health. We argue that the sociocultural context in which networks 
are analyzed is key to understanding which ties and network attributes are more 
or less helpful in promoting child health. To demonstrate this, we use new data 
gathered in Mexico about child health and social networks. 

Mexico is a nation that has been marked by sustained economic recession and 
growing socioeconomic inequality since 1970. To the present day, its residents 
have few public welfare benefits, and many citizens are unable to purchase high- 
quality healthcare. In addition, they are often exposed to health risks associated 
with inadequate nutrition, hygiene, and protection from environmental exposure. 
As a result, persistent health disparities continue to separate the wealthy from the 
poor (Bobadilla et al. 1993). In addition, as we discuss below, Mexico is known 
to have particular social arrangements, including a system of godparenting, that 
have ensured the survival of many families. 

The analyses presented here examine the types of social networks that are 
conducive to social support resources for mothers. We then examine the effects 
of network structure and social support on child health. Although networks 
may provide security during times of crisis or emergency, the cumulative impact 
of more routine, day-to-day supportive exchanges also may have lasting effects 
on child health. Because research suggests that such networks are particularly 
meaningful for very poor families in Mexico (Logan 1981; Lomnitz 1977), 
our analyses emphasize how child health and support relationships play out 
for economically marginalized households in our sample. We begin below by 
reviewing existing research about how social networks and support influence 
health, and about the social network arrangements that link Mexican families. 
We follow with a discussion of our hypotheses. 


Prior Research 


In theory, social relationships are linked to better health in several ways. They may 
provide emotional benefits, which include intimacy, a sense of belonging, and 
self-esteem. They may also offer instrumental and material benefits by providing 
tangible assistance, such as money, services, goods, guidance, and advice (Kaplan, 
Cassell, and Gore 1977; Walker, MacBride, and Vachon 1977; Weiss 1969; Wellman 
and Wortley 1990). 

One of the earliest studies on social networks and health was Durkheim’s 
(1951) work on the association between social isolation and suicide. Since then, 
many studies have documented the health-enhancing effects of social integration. 
Social support and social engagement reduce mortality risks and disability (e.g., 
Forster and Stoller 1992; Kawachi et al. 1996; Litwin 1998; Sugisawa, Liang, 
and Liu 1994), improve disease recovery rates (Berkman 1995), and promote 
cognitive development and function (Berkman 2000; Fratiglioni et al. 2000). 
Biological and personality endowments may also affect social relationships, but 
social relationships have positive effects on health that are independent of these 
endowments (House, Landis, and Umberson 1988:544). Although most studies 
focus on the healthful effects of social relationships, networks may also contain 
relationships that negatively affect mental and physical health (Seeman 1996; 
Wellman 1981). 

Interest in exactly how social networks and social support affect Mexican 
health has grown considerably in recent years. Strong social ties are 
thought to help promote the health of Mexicans in the United States, who fare 
better than other groups with similar characteristics. However, most evidence on 
this point has been speculative. For example, a study on the health of immigrant 
children by the National Research Council reported that “healthful behaviors may 
be reinforced by strong family bonds among immigrant groups and communities 
that sustain cultural orientations that lead to healthful behavior” (1998:63). 
Along these lines, many studies have pointed to cultural factors (such as strong 
ties among Mexican families) that may diminish the deleterious effects of poor 
socioeconomic status on health (Cobas et al. 1996; Collins and Shay 1994; 
Guendelman 1995; Guendelman and Abrams 1995; Guendelman, Gould, and 
Hudes 1990; Scribner 1996; Scribner and Dwyer 1989; Zambrana et al. 1997). 
Other research suggests that while cultural traits may be part of the story, selective 
migration also contributes to the better birth outcomes of recently arrived Puerto 
Ricans (Landale, Oropesa, and Gorman 2000) and to the foreign-born mortality 
advantage among Mexicans (Palloni and Arias 2004). 


Social Arrangements in Mexico: Reciprocal Exchange, Confianza, 
and Compadrazgo 


To date, we have found no studies that examine how social support and 
networks affect child health in Mexico. We argue, however, that it is important 
to understand how these relationships involve and affect children, because many 
of the presumed benefits of social networks directly relate to child and family 
well-being. For example, social support may benefit children directly by providing 
additional child care, parental support, and financial assistance, and indirectly, 
by offering children greater access to less tangible benefits such as emotional 
support and intimacy that derive from ties with adult family or friends (see, e.g., 
Stanton-Salazar 2001). 

We argue that social support from strong social ties with kin and others 
protects child health in Mexico. Despite sustained economic recessions, growing 
socioeconomic inequality, and increasing outmigration in contemporary Mexico, 
particular features of social arrangements help to ensure the well-being of 
many families. Below we describe three salient attributes of these arrangements: 
reciprocal exchange, confianza, and compadrazgo. 

Reciprocal exchange refers to a transfer of commodities or services between 
groups or individuals. Transfers occur in both directions: participants may both 
provide and receive. Although an exchange often begins between persons with 
few resources, once it occurs it forms an important resource that in itself may 
be used whenever needed. Using this type of exchange, households may organize 
the provision of basic needs by cultivating emotional and economic reciprocity 
and alliances among friends, neighbors, and kin (Kana'iaupuni 2000:20). Social 
networks are thus vital to exchange relationships because families use them to 
build security (Lomnitz 1977). 

Confianza is vital to the system of reciprocal exchange. More than its literal 
meaning of confidence or trust, confianza occurs before any exchange takes place 
and forms a building block for social cohesion. It is rooted in a "mutual desire 
and disposition to initiate or maintain a relationship of reciprocal exchange" 
(Lomnitz 1977:134), and it is often described as both an honor and an obligation. 
Kin or not, persons de confianza are responsible for helping others in many 
different ways. 

Compadrazgo is different from, but related to, confianza. It is the institution of 
godparenting that names persons as protectors of newborns and others passing 
through an important life event, such as baptism, fifteenth birthday, or marriage. 
Although friendship may be the basis for confianza, only when friendships are 
close and intimate do they become institutionalized by compadrazgo (O'Connor 
1989). With the designation of comadre or compadre at an important life event 
comes a set of responsibilities for raising children. As a result, comadres/ 
compadres form close bonds between themselves and family members. On 
the whole, the system of compadrazgo provides families with security during 
periods of economic crisis, and it ensures exchange of favors and obligations 
throughout the child's life (Dinerman 1982; Griswold del Castillo 1984; Lomnitz 
1977; Williams 1990). 

Together, these three features of social network arrangements are important to 
understanding the social context of well-being in contemporary Mexican society. 
Ideally, Mexican families comprise immediate and extended relatives and friends 
bound together by mutual respect and reciprocity. Other economic realities serve 
to strengthen these bonds further. In the late 1970s, Lomnitz (1977) spelled out a 
theory of survival for very poor households in urban Mexico. The daily existence 
of these families, she argued, relied on networks of close family and friends. Since 
then, others have described the practical value of these network arrangements 
when families have been confronted by rapid urbanization, on the one hand, 
and economic crises on the other (González de la Rocha 1994:17). For example, 
Kana'iaupuni (2000) reveals the importance of being able to call regularly upon 
compadres, who may procure free medical services or medicine when children 
become ill. These extended family ties offer support through meals, small loans, 
or transportation in exchange for nonmaterial goods such as household labor 
or elder care. In these and other ways, networks help maintain both economic 
and political stability in poor urban and rural households (Eckstein 1977; 
Kana'iaupuni 1995; Logan 1981; Lomnitz 1977). 

Although particular Mexican social arrangements such as godparenting may 
help families secure the resources they need, limited knowledge exists about 
how network relationships operate. Keefe (1984) shows that many, but not all, 
Mexican-American families in her sample participated in large kin networks 
and engaged in high rates of visiting and exchanges. Some have suggested that 
family networks are more likely to provide emotional support than instrumental 
or material assistance (Golding and Burnam 1990; Griffith and Villavicencio 
1985; Vega 1990). Others have found that related mothers, especially those living 
nearby, mutually promote salubrious behavior during pregnancy, such as healthy 
prenatal diet and good nutrition, by providing instrumental assistance, emotional 
support, and a legitimate source of knowledge and guidance (De Anda 1984; De 
Anda and Becerra 1984; Sherraden and Barrera 1996a). Work by Sherraden and 
Barrera (1996b) also indicates that family support enhances pregnancy outcomes 
among second-generation Mexican-Americans. And compared to less-supportive 
families, more-supportive families maintain greater ties to Mexico, traveling more 
often across the border and integrating more Mexico-resident relatives into their 
active social network. 

In contrast, some scholars adopt a different perspective based on immigration 
research suggesting that social networks may not always benefit health and well-being 
(Portes and Rumbaut 2001). One study of immigrants' networks found that 
not all ties are supportive and that sometimes they create more stress (Menjivar 
2000). Another study documented the social isolation of many recently arrived 
immigrant families (Stanton-Salazar 2001). Relatively little is known about when, 
how, and which ties are more or less supportive, and how the benefits of social 
ties and receipt of support shift with more time spent in the United States. 
Overall, these findings suggest several ways that friendship and kin networks 
may affect health. However, because they rely primarily on data gathered from 
Mexicans in U.S. destinations, they provide a fairly limited understanding of 
Mexican networks. In this article, we step back and examine the unique features 
of social relationships in Mexico and their effects on child health, to answer some 
key questions about how social networks may support families. By doing so, we 
gain insights into theories about the survival strategies of poor families in Mexico 
and also provide an empirical foundation for future studies on the effects of social 
networks on Mexican health across both sides of the Mexico-U.S. border. 


Hypotheses, Data, and Methods 


HYPOTHESES 


From prior studies described above, we developed a set of hypotheses about the 
effects of networks on social support and child well-being in Mexican households. 
We chose to examine the personal (or ego-centered) networks of focal individuals 
(in this case, mothers of young children). Figure 1 displays the conceptual model 
that links network structure and interaction to well-being. To the far right are 
the outcomes of interest, including child health status at the time of the survey 
and the receipt of emotional and financial support. Influencing health status and 
social support are the key dimensions of Mexican family networks, including kin 
relationship and interaction, which we discuss in detail below. 

We argue that knowing family relationships is important to understanding 
network effects on health and social support. Although few have studied kin 
relationships and health outcomes, research consistently suggests that kin generally 
are an important source of social support (Glass et al. 1997; Hurlbert, Haines, and 
Beggs 2000). Wellman and Wortley (1990) found that the parent/child relationship 
was the most likely to offer certain forms of support, and Sherraden and 
Barrera (1996a; 1996b) and others have emphasized the importance of maternal 
support to Mexican families. On the basis of this literature, we hypothesized that 
immediate kin are helpful to the health and well-being of families—although 
larger immediate families could also mean less support per child due to scarce 
resources. Thus we predict the following. 


Hypothesis 1a: Networks containing immediate kin relationships (parents 
and siblings) are more likely to be supportive relative to networks with 
few or no immediate kin. 


Figure 1. Diagram Relating Network Structure to Social Support and 
Child Health 


Hypothesis 1b: Networks containing immediate kin relationships (parents 
and siblings) are associated with good child health status compared to 
networks containing few or no immediate kin. 


We further argue that in Mexico, extended kin relationships (including 
compadres who may be fictive kin) improve child health status and raise the 
likelihood of social support. Extended kin ties are particularly valuable for 
women and children. They are both symbolically meaningful and hold practical 
value by providing access to resources over and above that of immediate family 
relatives. On the whole, we expect that extended kin bring greater heterogeneity 
and resources to the networks within which individuals are embedded, and that 
the absence of extended kin may leave households bereft of supportive resources 
and assistance. Consequently, we predict that: 


Hypothesis 2a: Networks containing extended kin relationships are more 
likely to be supportive relative to networks with few or no extended 
kin. 


Hypothesis 2b: Networks containing extended kin relationships are 
associated with good child health status relative to networks containing 
few or no extended kin. 


The next set of hypotheses captures social interaction between network 
members. Based on anthropological studies in Mexico (e.g., Lomnitz 1977) and 
network studies (e.g., Wellman and Wortley 1990), we operationalize interaction 
as geographic proximity, frequency of contact, and coresidence with members of 
the network. Residential proximity is a network characteristic that affects 
day-to-day access and opportunities for interaction among members (Seeman and 
Berkman 1988; Wellman and Wortley 1990). We predict that: 


Hypothesis 3a: Greater geographic proximity in the networks of 
respondents will raise levels of social support from ties in comparison to 
geographically dispersed networks. 


Hypothesis 3b: Greater geographic proximity in the networks of respondents 
will enhance well-being compared to geographically dispersed networks. 


In addition, frequency of contact between ties encourages "the provision of 
support by fostering shared values, increasing mutual awareness of needs and 
resources, mitigating feelings of loneliness, encouraging reciprocal rounds of 
support, and facilitating the delivery of aid" (Wellman and Wortley 1990:568; see 
also Marsden and Campbell 1984; Seeman and Berkman 1988). More contact will 
promote greater emotional and financial support than more estranged ties, and 
for similar reasons should enhance child well-being. Thus we predict that: 


Hypothesis 4a: The more frequent are contacts with ties within the network 
structure, the greater the likelihood of receiving support from networks, 
large or small. 


Hypothesis 4b: The more frequent are contacts with ties within the network 
structure, the better child health is. 


We treat coresidence separately because, by default, it incurs the most 
interaction and proximity. From obligation and/or preference, coresidents may 
offer various forms of assistance that include child care, housekeeping, income, 
and various forms of emotional caregiving. We therefore expect that: 


Hypothesis 5a: Coresidence with network members increases the likelihood 
of receiving support from networks. 


Hypothesis 5b: Coresidence with network members is associated with 
good child health. 


Two of our final hypotheses indicate that networks and social support are 
especially critical to the well-being of children in economically marginalized 
households. Although we expect that poorer families have less access to network 
support and good health, we also expect that resources from extended kin are 
more critical to children's health in these families, relative to nonpoor families. 
Extended kin protect poor families from the effects of economic insecurities in a 
nation that offers few, if any, institutionalized public programs to support families 
during periods of economic and personal crisis. Therefore we predict that: 


Hypothesis 6a: Poorer households will receive less support from their 
networks overall, but greater access to extended kin ties increases the 
chances of receiving support, compared to wealthier households. 


Hypothesis 6b: Poorer households will be worse off in terms of child 
health status overall, but greater access to extended kin ties increases the 
chances of good child health status in the poorest households, compared 
to other households. 


Following these arguments, our last hypothesis is axiomatic: 


Hypothesis 7: Networks containing greater social support also sustain 
healthier children. 


Measures and Methods 


DATA 


To test these hypotheses, we used data from the Health and Migration Survey 
(HMS) project, a longitudinal data collection and analysis project that examines 
the health consequences of Mexico-U.S. migration (see www.mexmah.com). 
We selected ten villages in the state of San Luis Potosí, Mexico, to represent 
various types of climatic conditions, population compositions, and economic 
productions. In each village, we conducted interviews with 200 randomly selected 
households from a completed census that we took of all households in the village. 
In smaller communities, we did a complete household enumeration. For the 
analysis reported here, we used data from a first wave of representative samples 
from households in eight communities in San Luis Potosí (roughly 1,000 total 
households). 

The key respondent to the HMS was the señora of the household, who 
was either the wife of a male household head or (in a few households, about 
2%) the head herself. Respondents were asked about their marriage, work, and 
fertility histories as well as household migration, labor, and asset acquisitions. 
The survey also contains data on adult and child health, including histories 
about live births that occurred in the six years prior to the survey date, infant 
mortality, immunizations, breastfeeding, health service use and beliefs, and 
prenatal health. 


DEPENDENT VARIABLE 


The first dependent variable, child health status, is based on information collected 
from women who had experienced a live birth in the six years prior to the survey. 
For 620 children, health status was assessed on a five-point scale ranging from 
very poor to excellent, on the basis of mother-reported overall health status at 
the time of the survey. Although self-reported health may be subject to reporting 
errors, studies have found it a surprisingly accurate predictor of subsequent 
morbidity, mortality (Idler and Benyamini 1997; Mare and Palloni 1988), and 
use of medical services (Ferraro and Farmer 1999). 

Our second dependent variable comes from women respondents' reports 
of emotional and/or financial support that they received from each tie in the 
twelve months prior to the survey. Preliminary tests revealed no differences from 
measuring support with a simple dichotomous variable versus a count of ties that 
provided support for the 373 mothers with young children in this analysis. For 
simplicity's sake, we collapsed the information into emotional support, coded as 1 
if a woman had received advice or counsel from her network in the last year and 
as 0 otherwise. Financial support similarly was collapsed into a single measure 
(1 if received, 0 if not received). 
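
The collapsing step can be illustrated with a minimal sketch. It assumes that each tie is stored as a pair of yes/no reports; the field names `emotional` and `financial` and the function `collapse_support` are hypothetical and are not taken from the HMS instrument.

```python
from typing import Dict, List


def collapse_support(ties: List[Dict[str, bool]]) -> Dict[str, int]:
    """Collapse tie-level reports into two 0/1 indicators for the respondent.

    An indicator is 1 if at least one tie provided that kind of support
    in the reference period, and 0 otherwise.
    """
    return {
        "emotional": int(any(t.get("emotional", False) for t in ties)),
        "financial": int(any(t.get("financial", False) for t in ties)),
    }


# Hypothetical respondent with three ties, two of which gave advice or counsel.
example_ties = [
    {"emotional": True, "financial": False},
    {"emotional": False, "financial": False},
    {"emotional": True, "financial": False},
]
indicators = collapse_support(example_ties)
```

A count of supportive ties could be kept alongside the indicators; the dichotomous form mirrors the simplification described above.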


KEY SOCIAL NETWORK MEASURES 


Our survey was designed to assess the characteristics of respondents’ social 
networks in the 12 months immediately prior to the survey. Network attributes 
include size (number of immediate and extended kin ties), social interaction 
(spatial proximity, frequency of contact, and coresidence), and provision of 
emotional or financial support by kin and nonkin members, such as parents, 
siblings, in-laws, friends, and relatives. These measures originated from a set of 
closed-ended questions about existing relations, interactions with, whereabouts 
of, and support from, specific family members (such as parents, siblings, cousins, 
uncles, and aunts) and other kin (such as parents/siblings-in-law and compadres). 
These questions were followed by open-ended questions prompting for other 
relatives or friends not already mentioned (questions are available on request 
from the authors). With these data we were able to assess existing ties to relatives 
and other network members as well as support transfers involving the respondent 
in the year prior to the survey. 

We measured the dimensions of networks as follows. Kin relationship was 
based on relation to the woman respondent. Although these relationships include 
kin of fathers, we defined the relationships from the perspective of mothers 
in households. That is, ties were categorized into either immediate relations, 
which included siblings and parents, or extended relations, which included 
the respondent's other relatives, compadres, and friends. To capture network 
interaction we measured proximity to each noncoresident tie by whether the 
tie resided in the same village (=1), another Mexican village (=2), or across 
the U.S. border (=3). In addition, we created a categorical variable based on 
the average proximity to the entire network: 0 = predominantly local network 
structures (proximity index less than 1.5), and 1 = more dispersed networks 
with higher scores. 
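
A minimal sketch of the proximity coding follows. It assumes the index is the simple mean of the tie-level codes and that a value exactly at the 1.5 cutoff counts as dispersed, since only values below 1.5 are described as local. The function names are hypothetical.

```python
from statistics import mean
from typing import List

# Tie-level codes as described in the text.
SAME_VILLAGE = 1
OTHER_MEXICAN_VILLAGE = 2
UNITED_STATES = 3


def proximity_index(tie_codes: List[int]) -> float:
    """Average proximity code across a respondent's noncoresident ties."""
    return mean(tie_codes)


def dispersed_network(tie_codes: List[int], cutoff: float = 1.5) -> int:
    """Return 0 for a predominantly local network (index < cutoff), else 1."""
    return 0 if proximity_index(tie_codes) < cutoff else 1


# Hypothetical networks.
mostly_local = [SAME_VILLAGE, SAME_VILLAGE, SAME_VILLAGE, OTHER_MEXICAN_VILLAGE]
spread_out = [SAME_VILLAGE, OTHER_MEXICAN_VILLAGE, UNITED_STATES]
```

Here `mostly_local` averages 1.25 and is coded 0, whereas `spread_out` averages 2 and is coded 1. A respondent with no noncoresident ties would need separate handling, because the mean of an empty list is undefined.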

We operationalized frequency of contact by the number of face-to-face 
visits. Frequent contacts included members visited on a weekly basis, whereas 
more estranged ties were those visited less frequently. From this information, we 
created an intensity of frequent contact scale by summing the total number of 
face-to-face contacts per week in the network. Coresidence, the final measure of 
network interaction, was captured by a dummy variable indicating shared living 
arrangements with any network member other than spouse or children. 


OTHER INDEPENDENT VARIABLES 


We examined these outcomes controlling for attributes of children, mothers, 
and households. Child health was regressed on individual attributes including 
age, sex, birth order, and whether the child was breastfed. Controls for mother's 
characteristics included age, parity, education, and health status. Other 
household attributes were prior infant deaths, the number of children under 
seven in the household, and economic marginality. The latter was created from 
measures of household wealth. Each of the following assets was scored and 
summed to create an index ranging from 0 to 5: ownership of home, property 
other than the home, vehicle(s), business(es), and/or five or more hectares of 
land (a little over 12 acres). According to this index, households were coded 1 
(poor) if they had none or only one of these assets, and 0 (nonpoor) otherwise. 
Sixty-five percent of children lived in households with an asset index of 1 
or less, indicating that most households owned relatively few assets; more than 
one-third held greater assets. 
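
The index construction can be sketched as follows, assuming each asset type contributes one point regardless of how many units are owned. The dictionary keys and function names are hypothetical labels for the five asset types listed above.

```python
from typing import Dict

# One point per asset type; the index therefore ranges from 0 to 5.
ASSET_TYPES = ("home", "other_property", "vehicle", "business", "land_5ha_plus")


def asset_index(household: Dict[str, bool]) -> int:
    """Count how many of the five asset types the household owns."""
    return sum(1 for asset in ASSET_TYPES if household.get(asset, False))


def is_poor(household: Dict[str, bool]) -> int:
    """Code the household 1 (poor) if it owns none or one asset type, else 0."""
    return 1 if asset_index(household) <= 1 else 0


# Hypothetical households.
owns_home_only = {"home": True}
owns_home_and_vehicle = {"home": True, "vehicle": True}
```

Under this coding `owns_home_only` scores 1 and is classified as poor, while `owns_home_and_vehicle` scores 2 and is classified as nonpoor.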


METHODS 


Logistic regression was used to evaluate the social support outcomes, whereas 
ordered probit estimates were used to predict general health status. Ordered 
probit methods allow for the rank-ordering of dependent variables without 
making assumptions about the intervals between categories. A positive coefficient 
indicates that an increase in the independent variable leads to a greater likelihood 
of having better health. A constant term is provided for each category of the 
dependent variable. 
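
To make the interpretation of ordered probit coefficients concrete, the sketch below computes category probabilities from a linear predictor and a set of thresholds (the category-specific constants). The threshold values and the size of the shift are hypothetical and are not estimates from our models.

```python
import math
from typing import List


def normal_cdf(z: float) -> float:
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def ordered_probit_probs(xb: float, cutpoints: List[float]) -> List[float]:
    """Return P(y = k) for each ordered category k.

    xb        -- linear predictor (coefficients times covariates)
    cutpoints -- increasing thresholds separating adjacent categories
    """
    cumulative = [normal_cdf(c - xb) for c in cutpoints]
    bounds = [0.0] + cumulative + [1.0]
    return [bounds[k + 1] - bounds[k] for k in range(len(bounds) - 1)]


# Hypothetical thresholds for four ordered categories (poor, fair, good, excellent).
CUTPOINTS = [-1.5, 0.0, 1.5]

baseline = ordered_probit_probs(0.0, CUTPOINTS)
shifted = ordered_probit_probs(0.5, CUTPOINTS)  # larger linear predictor
```

Raising the linear predictor moves probability mass away from the lowest category and toward the highest one, which is the sense in which a positive coefficient indicates a greater likelihood of better health. No assumption is made about the spacing of the categories beyond their order.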

Our analyses of child health status added measures of social support, net of 
other network characteristics. Specifically, child health status was regressed on 
indicators representing households that received no support, emotional only, 
financial only, or both emotional and financial support in the year prior to the 
survey. Many households received either emotional (22%) or financial support 
(6%), whereas about 44% received both, and 27% received none at all. Finally, 
for each set of analyses, we also examined the hypothesis that economically 
marginalized households rely more heavily on social networks and support 
than do wealthier households. Where appropriate, we looked at which network 
characteristics (size, relationship, interaction) encouraged more or less support 
and better or worse child health by poverty status of households. 
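
The four support indicators can be derived from the two dichotomous support measures, as in the sketch below. Treating households with no support as the omitted reference category is an assumption made for illustration, and the labels and function names are hypothetical.

```python
from typing import Dict

SUPPORT_LEVELS = ("none", "emotional_only", "financial_only", "both")


def support_category(emotional: int, financial: int) -> str:
    """Combine the two 0/1 support measures into one four-level category."""
    if emotional and financial:
        return "both"
    if emotional:
        return "emotional_only"
    if financial:
        return "financial_only"
    return "none"


def support_dummies(emotional: int, financial: int,
                    reference: str = "none") -> Dict[str, int]:
    """Return 0/1 indicators for every level except the reference level."""
    category = support_category(emotional, financial)
    return {level: int(category == level)
            for level in SUPPORT_LEVELS if level != reference}


# Hypothetical household that received advice but no money.
example = support_dummies(emotional=1, financial=0)
```

Because the categories are mutually exclusive, at most one indicator equals 1 for any household, and all are 0 for the reference group.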


A CAUTIONARY NOTE 


The direction of causality and, specifically, the possibility that social networks and 
health have reciprocal effects, complicated our analysis. Most prior research has 
looked at social networks and health in the manner hypothesized in the preceding 
section, where strong supportive networks improve the health of individuals and 
communities. But poor health may conceivably alter social network transfers, 
especially in cases involving chronic or repeated health problems. As social capital 
resources are exhausted and power balances within relationships shift over time 
(Berkman 1984), individuals and households with extensive health problems may 
have selectively fewer ties and/or less support from existing ties. Although such 
tradeoffs are possible, they are less problematic in our analysis because we focus 
on the health of young children, who are less likely than adults to suffer from 
chronic disease. However, because our analysis examined health in households 
that received support compared to other households (including some that needed 
support but did not receive it, and others that did not need it and did not receive 
it), we may have underestimated the real impact of social support on health. We 
partially adjusted for this possibility by including mothers' preexisting 
poor health status in the model. 

Another challenge that affects most health research is that a household's 
social network resources may be linked to socioeconomic status. That is, poorer 
households may rely on networks that have fewer resources, whereas wealthier 
households may have more resource-rich networks. In relation to these supply 
and demand issues, it is possible that kin may be the needy ones and “use up” 
available resources for their health needs rather than providing them. In the 
present analysis, we were unable to address the resources of individual ties or 
the specific health needs of families. However, we were able to account for social 
class differences with a measure of household socioeconomic status based on 
accumulated property and business assets in the year prior to the survey. Separate 
analyses were conducted where appropriate to assess health and support among 
economically marginalized households (those with one or no assets), compared to 
others. In tables and figures shown here, we have labeled those most economically 
marginalized as poor, and others as nonpoor. 


Findings 


Table 1 shows that approximately two-thirds of mothers received emotional 
support from their networks. Roughly half received monetary assistance in the 
year prior to the survey. Although a little more than one-quarter received no 
support at all, 44% of households received support of both types (not shown 
in Table 1). Poorer mothers received significantly less support than others. Of 
the infants and children covered in this analysis, most were reported to have 
good or fair health. Relatively few were in poor or excellent health, but excellent 
health was statistically more likely among wealthier households, and no children 
reportedly had very poor health. 

Average network size for the women in our sample was nine individuals: 
approximately three immediate and six extended kin. The proximity index (the 
respondent's average distance from the entire network) was 1.5, indicating that 
most individuals were embedded in relatively proximate network structures. 
Mothers reported an average of 21 weekly face-to-face visits with their networks, 
excluding spouse and children, and approximately 9% of the sample coresided 
with one or more network members. 


Table 1. Descriptive Statistics, Health and Migration Survey, 1996 

                                         Mean/Percentage   Poor          Nonpoor
Dependent variables
  Social support (N = 373 mothers)
    Emotional support received           68.9              63.5          76.8**
    Financial support received           51.5              44.1          62.2**
  Child health status (N = 620 children)
    Excellent                            2.7               2.0           4.0*
    Good                                 46.3              46.9          45.3
    Fair                                 47.9              48.4          47.1
    Poor                                 3.1               2.8           4.0
    Very poor                            .0                .0            .0
Social network measures
  Total network size (number)            9.0               9.0           9.0
  Network size by kin relationship
    Immediate kin size (number)          3.5               3.4           3.5
    Extended kin size (number)           5.6               5.6           5.6
  Interaction
    Spatial proximity (index)            [illegible]       [illegible]   [illegible]
    Frequency of contact (index)         21.2              [illegible]   19.6*
    Coresident ties (yes = 1)            8.8               7.2           [illegible]
Other independent variables
  Attributes of children
    Breastfed                            77.8              78.0          77.4
    Aged 0-1                             13.1              14.1          11.2
    Aged 2-3 yrs                         24.8              25.7          23.3
    Aged 4+ yrs                          62.1              60.2          65.3*
    Sex (male = 1)                       49.2              48.7          50.7
    First birth                          18.4              20.1          [illegible]
    Low birth order                      34.8              33.3          37.7
    High birth order                     46.8              46.6          47.1
  Attributes of mothers
    Age                                  29.5              28.5          31.0**
    Education (years)                    6.5               5.9           7.5**
    Poor health (yes = 1)                5.6               5.9           5.3
  Household characteristics
    Prior infant deaths (no.)            [illegible]       [illegible]   [illegible]
    Young children at home               1.8               [illegible]   [illegible]
    Poor                                 65.2

* p < .05 (using χ² test of significant difference from poor)   ** p < .01 

Table 1 also shows that most children in the sample were aged between 4 and 
7 years and were divided fairly evenly between boys and girls. Nearly half were 
high-parity births, and the average child had two siblings under 7 years old and a 
mother under age 30 with 6 years of educational attainment. Between 5% and 6% 
of mothers reported poor or very poor health status. The number of prior infant 
deaths ranged from 0 to 3, and 12% of mothers reported a prior infant death. 
Approximately 65% of households faced great economic marginality. Compared 
to the nonpoor, these households tended to be younger and living in somewhat 
riskier health conditions, measured by lower educational attainment of mothers, 
less access to coresident ties, less social support, and more prior infant deaths. 

Table 2 presents the distribution of health and support outcomes by key 
network characteristics. (For ease of presentation, health is presented as a 
dichotomous outcome in Tables 2 and 3, where good includes good to excellent 
and poor includes very poor to fair.) The results suggest better health and 
support from larger networks. Three-quarters of mothers with large networks 
received emotional support (versus 55% of mothers with smaller networks) and 
56% received financial support (versus 41% of mothers with smaller networks). 
Coresidence also mattered. More emotional and financial support was received 
by mothers with coresident ties. 
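
As an illustration of the kind of test behind these comparisons, the sketch below computes a Pearson χ² statistic for a 2 x 2 table. The cell counts are back-calculated from the percentages and Ns reported for coresidence and emotional support in Table 2, so they are approximations and not the original data. Omitting a continuity correction is also an assumption.

```python
import math
from typing import Tuple


def chi_square_2x2(a: int, b: int, c: int, d: int) -> Tuple[float, float]:
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]].

    Returns the statistic and its upper-tail p-value with 1 degree of freedom.
    """
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Approximate counts of mothers who did / did not receive emotional support:
# no coresident ties: about 67.1% of 340, that is 228 of 340
# some coresident ties: about 87.9% of 33, that is 29 of 33
chi2, p_value = chi_square_2x2(228, 340 - 228, 29, 33 - 29)
```

With one degree of freedom the p-value can be obtained from the complementary error function, so no statistical library is needed for this check.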

Both support and network size positively influenced health, as expected. 
More children had good health when their mothers reported larger than average 
networks compared to smaller than average networks (52% versus 43%). Although 
results were not significant in the multivariate analysis, having more local network 
ties also positively influenced child health. Lastly, we see that children were more 
likely to be in poor health if their mothers reportedly received no support. But of 
children whose mothers reported receiving both financial and emotional support, 
56% were in good health and 44% in poor health. 

Figures 2 and 3 provide additional evidence of health-related effects for 
economically marginalized households, in comparison to other households. 
Figure 2 displays the relationship between child health and level of support. The 
results show significantly increasing chances of good/excellent child health status 
in households receiving greater social support (none, only emotional support, 
and financial support) for economically marginalized, or poor, households. The 
association is statistically insignificant for nonpoor households. 

Figure 3 shows statistically higher chances of financial support as the number 
of extended kin grows among poor households, as hypothesized. For nonpoor 
households the relationship between extended kin and support is insignificant. 
Together, these results suggest 
initial evidence that (1) extended kin ties are an important financial resource for 
economically marginalized households, and (2) financial support from networks 
is important to child health in economically marginalized households. 


Table 2. Distribution (Percent) of Child Health Status and Social Support 
by Key Network Characteristics 

                                     Child Health Status              Mother Received Social Support
                                     Good    Poorᵃ    N               Emotionalᵇ    Financialᶜ    N
Network size
  1-8 ties                           42.9    57.1‡    [illegible]     55.2          41.4†         116
  9+ ties                            52.0    48.0‡    422             [illegible]   56.0          257
Interaction
  Spatial proximity
    Predominantly local ties         53.5    46.5‡    312             70.1          [illegible]   187
    More distant ties                44.5    55.5‡    308             67.7          49.5          186
  Frequency of contact
    No frequent contacts             50.0    50.0     50              60.6          39.4          33
    1-3 frequent contacts            44.4    55.6     187             68.9          51.9          106
    4-5 frequent contacts            47.3    52.7     146             [illegible]   [illegible]   90
    6+ frequent contacts             53.6    46.4     237             68.0          54.2          144
  Coresidence
    No coresident ties               48.9    51.1     562             67.1†         49.4†         340
    Some coresident ties             50.0    50.0     58              87.9†         72.7†         33
Social support
  No support received                38.5    61.5     [illegible]
  Emotional support only             48.9    51.1     139
  Financial support only             44.4    55.6     36
  Emotional and financial support    56.2    43.8‡    276

Note: “Good” includes good to excellent health status; “poor” includes very poor to fair health status. 

ᵃ χ² test was used to test for differences by health status. 
ᵇ χ² test was used to test for differences in receiving emotional support or not. 
ᶜ χ² test was used to test for differences in receiving financial support or not. 

† p = [illegible]   ‡ p = .05 

A first look at our multivariate models suggests support for our hypotheses 
about the importance of specific kin relationships to health and support 
outcomes. Table 3 displays coefficients from logistic regressions of
receipt of support on network and other characteristics. Poor and nonpoor
households are pooled in these models. Overall, two important findings emerge.
First, immediate kin have both positive and negative effects on support.
Column 1 shows that receipt of emotional support increases as immediate kin 
size increases. The reverse effect holds for financial support, revealing lower 
chances of receiving financial support among households with larger immediate 
kin networks. Second, extended kin size and coresidence are important to the
receipt of both types of support. In other words, emotional and financial support
rise with each additional extended kin tie and with the coresidence of network
members. More immediate kin, however, appear to drain financial support
resources rather than increase them.

[Figure: bar chart; legend: Poor (p = .001), Nonpoor (p = .297); support categories include Emotional Only and Financial Support]

Figure 2. Percent Children in Good/Excellent Health by Amount of Social
Support Received: Poor and Nonpoor Households

Column 2 tests whether these network effects are attributable to other child, 
mother, and household characteristics. Although the positive effect of immediate 
kin size on emotional support becomes insignificant, the other effects remain 
significant. Each additional immediate kin tie lowers the likelihood of
financial support, reducing the odds to .66 times as high (exp(-.41)). On the other
hand, each extended kin or coresident tie increases the chances of receiving both
types of support; for financial support, the corresponding odds ratios are 1.61
(exp(.48)) and 2.75 (exp(1.01)), respectively.
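
The exponentiated coefficients quoted above can be checked directly. The short Python sketch below takes the three coefficients cited in the text for the financial support model (-.41, .48, and 1.01) and converts each to an odds ratio. The variable and function names are illustrative and are not part of the original analysis, and because the inputs are rounded to two decimals the results may differ from the published figures in the last digit.

```python
import math

# Logit coefficients quoted in the text for the financial support model
# (column 2 of Table 3); rounded to two decimals as published.
coefficients = {
    "immediate kin size": -0.41,
    "extended kin size": 0.48,
    "coresidence": 1.01,
}


def odds_ratio(b):
    """Multiplicative change in the odds for a one-unit increase in the predictor."""
    return math.exp(b)


odds_ratios = {name: odds_ratio(b) for name, b in coefficients.items()}

for name, ratio in odds_ratios.items():
    print(f"{name}: exp({coefficients[name]:+.2f}) = {ratio:.2f}")
```

An odds ratio below 1 (immediate kin) means lower odds of receiving financial support with each additional tie, whereas ratios above 1 (extended kin, coresidence) mean higher odds.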

The lower portion of the table shows effects for the controls. Whereas older 
mothers receive support less often than younger ones, more-educated mothers 
receive greater support than less-educated. Mothers in poor health, who may be 
in greater need of support, also are statistically more likely to report receiving 
financial support. Finally, as we hypothesized, greater economic marginalization 
of households reduces the chances of both emotional and financial support. 

We then examined these effects separately for poor and nonpoor households
(Table 4). Consistent with our descriptive results, we see that type
of kinship network is important to understanding social support among 
economically marginalized households. That is, with each additional immediate 
kin, the chances of financial support decrease (the effect loses significance for 
nonpoor households because of small sample size). With each additional extended 
kin tie, however, the chances of emotional and financial support increase for 
poor households, net of other explanatory variables. With respect to other 
characteristics, very few differences exist between poor and nonpoor households. 
One exception is that mothers’ poor health significantly increases the chances of
financial support among poor families, possibly indicating greater health needs
among those families.

Figure 3. Receipt of Financial Support by Number of Extended Kin: Poor
and Nonpoor Households

In sum, we find strong evidence for the positive impact of extended kin ties 
and find that support from these ties is meaningful to mothers in economically 
marginalized households. Another important source of both emotional and 
financial support comes from coresidence, which substantially raises the chances 
of receiving support in comparison to respondents who do not live with friends 
or relatives. This finding is relevant for studies of living arrangements and 
household structure because it suggests that coresidence offers tangible economic 
and emotional benefits. Particularly for poor mothers, having others present at 
home is an important resource to child and family well-being. That coresidence 
increases support, whereas proximity and frequent contact do not, also suggests 
that pooling households may be one of the best ways to ensure supportive ties. But
exactly how living with others improves child well-being is not clear. Coresidence 
represents close proximity and frequent interaction, but what we do not know 
is whether the assistance that derives from it reflects an obligation to assist or a 
desire to support others in need. 

Further insights are offered by additional analyses testing for significant 
differences between poor and nonpoor households (results available upon 
request). Generally speaking, results from interaction models suggest few 
differences between poor and nonpoor households. The only important difference 
emerged for the effect of extended kin on financial support. Extended kin
offer financial support only to poor households, but emotional support to all
households. This finding suggests that extended kin offer financial support only
to those in households that really need it.

Table 3. Logistic Regression Predicting Effects of Network Characteristics
on Receipt of Social Support

                            (1)                                     (2)
                            Emotional           Financial           Emotional           Financial
                            B         S.E.      B         S.E.      B         S.E.      B         S.E.
Network characteristics
  Network size
    Immediate kin size      [illeg.]  .14       [illeg.]  .14       .08       .16       -.41*     .15
    Extended kin size       [illeg.]  .10       [illeg.]  [illeg.]  [illeg.]  [illeg.]  .48*      [illeg.]
  Social interaction
    Spatial proximity       -.19      .26       -.31      .24       -.21      .27       -.32      .26
    Frequency of contact    -.01      .01       -.01      .01       -.00      .00       -.01      .01
    Coresidence             1.43*     .61       1.16*     .45       1.25*     .60       1.01*     .45
Mother's characteristics
  Age                                                               -.07*     .02       -.05*     .02
  Education                                                         [illeg.]  .03       .05†      .03
  Poor health                                                       .20       [illeg.]  1.58*     .59
  Number of infant deaths                                           .38       [illeg.]  [illeg.]  .39
Household characteristics
  Number of young children                                          -.01      .15       .04       .14
  Poor                                                              -.65*     .27       -.83*     .26
Intercept                   -1.78*    .66       -1.59*    .66       1.03      1.25      .61       1.18
F statistic                 4.09*               [illeg.]            4.66*               4.39*
N                           373                 373                 369                 369

† p < .10   * p < .05

Further insights are also gained from the model predicting child health. 
Results in Table 5 show support for our hypothesis that frequency of contact 
is important to child health. Networks containing more intense face-to-face 
contact are associated with better child health status. Each additional contact per 
week increases the chances of better child health. Despite the small size of the
coefficient, the cumulative effect is substantial given the average number of
contacts per week (21.2), net of other explanatory variables.

Emotional and financial support are also very important to child health. 
Children embedded in networks that provided emotional support only or 
financial support in the last year had better health status, compared to those with 
no support. Financial support was associated with 1.38 times the odds of better
child health. Together these findings are consistent with the argument that child
health benefits from greater network interaction and hands-on support from
people and their resources. Greater network size, specifically extended kin size,
is implicated primarily through its positive effects on supportive resources.

Table 4. Logistic Regression Predicting Effects of Network Characteristics
on Receipt of Social Support by Household Poverty Status

                            Poor Households                         Nonpoor Households
                            Emotional           Financial           Emotional           Financial
                            B         S.E.      B         S.E.      B         S.E.      B         S.E.
Network characteristics
  Network size
    Immediate kin size      [illeg.]  [illeg.]  -.44*     .22       .04       .30       -.40      .27
    Extended kin size       .28*      .13       [illeg.]  .16       .27       .23       .18       .17
  Social interaction
    Spatial proximity       [illeg.]  [illeg.]  [illeg.]  [illeg.]  [illeg.]  [illeg.]  [illeg.]  .39
    Frequency of contact    -.00      .01       [illeg.]  [illeg.]  .01       .01       .00       .01
    Coresidence             1.27†     .78       [illeg.]  .83       1.39      1.04      [illeg.]  .59
Mother's characteristics
  Age                       -.06*     .02       -.06*     .03       -.10*     .04       -.05†     .03
  Education                 [illeg.]  .05       .03       .05       [illeg.]  .05       [illeg.]  .04
  Poor health               [illeg.]  .65       1.96*     .87       [illeg.]  [illeg.]  .80       .94
  Number of infant deaths   .41       .40       .26       .49       .27       .78       -.02      .64
Household characteristics
  Number of young children  .02       .18       [illeg.]  [illeg.]  [illeg.]  [illeg.]  [illeg.]  .23
  Poor
Intercept                   -.23      1.48      -1.71     1.58      2.52      2.37      2.34      1.78
F statistic                 2.88*               3.50*               1.83†               1.16
N                           220                 220                 149                 149

† p < .10   * p < .05

Column 3 shows that a mother's poor health affects the health of her children, 
and that larger numbers of young children at home also lower child health status. 
The final set of coefficients in column 4 tests the idea presented in hypothesis 6b
that extended kin are more important to child health in economically marginalized 
households. We introduce an interaction for economically marginalized (poor) 
households to examine the health benefits of financial support compared to other 
households. The results suggest strong health benefits from financial support for 
children in economically marginalized households, as expected. Because the effect 
of financial support attenuates after the interaction is introduced, this relationship 
drives the effects for financial support found in columns 2 and 3. Note that earlier 
tests interacting poor with other network characteristics revealed no statistically 
significant effects (not shown).

To sum up, our results suggest that network relations are vital to the well-being
of households and children. We find that critical elements of the path to
better child health are found in (1) highly interactive networks, (2) networks 
that provide emotional and financial support, and (3) coresident and extended 
kin ties, which form a key resource for both types of support. The results also 
confirm earlier work indicating that economically marginalized households more 
often rely on their extended kin ties for social support, whereas immediate kin 
networks are more likely to drain financial support resources, and that there are 
clear health benefits of financial support for children in poor households. 

This evidence suggests that in Mexico it is not necessarily frequent contact that 
determines receipt of social support, as is found in other places (e.g., Wellman 
and Gulia 1999), but the greater degree of heterogeneity of resources implied 
by larger extended kin networks. Social arrangements in Mexico that emphasize 
reciprocity, confianza, and compadrazgo promote household ties to extended 
relatives and friends, who provide health-enhancing supportive resources beyond 
those available from immediate relatives. Together, these findings are consistent 
with the idea that networks help promote better health by expanding resources 
and by providing other forms of day-to-day assistance and related benefits to 
households. 


Discussion 


The research reported here has examined multiple dimensions of social networks 
to understand their role in child health. We have begun to piece together several 
important links that define the pathways through which social networks may 
improve well-being. We conclude that Mexican family networks provide important 
resources that help sustain healthier children. Impoverished households depend 
on these networks to survive in a developing nation such as Mexico, marked by 
high levels of economic and political volatility. Thus, updating arguments and 
evidence from early anthropological work, our study finds that networks continue 
to play an important role in family survival in contemporary Mexico. 

The results suggest several nuances, too, including positive support from 
networks but also some negative implications. For example, rising numbers of 
immediate relatives reduced the chances of financial support to mothers with 
young children. This finding may suggest that the demands placed on immediate 
relatives exceed their ability to offer support. It may also reflect stressors created 
by immediate family members who themselves require time, energy, and support. 
The latter idea is consistent with prior studies showing that the presence of 
immediate kin can raise stress levels among mothers in families (see Cramer and 
McDonald 1996), which in turn can affect care of and interactions with young 
children. Because it is common for Mexican families to care for their elderly 
at home, some households may be transferring extra resources and finances to

Table 5. Ordered Probit Predicting Effects of Network Characteristics on Child Health

In contrast, we found that larger extended networks improve support and that
such support benefits children in economically marginalized Mexican families. In 
the absence of these ties, mothers and children would likely suffer poorer health. 
These findings provide empirical support for the importance of particular social
arrangements in Mexico. Extended kin relations offer a mechanism through which
families can access resources beyond their immediate relatives. One explanation
is that having more extended kin ties strengthens networks structurally through
resource diversification (i.e., because extended kin are “more prone to move
in circles different from one’s own”; Granovetter 1974:52). Thus, whereas the
resources of immediate family relatives may more closely reflect the conditions 
of families in this study, networks that include compadres and other extended kin 
relations may offer more diverse resources. The end result is positive for social 
support and healthier children. 

By uncovering Mexican-origin social network arrangements that are associated 
with better child health and social support, our findings may also yield insights 
about how social networks function in immigrant households in the United 
States. For example, given that social interaction is an important characteristic 
linking social networks to child health, frequent contact with ties may encourage 
healthier children. Thus, poor immigrant households may be able to protect the 
health of their families and children by relying on their extended kin networks 
for social support and frequent contact. With extended U.S. stays, however, 
immigrant ties may break down, leading to smaller and less frequently visited 
networks, which may contribute to deteriorating health. This situation is likely to 
worsen across time and generations, especially for immigrant groups concentrated 
in large, poor urban neighborhoods where limited resources impinge on social 
norms of exchange and support. This process may account for recent studies of 
Salvadoran and Mexican immigrant networks which found that impoverished 
conditions can often undermine exchange and assistance between social ties 
(Menjivar 2000; Stanton-Salazar 2001). More work is needed to determine how 
the composition and health-related effects of social networks shift after Mexicans 
migrate to the United States. 

Our study makes clear that analyzing networks within their sociocultural 
context is pivotal to identifying how networks influence health and well-being. 
We attempt to understand those relationships in a Mexican context that relies 
on social values and norms of reciprocity, confianza, and compadrazgo to 
buffer families from economic and emotional insecurity. Although our findings 
are similar in spirit to case studies in other times and areas of Mexico and the 
United States, they represent just one step in understanding the complex interplay
between social networks, social support, and health of Mexicans on both sides
of the border.


Notes 
1. Although good health also may be a resource, it is not examined in these studies. 


2. Much of this interest has been stimulated by the “epidemiological paradox,” which refers to 
the better health outcomes of foreign-born Mexicans compared to U.S.-born groups. Research 
based on the U.S. Hispanic Health and Nutrition Examination Survey (HHANES) and other 
data sources generally shows that Mexicans have better birth outcomes than other ethnic
groups who are wealthier, more educated, and better medically served (see special issue of
JAMA 1991, v. 265, 2; Guendelman 1995; Guendelman and Abrams 1995).


3. Many articles and books document these events and trends (see, e.g., Castañeda 1995;
Donato, Durand, and Massey 1992; Massey et al. 1987; Oppenheimer 1998).


4. Note that this characterization is an ideal type and its stereotypical application in social 
science research has been heavily criticized (Roschelle 1997; Vega 1990; Williams 1990). 


5. Contemporary research on U.S. migration shows that social networks are valuable to 
Mexican immigrant families (Massey et al. 1987; Portes and Bach 1985). Although most 
studies assume that networks are supportive, some evidence suggests that supportive 
exchanges between parents and children are fewer among families separated by the Mexico- 
U.S. border than in U.S.-born households (Hogan, Eggebeen, and Clogg 1993). Elsewhere 
we have begun to address how U.S. migration mitigates the relationship between social 
networks and well-being (Kana'iaupuni and Donato 2001; Kana'iaupuni, Donato and 
Thompson-Colón 2000). 


6. The data from Mexico were complemented with data collected in two primary migrant 
destinations, Houston, Texas, and San Diego, California (not used in this analysis). 


7. Note that our estimates about immediate kin are conservative, because available data 
include no more than four persons, including both parents and up to two siblings. Follow- 
up surveys with these families will include more extensive sibling data. 


8. Although in some cases a tie living somewhere else in Mexico may actually be more 
geographically distant than a tie in the U.S., we classified U.S. ties as the most distant because 
of border-crossing and legal constraints that further separate families. 


9. Other studies have considered contact between ties to be frequent if it occurred monthly
(Seeman and Berkman 1988). Given the tendency toward social cohesion in this population, 
we preferred a weekly measure. 


10. With a smaller sample of two most recent births, we introduced a measure of child illness 
experienced during infancy to control for prior illness experiences. Because this variable was 
highly collinear with mothers' self-reported poor health status and because the effects were 
quite similar, we opted to use mother's health status as a control as it produced a larger sample 
size. Also, marital status and work experience were dropped after analyses revealed negligible 
changes in the model fit and insignificant effects (86% of respondents were married).


11. About half of the households in these data had more than one child included in the 
analysis. To check for potential problems, we restricted the analysis to the most recently 
born child and found that though the significance levels were slightly altered due to reduced 
sample size, the substantive results remained unchanged. Nevertheless, all regression results 
were estimated with STATA software, and we report robust standard error estimates adjusted 
for clustered data within households. 


12. Nevertheless, longitudinal research shows that even after controlling for initial poor health, 
those who were less socially integrated had poorer health (Berkman 1995). 


13. In future analyses we will examine health needs and support over time using panel 
data. 


14. In future analyses we will examine the resources of network ties using the new panel 
data we have been collecting.


15. We found little evidence to suggest that the negative effect for larger immediate kin
networks on financial support is due to collinearity. The bivariate effect for immediate 
kin on financial support was negative, and it remained consistently so in various model 
specifications not shown here. 


16. We are grateful to an anonymous reviewer for this interpretation.


Age and Self-Rated Health in Korea*


HYUNJOON PARK, University of Wisconsin-Madison 





Abstract 


I examine age variation in the effects of socioeconomic status (SES) on self-rated 
health in Korea by including three alternative indicators of SES—liquid assets, 
home ownership, and real estate ownership—as well as two standard measures 
of education and household income. Furthermore, I consider the SES-health 
relationship and its variation by age for men and women separately. The empirical 
findings in general do not support the hypothesis that the SES gap in health converges 
with age. Most SES effects on self-rated health remain constant across age groups 
or even increase with age. However, there is evidence of the converging effects of 
household income and liquid assets for women. Differences in self-rated health by 
household income or liquid assets diverge and then converge as age advances. 


A large body of literature has documented a significant association between 
socioeconomic status (SES) and health across various health outcomes including 
morbidity, functional limitations, and self-rated health (see House 2002; Williams 
1990). Furthermore, to better understand the overall relationship between SES and 
health, particular attention has been paid to age variation in the socioeconomic 
gap in health. Socioeconomic differences in health have often been
found to be small during early adulthood, to peak in middle and early old
age, and then to become minimal in later old age (House et al. 1990, 1994; Elo
and Preston 1996).

However, there are also some exceptions to this pattern of midlife divergence 
followed by convergence in later old age. Ross and Wu (1996) find no evidence 
of convergence but rather a widening gap with age in physical functioning and 
well-being between those with high and low levels of educational attainment or
between those with high and low incomes. They emphasize that economic, social
psychological, and health behavior-related resources associated with educational 
attainment cumulate throughout the life course, resulting in more inequality in 
health in older ages than younger ones. 

To address this discrepancy, recent efforts have extended earlier research in 
several important ways. First, a rising critique of the standard SES measures such 
as education and income argues that those measures may not properly reflect 
socioeconomic circumstances among the elderly, and thus the declining effects of 
those SES variables at older ages may be artifacts of poor measurement (Robert 
and House 1996). Employing two alternative indicators of SES—liquid assets 
and home ownership—Robert and House (1996) find that not only do these two 
measures add explanatory power to the model, but their effects, especially on 
functional limitations, persist until very old ages even after education, income, 
and other demographic variables are taken into account. 

Another useful way of extending previous literature is to assess the extent to 
which the patterns of SES-age interactions mostly observed in Western societies 
hold in different social, economic, and cultural settings. It is generally recognized 
that the patterns of fundamental health indicators may differ significantly across 
societies. For instance, there is evidence suggesting that the social gradient of 
health and life expectancy, typically observed in Western societies, does not apply 
in Japan (Cockerham, Hattori, and Yamori 2000). This evidence indicates the 
need for more cross-national research to evaluate the possible generalization of 
findings based on Western experiences.

Finally, despite a great deal of research investigating gender differences in 
health status and gender variation in the relationship between socioeconomic 
position and health (see Lane and Cibula 2000), few studies have seriously 
considered whether age variation in the relationship between SES and health 
may have different patterns for men and women. An examination of how health
inequality by socioeconomic position evolves differently with age for men and
women may help identify a specific age-gender group that shows the greatest
inequality and thus can be a target population for policy intervention.

In this article, I attempt to contribute to the literature by following the three 
lines of research described above. First, I describe socioeconomic differentials 
in health across age groups for a representative population of the Korean urban 
areas, which helps assess the extent to which previous findings based on Western 
experiences hold in other contexts. Second, using relatively large numbers of 
cases for both men and women, I examine how men and women show different 
patterns of SES-age interaction effects on health. Finally, in addition to the two
traditional measures of education and household income, I include three alternative
indicators of SES: liquid financial assets, home ownership, and ownership of real
estate other than the resident home. The variable of real estate ownership is particularly relevant
for the Korean context. As in most societies, wealth distribution in Korea is more 
unequal than income distribution, and the concentration of wealth through real
estate holdings (including land) is a major factor leading to wealth inequality
(Leipziger et al. 1992). In Korea, therefore, having real estate may be associated
with additional advantages in life chances and is likely to be more important
than income as an economic resource for older people who are no longer in the
labor force.


Aging, Welfare Provision, and Health Inequality in Korea 


Along with rapid economic growth, the overall health status of the Korean 
population has substantially improved during the last few decades: life 
expectancy at birth for men increased from 59.0 in 1971 to 71.7 in 1999, and 
the corresponding increase for women was from 66.1 to 79.2. Life expectancy at 
age 60 also increased substantially during the same period from 12.7 to 17.5 for 
men and from 18.2 to 22.2 for women. Meanwhile, total fertility rates rapidly 
declined from 4.5 in 1971 to 1.42 in 1999 (KNSO 2003). 

The increase in life expectancy, along with the dramatic decrease in fertility, 
is driving a remarkably rapid increase in the proportion of the elderly among the 
total population. The share of the population aged 65 and older was only 3.1% 
in 1970. Three decades later in 2000, however, the proportion had increased to
7.2%, and it is projected to rise to 14.4% in 2019. In 2026, Korea is expected to
become a ‘super-aged society’ in which those aged 65 and over occupy 20% of 
the total population (KNSO 2003). 
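
The pace of aging implied by these figures can be made concrete with a little arithmetic. The Python sketch below computes the average annual compound growth rate of the elderly share between the years quoted above (3.1% in 1970, 7.2% in 2000, and a projected 14.4% in 2019). The helper name `annual_growth_rate` is illustrative, and the constant-growth assumption is a simplification for exposition, not the method behind the KNSO projection.

```python
# Share of the Korean population aged 65 and older (percent), as quoted in the text.
elderly_share = {1970: 3.1, 2000: 7.2, 2019: 14.4}


def annual_growth_rate(share_start, share_end, years):
    """Average compound annual growth rate of the share (assumes constant growth)."""
    return (share_end / share_start) ** (1 / years) - 1


rate_1970_2000 = annual_growth_rate(elderly_share[1970], elderly_share[2000], 2000 - 1970)
rate_2000_2019 = annual_growth_rate(elderly_share[2000], elderly_share[2019], 2019 - 2000)

print(f"1970-2000: {rate_1970_2000:.2%} per year")
print(f"2000-2019: {rate_2000_2019:.2%} per year")
```

Under this simplification, the share doubles between 2000 and 2019, which implies faster proportional growth than in the preceding three decades.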

Despite the increase in life expectancy and the dramatic aging of the 
population, Korea has a significantly lower level of public welfare provision 
compared to Western societies. For instance, among the 29 Organisation for
Economic Co-operation and Development (OECD) countries, Korea has the
lowest level of public expenditures on health as a percentage of GDP, spending 
only 2.4% of its GDP, while some European countries, such as Norway, 
Switzerland, and Germany, devote about 7% of their GDP to public funding 
of health programs and services. In addition, the proportion of expenditures 
on health that comes from the public sector is only 46% in Korea, which is the 
second lowest among OECD countries, just above the 45% figure in the United 
States. The corresponding figure is more than 80% in such European countries 
as Sweden, Britain, and the Czech Republic (OECD 2001). 

The Korean context of considerably low levels of public subsidies, along with 
its rapidly aging population, provides an interesting setting to examine the impact 
of SES on health and how that impact varies by age. One explanation of the 
declining SES differentials in health at older ages emphasizes that social policies 
particularly geared to older people (such as Medicare and Social Security in the 
U.S.) might help reduce SES inequalities in health at these ages by supporting 
access to health care and resources for the lower SES groups (Robert and House 
1994). This explanation suggests that we may see widening SES differentials in 


1168 / Social Forces 83:3, March 2005 


health with age in societies such as Korea where low levels of welfare provision 
increase the importance of the individual's and family's economic resources for 
the maintenance of health. 

The low level of social welfare provision means that the family is virtually 
the only institution supporting the health and quality of life of the elderly in 
Korea. As Lee and Palloni (1992) point out, less than one-fifth of the Korean 
elderly earn income through their work, while more than two-thirds rely on their 
family members for most of their living costs. The frequent transfer of economic 
resources within the family, along with little public provision of welfare to the 
elderly, is a well-known feature of the welfare system in Korea. Although an 
intergenerational transfer of resources does not always occur through coresidence, 
the emphasis on the family as a source of support affects living arrangements and 
results in the high prevalence of coresidence of the elderly with their children: 
in 1990, more than three-fifths of Korean parents aged 65 and over lived with at 
least one of their children (Park, Kim, and Kojima 1999). 


Gender, Age, and Health in Korea 


The evidence on gender differences in the magnitude of SES differentials in
health is inconsistent. Some studies find SES differentials in health and mortality
to be smaller among women than men (Koskinen and Martelin 1994;
Elo and Preston 1996), whereas other evidence suggests no significant gender
differences in the relationship between SES and health or even stronger impacts
of SES for women than for men (Marmot et al. 1997; Liang et al. 2000).

These contrasting findings are attributable to many factors: for instance, they
may be due to different measures of SES and health outcomes used in studies
or may be related to different populations examined across different periods 
and places. An interesting source of the diverse findings, particularly relevant to 
this study, is the fact that gender difference in the effect of SES on health varies 
by age. For example, comparing the health of the British 1958 birth cohort at 
age 23 and age 33, Matthews et al. (1999) provide evidence suggesting that the 
magnitude of gender differences in health inequalities by SES varies with the life 
stage as well as with health measures used. 

However, our current knowledge of this issue is limited, with very few studies 
seriously considering interactions among age, gender, and SES on health. For 
instance, we know little about whether the convergence in health differentials at 
older ages found in earlier studies is observed for both men and women. Nor 
have the studies that found a widening health gap between those with higher 
and lower SES addressed the potential gender difference in that pattern. 

Numerous studies have revealed significant gender inequalities in many 
dimensions of Korean society. Korean women have a distinctly low level of labor 
force participation: only half of the women at working ages are in the labor force, 
compared to 73% of U.S. women and 81% of Swedish women (OECD 2000). 
Another feature that makes contemporary Korea stand apart from Western 
societies in regard to women's labor force participation is the M-shaped age profile 
(Brinton 2001). Women's labor force participation rate peaks at the age group of 
20–24 and then drops sharply at the time of marriage and childbirth (ages 25–34). 
After that, some women reenter the labor market, resulting in another peak of 
labor force participation rates around ages 40–44. 

Given the low level of economic activity among Korean women, associated 
with a strong social norm emphasizing the role of women as a supporter of the 
husband's career and a caregiver for children, labor market characteristics such 
as occupation and income may not well reflect Korean women's socioeconomic 
positions. Therefore, health differences based on work-related variables are 
expected to be less pronounced among women than among men. 
Instead, marriage functions as an important mechanism through which women 
attain social status in Korea (Lee 2001). The inability of work-based variables to 
reflect Korean women’s social status and the importance of marriage as a means for 
Korean women to attain social status suggest that we should consider the family's 
overall level of socioeconomic position beyond women's individual economic 
characteristics in order to examine socioeconomic inequalities among women. 


Data and Measures 


DATA 


The data come from the Korean Labor and Income Panel Study (KLIPS), which 
is a longitudinal survey of a representative sample of Korean households and 
individuals in the household who reside in nonrural areas. In the first year of 
1998, the KLIPS interviewed 13,317 persons in 5,000 households with a 76% 
response rate. Since then, respondents who participated in the first wave have 
been reinterviewed each year and additional people have been added each year. 

The KLIPS has collected primarily information on economic activities and 
occupational-relevant variables such as employment status, wage, and working 
hours. The data also contain information on the household's economic and 
demographic characteristics such as homeownership, household income, and 
household size. Therefore, the data are particularly useful for studies that 
need refined measures of the socioeconomic positions of individuals and their 
households. 

Reflecting a rising interest in aging and health research in Korea, a special set of 
health-related questions was added in the 2001 (fourth) wave of the KLIPS. Along 
with detailed information on the respondents' social and economic situation, the 
added health-related information provides an excellent opportunity to investigate 
the effects of SES on health among the Korean people. Thus, the following 
analyses are based upon individuals and families who participated in the fourth 
wave of the survey. Among the total 11,051 participants, I exclude individuals 
who were less than 25 years old in 2001 or who have missing information on the 
self-reported health measure or on other variables used as covariates in the analysis. 
After exclusions, the final sample size was 4,479 females and 4,251 males. 


HEALTH AND SES MEASURES 


I use one indicator of health as the dependent variable: self-rated health status. 
Respondents' subjective assessments of their overall health are coded very poor 
(1), poor (2), fair (3), good (4), and very good (5). Self-rated health represents 
overall physical well-being rather than simply the absence of disease (Ross and 
Bird 1994). As an independent predictor of mortality, self-assessed health is 
actually a stronger predictor of mortality than physician-assessed health (Idler 
and Benyamini 1997). 

I include five indicators of SES to determine both gross and net effects of each 
measure on self-rated health status: education, household income, liquid assets, 
home ownership, and real estate ownership. All SES variables, with the exception 
of education, are obtained from the household data, while education is obtained 
from the individual data. I use the panel feature of the KLIPS by taking the SES 
measures and other control variables from the second wave (1999). Note that 
liquid asset measures are not available in the first wave.⁴ 

Education is coded in three categories: less than high school, high school 
completion, and some post-secondary or more (tertiary). Household income is 
the annual income from all household members and is coded in three categories: 
first and second quartiles combined, third quartile, and fourth quartile. Liquid 
financial assets consist of various components (e.g., bank accounts, stocks, saving 
insurance, or money lent to others) from all household members. A dichotomous 
variable is used to distinguish between those falling on or below the median and 
those falling above the median. I experimented with various categorizations of 
household income and liquid assets and found that more detailed classifications 
added little explanatory power. Home ownership is represented as a dummy 
variable indicating whether a household owns its home. Finally, the KLIPS asked 
respondents whether a household owned any real estate except for the home of 
residence and the real total amount of this real estate. Given that three-fourths of 
the respondents do not have any real estate, a dichotomous variable is created to 
separate those whose households have any kinds of real estate from those whose 
households do not. 
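
The categorical coding described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function names and the income and asset values are hypothetical (they are not KLIPS data), and the quartile cut points come from Python's default `statistics.quantiles` method, which may differ from the procedure actually used in the study.

```python
from statistics import median, quantiles


def income_category(income, cutpoints):
    """Collapse quartiles into three groups: Q1-Q2 combined, Q3, and Q4.

    `cutpoints` holds the three quartile cut points (q1, q2, q3).
    """
    _, q2, q3 = cutpoints
    if income <= q2:
        return "Q1-Q2"
    if income <= q3:
        return "Q3"
    return "Q4"


def above_median(value, med):
    """Return 1 if strictly above the median, 0 if on or below it."""
    return 1 if value > med else 0


# Hypothetical household incomes and liquid assets (arbitrary units).
incomes = [100, 200, 300, 400, 500, 600, 700, 800]
assets = [0, 5, 10, 20, 40, 80, 160, 320]

cuts = quantiles(incomes, n=4)  # three quartile cut points
categories = [income_category(x, cuts) for x in incomes]
asset_flags = [above_median(a, median(assets)) for a in assets]
```

Households exactly at the median fall in the lower group, matching the "on or below the median" wording used for the liquid asset variable.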
DEMOGRAPHIC VARIABLES 


I estimated the models using various ways of classifying age. Treating age as a 
continuous variable, however, results in a better fit and describes the data more 
parsimoniously. In this study, age is included as a continuous variable. Gender is 
coded 0 for males and 1 for females. Marital status is also coded dichotomously 
(1 = married, 0 = others). A continuous variable of the number of household 
members is included in the model to control for the size of household. The 
appendix table provides descriptive statistics for all variables used in the analysis 
by gender.⁵ 


Results 


THE EFFECTS OF THE FIVE SES MEASURES 


I start with a brief description of the distribution of the self-rated health measure 
as presented in the appendix. As mentioned earlier, self-rated health is measured 
with five points: the higher the score, the better the respondent's self-assessed 
health. A group difference test for the mean of the self-rated health measure 
reveals statistically significant gender difference: Korean men report better health 
status than women (men = 3.43, women = 3.14, p=.000). The appendix also 
presents the percentage distribution of respondents into each scale of self-rated 
health. The finding is consistent with the group difference in the mean of the 
self-rated health measure. The proportion of those who reported their health 
status as being good or very good is generally higher among men than among 
women, while a smaller percentage of men reported poor or very poor health 
status than did women. 

Before examining how the effects of socioeconomic status vary by age, I look 
at the gross and net effects of five SES indicators on self-rated health. In Table 1, 
the column of “gross effect” indicates the results of ordinary least squares (OLS) 
regression relating self-rated health to each of the five SES measures separately, 
controlling for demographic variables.⁶ The coefficients of the demographic 
variables are those derived from the model that has only demographic factors 
as explanatory variables.⁷ The table shows that each of the five SES indicators 
is significantly associated with self-rated health for both men and women. The 
direction of the effects of the two traditional SES measures is consistent with 
general expectations: those with higher education or higher household income 
report better health status than those with lower levels of these variables. The 
three alternative indicators of SES move in the same direction: those whose 
households have greater liquid assets and own a house or real estate are better 
off in self-rated health than those whose households do not. 

The column of “net effect” presents the results from a multiple regression 
predicting self-rated health by all demographic and SES variables included 
simultaneously. Among men, education, household income, and liquid assets 
are significantly related to self-rated health, while the net effects of home and 
real estate ownership are not statistically significant. Similar patterns of 
net SES effects are observed among women. The three SES indicators found to 
have significant impacts on self-rated health among men influence women's 
self-rated health as well. Those individuals with higher levels of education, with 
higher household income, and with more liquid assets report better health 
status than do their counterparts. The effects of home ownership and real estate 
ownership are only marginally significant among women (p<.10), once the other 
SES measures are taken into account. The column of standardized coefficients 
shows that the two standard SES measures of education and household income 
in general have more predictive power on self-rated health than do the three 
alternative indicators of SES, though the liquid asset variable shows a relatively 
substantial impact among men. 

An interesting gender difference appears in the functional form of the age effect. The 
negative coefficient of the age variable means that older people report overall 
poorer health compared to younger people for both men and women. However, 
the coefficient of the variable of age squared, which captures nonlinear changes 
in the age effect, has different signs for men and women. Health decreases at an 
accelerating rate for men, as the negative coefficient of age squared indicates. The 
positive coefficient of age squared for women, however, indicates that self-rated 
health decreases at a decelerating rate as age advances. 
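
The sign logic can be written out explicitly. As a sketch, assume the quadratic age specification used in this section, with $b_1$ and $b_2$ the coefficients on age and age squared; the symbols $a$ (mean-centered age) and $\widehat{H}$ (predicted self-rated health, other covariates held fixed) are introduced here only for illustration.

```latex
\[
\frac{\partial \widehat{H}}{\partial a} = b_1 + 2\,b_2\,a ,
\qquad
\frac{\partial^2 \widehat{H}}{\partial a^2} = 2\,b_2 .
\]
% With b_1 < 0:
%   b_2 < 0  =>  the slope becomes more negative as age rises (accelerating decline)
%   b_2 > 0  =>  the slope becomes less negative as age rises (decelerating decline)
```

Under this reading, a negative $b_2$ corresponds to the pattern reported for men and a positive $b_2$ to the pattern reported for women.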


AGE VARIATION IN THE EFFECTS OF SES 


To examine how the effects of each SES indicator vary by age, I test whether 
models with interaction terms between age and each SES measure significantly 
improve upon the full additive model presented in Table 1. For example, when I 
examine the interaction between age and education, the following three models 
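are compared: 

(1) $\text{Self-rated health} = b_0 + b_1\,\text{age} + b_2\,\text{age}^2 + b_3\,\text{marital} + b_4\,\text{numhh} + b_5\,\text{ed} + b_6\,\text{hhinc} + b_7\,\text{asset} + b_8\,\text{home} + b_9\,\text{estate}$ 

(2) $\text{Self-rated health} = b_0 + b_1\,\text{age} + b_2\,\text{age}^2 + b_3\,\text{marital} + b_4\,\text{numhh} + b_5\,\text{ed} + b_6\,\text{hhinc} + b_7\,\text{asset} + b_8\,\text{home} + b_9\,\text{estate} + b_{10}\,(\text{ed} \times \text{age})$ 

(3) $\text{Self-rated health} = b_0 + b_1\,\text{age} + b_2\,\text{age}^2 + b_3\,\text{marital} + b_4\,\text{numhh} + b_5\,\text{ed} + b_6\,\text{hhinc} + b_7\,\text{asset} + b_8\,\text{home} + b_9\,\text{estate} + b_{10}\,(\text{ed} \times \text{age}) + b_{11}\,(\text{ed} \times \text{age}^2)$ 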

The first model is the basic model used to obtain the net effects of the 
explanatory variables in Table 1, including demographic variables and all five SES 
indicators simultaneously. The second model adds the interaction between age 
and education to the first model. If the interaction coefficient ($b_{10}$) between age 
and education turns out to be significantly positive, it indicates that the positive 
effect of education on self-rated health increases with age. On the other hand, 
a negative coefficient indicates that differentials in health by education decrease 
with age. If the interaction appears to be nonsignificant, it is concluded that the 
education-based gap in self-rated health remains constant across age groups. 

Finally, the last model has an additional interaction term between education 
and age squared. If the coefficient ($b_{10}$) of interaction between age and education 
is significantly positive and the coefficient of interaction ($b_{11}$) between education 
and age squared is significantly negative, it indicates that educational differentials 
in self-rated health diverge and then converge with age. Thus, the hypothesis 
predicting convergence in socioeconomic gap in health at older ages can 
be specifically tested. The three models are estimated for each SES measure 
separately. 
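
The nested-model comparison can be illustrated with the classical F statistic for added terms. This is a minimal sketch, not the author's procedure: the function name and all numbers are hypothetical, and the classical formula assumes homoskedastic errors, so it need not reproduce a test based on the robust standard errors reported in this study.

```python
def nested_f_statistic(rss_restricted, rss_full, n_obs, n_params_full, n_added):
    """Classical F statistic for comparing two nested OLS models.

    rss_restricted: residual sum of squares of the smaller model
    rss_full:       residual sum of squares of the larger model
    n_obs:          number of observations
    n_params_full:  number of coefficients in the larger model (incl. intercept)
    n_added:        number of terms the larger model adds
    """
    if n_added <= 0 or n_obs <= n_params_full:
        raise ValueError("need n_added > 0 and n_obs > n_params_full")
    numerator = (rss_restricted - rss_full) / n_added
    denominator = rss_full / (n_obs - n_params_full)
    return numerator / denominator


# Hypothetical values: adding two education-by-age terms (one per
# education dummy) lowers the residual sum of squares from 110 to 100.
f_stat = nested_f_statistic(
    rss_restricted=110.0, rss_full=100.0,
    n_obs=112, n_params_full=12, n_added=2,
)
```

The statistic is referred to an F distribution with `n_added` and `n_obs - n_params_full` degrees of freedom; a large value favors the model with the interaction terms.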

Model comparisons (not shown in this article) based upon F-tests show 
that none of the interaction terms between age and education, age and home 
ownership, or age and real estate ownership is statistically significant for 
men.⁸ Interactions between each of these three SES measures and age squared 
are not significant for men either. This finding indicates that the effects of 
education, home ownership, and real estate ownership do not vary significantly 
by age among men. 

There is a different pattern with respect to the effects of household income 
and liquid assets for men. Adding the interaction term between household income 
and age (p=.04) or between liquid assets and age (p=.002) significantly improves 
the model fit compared to the model without the interaction. This finding 
suggests significant age variation in the effects of household income and liquid 
assets. However, the interactions between age squared and household income and 
between age squared and liquid assets are not statistically significant. 

Turning to women, the interaction terms between two variables of educational 
levels and age appear significant (p=.000), while the additional interactions 
between education and age squared are not significant. The model, which allows 
the effect of home ownership or real estate ownership to vary by age, fits the 
data less well than the model without the interaction. Moreover, the model with 
the additional interaction between household income and age squared improves 
upon the model with the interaction between household income and age only 
(p=.03). The effect of liquid assets shows the same pattern as the effect of 
household income does (p=.001). 

In order to further explore these significant interaction effects, Table 2 presents 
the estimates of SES-age interactions that are statistically significant. Remember 
that for men the model with household income-age or liquid asset-age interaction 
was better than the model without the interaction. The regression results in Table 
2 show that these interactions are positive, indicating that health differentials 
by household income or liquid asset increase with age. For an illustration of 
the diverging gap in health with age, Figure 1 presents the predicted values of 
self-rated health by age for men whose households have liquid assets equal to or 
Table 2. Estimates of Interactions with Age for Each SES Indicator 

                               Men               Women 
Education*age 
  High school*age                                .011 (.004)** 
  High school*age² 
  Tertiary*age                                   .021 (.004)*** 
  Tertiary*age² 
Household income*age 
  Third quartile*age           .003 (.002)       .007 (.002)** 
  Third quartile*age²                            -.0002 (.0001) 
  Fourth quartile*age          .006 (.002)*      .007 (.002) 85» 
  Fourth quartile*age²                           -.0003 (.0001)** 
Liquid asset*age 
  Greater than median*age      .006 (.0019)**    .005 (OOD) 
  Greater than median*age²                       -.0004 (.0001)** 

Note: Values in parentheses are robust standard errors. Age is centered around its mean for men and 
women, respectively. 

*p<.05  **p<.01  ***p<.001 


less than the median and for those whose households have liquid assets greater 
than the median.⁹ 

Among women, the coefficients of the interactions between two educational 
levels and age are significantly positive, suggesting a divergence in health status 
between those with different levels of educational attainment as age advances. 
Regarding the impacts of household income and liquid asset, the interactions 
with age show positive signs, while the signs of interactions with age squared are 
negative. This finding implies that health differentials by household income or 
liquid assets increase and then converge with age. Similar to Figure 1, Figure 2 
presents the predicted values of self-rated health by age and liquid assets among 
women. It is evident that the health gap by liquid assets increases with age and 
then converges in later old ages. 
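
A positive age interaction combined with a negative age-squared interaction implies that the SES gap peaks at some age and narrows afterward. The sketch below locates that peak from the women's liquid asset coefficients in Table 2 (.005 and -.0004) and the women's mean age in the appendix (47.00). The function names are hypothetical, and because the published coefficients are rounded, the result is only a rough illustration, not a figure reported in the article.

```python
def gap_change(b_age, b_age2, centered_age):
    """Change in the SES gap relative to its size at the mean age."""
    return b_age * centered_age + b_age2 * centered_age ** 2


def gap_peak_age(b_age, b_age2, mean_age):
    """Age at which the SES gap is largest (vertex of the quadratic).

    Only meaningful when the age-squared interaction is negative.
    """
    if b_age2 >= 0:
        raise ValueError("the gap has no interior maximum unless b_age2 < 0")
    return mean_age - b_age / (2 * b_age2)


# Women, liquid assets greater than median (rounded Table 2 values).
peak = gap_peak_age(b_age=0.005, b_age2=-0.0004, mean_age=47.00)
print(f"gap is largest near age {peak:.2f}")
```

With these rounded inputs the vertex lies 6.25 years above the mean age; small changes in either coefficient move it considerably, so the figures in the article remain the better guide to where convergence begins.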


Conclusions 


The results of regression analysis predicting self-rated health by demographic 
and SES measures showed that the three alternative measures—liquid assets, 
home ownership, and real estate ownership—did not have as strong effects as 
the two traditional indicators of education and household income, though the 
Figure 1. The Predicted Self-Rated Health by Age and the Level of Liquid 
Assets (Men) 


liquid asset variable had a significant impact on self-rated health. The main 
purpose of the article is to explore the age variation in the effect of each SES 
measure. Our empirical findings in general did not support the convergence 
hypothesis especially for men. The effects of education, home ownership, and real 
estate ownership did not change across age groups, while health differentials by 
household income or liquid assets actually increased with age. That is, for both 
standard and alternative indicators of SES there was no evidence that the SES 
gap in health converged in old age among men. 

Nor did the results support the convergence hypothesis for women with 
regard to the effects of education, which indeed increased with age. The effect of 
home ownership or real estate ownership remained constant across age groups. 
However, there was some evidence of the converging effects of household income 
and liquid assets for women. Differences in self-rated health by household income 
or liquid assets diverged and then converged with age. Therefore, the results for 
women do not conclusively favor either of the two contrasting hypotheses, whereas it 
can be concluded that the hypothesis predicting convergence in old age may not 
hold for men.¹⁰ 

The diverging or constant SES differentials in health with age among men 
and women might be in part attributable to very limited social welfare provision 
Figure 2. The Predicted Self-Rated Health by Age and the Level of Liquid 
Assets (Women) 


in Korea, especially for the elderly. If there were strong social policies such as 
Medicare and Social Security for the elderly in Korea, the influences of SES 
on health among older people might be muted (Robert and House 1994). 
Nonetheless, evidence of a convergence in the effect of household income or 
liquid assets especially among women requires further explanation. Earlier, I 
emphasized that work-related variables should have less influence on Korean 
women’s health given their low levels of labor force participation. Although the 
variable of household income pertains to more than an individual's own economic 
activity, the variable can still be associated with some unobserved characteristics 
measuring work-related activities. Thus, household income is less likely to be 
relevant to health inequalities among women, especially the elderly. 

In addition to a low level of economic activities, Korean women usually have 
less control of the family's economic resources. A family head, who is usually a 
male, has exclusive power over social and economic decisions within the family. 
Liquid assets represent money available for conversion into cash. Therefore, the 
importance of liquid assets for predicting health may be substantially diminished 
for Korean elderly women who are much less likely than men to influence 
the use of liquid assets within a family. It is interesting, in contrast, to see the 
increasing effect of liquid assets with age among men. In fact, the impact of 
household income shows the same pattern as that of liquid assets: divergence with 
age among men and convergence in old age among women. The similarity of age 
variations in the effects of household income and liquid assets deserves further 
examination. 

The endogeneity problem in the relationship between socioeconomic standing 
and health status has been widely discussed in the health literature. To deal with the 
issue to some extent, this study took the SES measures from the second wave of 
the data file with the variable of self-rated health taken from the fourth wave. 
However, no health measure was available from any earlier waves. Information on 
prior health status would be useful to examine the effects of SES on the change 
in health over time, and thus the issue of the reciprocal relationship between 
health and SES could have been better addressed. 

In this study, I only observed variation in the SES effects on health across 
age groups and treated the variation across age groups as reflecting the aging 
process of individuals throughout their life course, which can be problematic. For 
example, worse health status observed among older people could be in part due 
to differences between younger and older cohorts rather than the aging process 
(Ross and Wu 1996). In particular, Korean society has experienced remarkably 
rapid demographic and social changes during the last few decades, which implies 
substantial cohort differences in social and demographic characteristics. As Ross 
and Wu's (1996) study demonstrates, the analysis of the change in health status 
over time using prior health measures could be useful in assessing the relevance 
of the cohort explanation. 

Despite some limitations, the current study provides insight into trends in 
SES differentials in health in the context of a rapidly aging society. It shows 
substantial health inequalities by socioeconomic position among the Korean 
elderly. Along with recent evidence of increasing economic inequalities in the 
Korean society (Choi and Kim 2003), the fact that there is significant inequality 
in health status among the elderly seems to suggest that SES differentials in health 
would become wider as current young cohorts age, given the current low levels 
of social welfare provision. 


Notes 


1. Exceptional is research on the Japanese and Chinese elderly by Liang and his colleagues 
(Liang et al. 2000, 2002). Although the two studies are well designed using longitudinal 
data, the data pertain only to the elderly aged 60 and over. Therefore, they could not address 
possible variation in the effects of SES on health covering a broad range of age from early 
adulthood to very old ages. 


2. The KLIPS was initiated by the Korea Labor Institute (KLI) in 1998. Using the multistage 
and stratified area probability sampling, it interviewed 13,321 individuals (aged 15 and over) 
in 5,000 households from the seven metropolitan cities and urban areas in eight provinces 
in Korea. The sampling frame for the 1995 Korean census was used. Since 1998, the KLIPS 
has tracked the original households and their members every year. Retention rates of the 
original households were 88% in the second (1999) wave, 81% in the third (2000) wave, 
and 77% in the fourth (2001) wave. The surveys were administered mainly by face-to-face 
interviews. Detailed information on the KLIPS can be obtained from the user's guide available 
from the KLIPS Web site (www.kli.re.kr/kiips/). Phang et al. (1999) also provide a useful 
introduction to the KLIPS data. 


3. The only alternative measure available from the KLIPS is whether the respondent had any 
kind of chronic disease. However, instead of asking if an individual has a specific chronic 
disease among the prepared lists of diseases as most surveys do, the KLIPS first asked whether 
a respondent has any kind of chronic disease and then it requested the respondent to list 
the disease he or she has. Since respondents were not informed of the specific definition of 
“chronic disease,” depending on how they perceive chronic disease, individuals with the same 
disease might respond differently to this question. Due to this problem, I did not include 
the measure of chronic disease as another health outcome variable. 


4. For those respondents who were not interviewed in the second wave or who had missing 
information on the variables of the second wave, I used information collected in the third 
or fourth wave. 


5. The fourth wave of the KLIPS does not provide weights for taking into account the 
multistage sampling design and correcting for nonresponse over the different waves of data 
collection. Thus, the results are based on the unweighted data. 


6. Individuals in the KLIPS data are nested within households. Taking into account potential 
dependency among individuals in the same household, robust standard errors are reported in 
this study. However, using robust standard errors did not make any substantial differences 
in the results. 


7. To make the results more interpretable, the age variable in all the regression analyses is 
centered at its mean separately for males and females. 


8. The table of model comparisons can be obtained upon request to the author. 


9. The predicted value of self-rated health was calculated from the equation of the model 
estimated, by fixing other explanatory variables as follows: the mean of number of household 
members, nonmarried, high school completion, household income at the level of third 
quartile, no home ownership, and no real estate ownership. 


10. To see the extent to which the conclusion is affected by different methods, I conducted an 
additional analysis of ordered logistic regression to take into account the ordered feature of the 
self-rated health measure. The results were very similar between the two analyses, except that 
the additional interaction between household income and age squared among women found 
from OLS regression did not reach statistical significance in the ordered logistic regression. 
Given similar results, I decided to rely on OLS regression because OLS results are more 
straightforward to interpret. Another advantage is that OLS allows direct comparisons 
of our results to those from the previous studies that addressed the same issue in America 
using OLS, especially Robert and House (1996) and Ross and Wu (1996). 
Appendix. Descriptive Statistics by Gender 

                                                Men          Women 
                                                (N=4,251)    (N=4,479) 
Dependent variable 
  Self-rated health (continuous variable)ᵃ,ᵇ    3.43         3.14 
                                                (.91)        (.96) 
  Self-rated health (percent) 
    Very poor (1)                               3.4          5.0 
    Poor (2)                                    14.4         22.4 
    Fair (3)                                    22.5         27.6 
    Good (4)                                    54.9         41.8 
    Very good (5)                               4.8          2.5 
Independent variables 
  Ageᵃ,ᵇ                                        45.12        47.00 
                                                (13.82)      (15.12) 
  Number of household membersᵃ,ᶜ                3.92         3.91 
                                                (4.35)       (1.41) 
  Marital status 
    Married                                     79.2         74.6 
    Other                                       20.8         25.4 
  Education 
    Less than high school                       32.3         50.8 
    High school                                 41.1         34.2 
    Tertiary                                    26.7         15.0 
  Household income 
    First-second quartiles                      49.2         50.8 
    Third quartile                              25.9         24.8 
    Fourth quartile                             24.9         24.4 
  Liquid financial asset 
    Median or less                              49.5         50.4 
    Greater than median                         50.5         49.6 
  Own house 
    Yes                                         63.5         63.9 
    No                                          36.5         36.2 
  Own any real estate 
    Yes                                         29.9         23.4 
    No                                          76.7         76.6 

ᵃ Means and standard deviations (in parentheses) are reported for these continuous variables. 
ᵇ Mean difference between men and women is statistically significant. 
ᶜ Mean difference between men and women is not statistically significant. 
Abstract 


During the 1980s, homeless people formed social movement organizations and 
mobilized collective action events in cities across the U.S. From the vantage point 
of social movement theories and scholarship on homelessness, it is surprising that 
homeless protest was so prevalent in the 1980s. Yet we find evidence of homeless 
protest events across no fewer than 50 U.S. cities in the 1980-90 period. Drawing 
on social movement theories about the precipitants of mobilization, we examine 
the extent to which city-level contextual factors, and their change over time, affect 
variation in the frequency of homeless mobilization across 17 of these cities. Our 
findings reveal that a mix of factors congruent with strain and resource mobilization 
theories helps to account for variation in the frequency of homeless protest across 
U.S. cities in the 1980s. 


During the 1980s, many U.S. cities witnessed the rise of insurgency by homeless 
people who engaged in protest rallies, marches, housing takeovers, and 
encampments on government property to express their collective grievances about 
their dire situation. While many readers will recall national events—such as the 
one in October 1989 when more than 250,000 homeless individuals and their 
supporters marched on Washington under the banner of “Housing Now!”—local 
protests were quite widespread, and hundreds of city-level homeless protest events 
occurred between 1980 and 1990. 

That such widespread mobilization occurred among the homeless is somewhat 
surprising when viewed from the vantage point of the literatures on social 
movements and homelessness. Indeed, when considered in these terms, almost any 
mobilization and sustained protest involving the homeless appears to constitute 
an empirical and theoretical anomaly. 

Such pessimism about the prospect of mobilization among those in the 
lowest reaches of the social structure is grounded in the past 35 years of research 
that points to four sets of precipitants of movement mobilization and protest: 
(1) social network linkages, both interpersonal and organizational (Diani 2004; 
Gould 1991); (2) an indigenous organizational base or a facilitative organizational 
context (Evans and Boyte 1986; Morris 1981); (3) control of or access to resources 
(Edwards and McCarthy 2004; McCarthy and Zald 1977); and (4) some degree 
of social power in the sense of being implicated in the ongoing functioning of 
the social system rather than constituting a redundant or superfluous population 
(Jenkins 1985; Oberschall 1973). At first glance the homeless appear deficient on 
all these accounts and therefore are unlikely candidates for protest mobilization. 
The homeless not only suffer severe resource deficits materially and socially, 
but they are more impoverished in both realms than most other marginalized 
individuals and groups (Burt, Aron, and Lee 2001; Rossi 1989; Shinn and Gillespie 
1994). They also rarely have ready access to the kinds of material facilities and 
resources that most social movement activists and organizers take for granted, 
such as a meeting space, a telephone, and a few office supplies (Cress and Snow 
1996). 

In addition to their lack of resources and network ties, three characteristics of 
the homeless population conjoin to make for an unstable and unreliable adherent 
pool: the episodic, transitory character of homelessness, with estimates of more 
than 50% cycling on and off the streets two or more times (Burt, Aron, and Lee 
2001; Snow and Shockey 1998; Wong and Piliavin 1997); spatial mobility among 
the homeless within cities, as they often migrate from one service delivery area 
or “shadow work” node to another during the course of their daily routines (Lee 
and Price-Spratlen 2004; Snow and Anderson 1993; Snow and Mulcahy 2001); 
and the demands of day-to-day survival on the streets (Rosenthal 1994; Snow 
and Anderson 1993; Wagner 1993; Wright 1997). 

Finally, the homeless population has been repeatedly characterized as having 
a higher incidence of disability (e.g., physical, mental, and substance-related) 
in comparison to the larger population (Burt, Aron, and Lee 2001; Rossi 1989; 
Shlay and Rossi 1992; Wright 1989). The conjunction of these factors suggests 
a resource-poor and unstable adherent pool that, in light of the mentioned 
correlates of movement mobilization, would appear to be especially difficult to 
mobilize. Thus, if the collective-action problem that movements must solve is 
basically social, involving “coordinating unorganized, autonomous and dispersed 
populations into common and sustained action” (Tarrow 1994:9), then the 
problem would seem to be particularly troublesome with aggregations like the 
homeless. 

Table 1. Homeless Protest in 17 Cities, 1980-1990 

City                Number of Events    Average Size of Events* 
Washington, D.C.           83                  298.80 
New York                   55                  255.64 
Atlanta                    39                  159.09 
Boston                     39                  119.48 
Philadelphia               39                  125.79 
Los Angeles                38                  115.36 
St. Louis                  36                  150.11 
San Francisco              30                  104.67 
Denver                     27                  101.92 
Chicago                    22                  105.62 
New Orleans                23                   39.56 
Detroit                    22                  284.87 
Tucson                     22                  145.70 
Honolulu                   17                   65.85 
Cleveland                  12                  100.45 
Houston                     6                   80.00 
Minneapolis                 6                   53.63 
Total                     516                     - 

* Measured as the number of participants reported by a newspaper at the event in question. Note that data 
on the number of participants are available only for 312 of 516 events (60%); thus averages are computed 
only on those events for which the number of participants was reported. 

Yet, as noted earlier, there has been considerable movement protest activity 
among the homeless across American cities. This activity can be seen in Table 1, 
which shows both the number and average size of protest events in 17 large U.S. 
cities over the 11-year period 1980-90. Here we see that there is considerable 
variation in their occurrence, ranging from a high of 83 in Washington, D.C., to 
a low of 6 in Houston and Minneapolis. 

It is this variation in the occurrence of homeless protest across these cities 
during the 1980s that we seek to illuminate by examining the various sets of 
contextual conditions posited to account for such variation. Little is known about 
whether city-level conditions account for variation in movement-sponsored 
homeless protest across cities. To date, research on homeless protest activities 
has focused primarily on the organizational form of their carriers 
and sponsors and on their resource needs and benefactors (Cress 1997; Cress and 
Snow 1996); on the character of the protest activities or forms of “resistance,” 


1186 / Social Forces 83:3, March 2005 


as some scholars have referred to them (Wagner 1993; Wright 1997); and on 
protest outcomes or consequences (Cress and Snow 2000; Wagner and Cohen 
1991). While the issue of underlying contextual conditions or precipitants has 
not been ignored (Snow et al. 1998; Wright 1997), it has not been examined 
systematically across cities and over time. We seek here to understand variation 
in homeless protest across cities over the decade of the 1980s. 

Specifically, drawing on data on the 516 homeless protest events in the 17 U.S. 
cities between 1980 and 1990, shown in Table 1, we seek to identify the conditions 
that account for variation in the frequency of homeless protest events over this 
11-year period. We ask whether the dominant theories of protest mobilization are 
helpful when we consider groups, such as the homeless, that appear to be unlikely 
to mobilize. In turn, we use the findings to raise questions about and refine our 
understanding of the set of factors theorized to account for the emergence of 
movements and their protest activities more generally. The contribution of our 
analysis, then, is twofold: It illuminates empirically the extent to which variation in 
city-level contextual conditions, and their change over time, affect the frequency of 
homeless protest; and it contributes to our understanding of the extent to which 
theories about the precipitants of mobilization are general enough to account for 
mobilization of groups that appear to be unlikely candidates for mobilization. 


Theoretical Issues and Orienting Hypotheses 


Although the kinds of questions and issues students of social movements 
address have proliferated in recent decades, few, if any, have generated as much 
interest as the conditions underlying movement emergence. As Wilson (1973) 
noted more than 30 years ago in his overview of the study of social movements, 
"Without doubt, the question most frequently asked about social movements 
is: Under which typical social conditions do social movements normally arise?" 
(33). Writing nearly 25 years later, McAdam, McCarthy, and Zald (1996) agreed, 
asserting, "Understanding the mix of factors that give rise to a movement is the 
oldest, and arguably the most important, question in the field" (7). 

There are currently three leading accounts for the emergence of protest: 
strain theory, resource mobilization theory, and political opportunity structure 
theory. We argue that theories should be general enough to apply in a variety 
of contexts and to a variety of groups and movements. We assess whether these 
theories can help explain protest by a group that appears unlikely to protest. As 
such, we first briefly review these theories and suggest orienting hypotheses on 
homeless protest. 


STRAIN THEORY 


Strain theory's basic postulate is that dire social conditions (e.g., the lack 
of affordable housing, widespread unemployment, and a gap between the 
cost of living and wages) or rapidly changing conditions and trends (e.g., 
dramatic changes in population, unemployment rates, and the cost of living) 
have disruptive, disintegrating effects that can render as likely candidates for 
participation in social movements those individuals or groups most vulnerable to 
dire or rapidly changing conditions or trends. Hence, the origins and emergence of 
social movement activity are rooted in these underlying dire social conditions or 
trends that affect some aggregations of individuals more directly than others. 

Although this theory offers one of the oldest and most persistent perspectives 
on the origins of social movements, dating back to at least the work of Émile 
Durkheim and even some of the work of Karl Marx, it has not fared well on 
either theoretical or empirical grounds over the past several decades (McAdam, 
McCarthy, and Zald 1988; McCarthy and Zald 1977; Tilly, Tilly, and Tilly 1975). 
Yet critics of the theory are not always clear about the precise target of their 
critique. Is the target the theory in general or one of its variant themes or 
arguments? Associated with the strain theory, as it has evolved, are at least four 
rather different claims or themes: One is the mass society variant that accentuates 
the disintegration of social ties (Coleman 1971; Kornhauser 1959) and thus 
approximates Durkheim's formulation. A second is the absolute deprivation thesis 
that focuses on immiserating life conditions, such as extreme poverty (Marx and 
Engels 1954; Piven and Cloward 1977; Van Dyke and Soule 2002). A third is the 
relative deprivation thesis, with its emphasis on the perceived discrepancy between 
expectation and attainment (Gurr 1970; Seeman 1981). A fourth is the "quotidian 
disruption" thesis, which highlights the disruption of everyday subsistence and 
survival routines (Snow et al. 1998; Useem 1998; Walsh 1981). 

Each of these variants of strain theory requires somewhat different kinds 
of data to assess its relevance to the emergence of social movement activity. 
Indicators, albeit different ones, of structural factors and trends work for the 
assessment of both the mass society and absolute deprivation theses, whereas 
subjective, perceptual data are required to provide a fair assessment of the relative 
deprivation thesis, and the quotidian disruption thesis requires a mixture of 
local-community and individual data. Because we are examining the relationship 
between city-level characteristics and the occurrence of protest events, and because 
the kinds of data we assembled are most appropriate for assessing the absolute 
deprivation thesis, we focus on this variant of the strain theory, hypothesizing a 
positive association between local or city-level indicators of absolute deprivation 
and higher frequency of homeless protest activity. 


RESOURCE MOBILIZATION 


The central, orienting premise of the resource mobilization theory is that the 
emergence and persistence of social movement activity is contingent on the 
availability of resources that can be channeled into movement mobilization 
and activity (Edwards and McCarthy 2004; McAdam, McCarthy, and Zald 1988; 
McCarthy and Zald 1977). Although there are other core features of the theory— 
such as its emphasis on organization and rationality—and there continues to be 
some ambiguity about the specification of resources, the availability, aggregation, 
and deployment of resources are regarded as among the most critical determinants 
of movement emergence. As McCarthy and Zald hypothesized on the link 
between resource availability and movement emergence, not only is "the absolute 
and relative amount of resources available to" social movements contingent 
on "the amount of discretionary resources of mass and elite publics," but "the 
greater the absolute amount of (those) resources available to the SMS" (social 
movement sector) within a society, "the greater the likelihood that new SMIs 
[social movement industries] and SMOs [social movement organizations] will 
develop" (1977:1224-25). Evidence in support of this general hypothesis comes 
from a variety of studies. For example, McLaughlin and Marwan's (2000) study 
of the determinants of the founding of environmental movement organizations 
between 1895 and 1995 revealed that the founding of the organizations was 
positively associated with national prosperity and negatively associated with 
high rates of business failures; Soule et al. (1999) found that protest activity by 
U.S. feminist groups in the 30 years between 1955 and 1985 increased during 
prosperous times; and, internationally, Wiest, Smith, and Eterovic (2002) found 
that citizen participation in transnational social movement organizations, such 
as Greenpeace, tends to be positively associated with national wealth. Given the 
congruence between these findings and the resource mobilization theory, we 
hypothesize that the larger the overall resource pool at the city level, the greater 
the frequency of homeless protest events. 


POLITICAL OPPORTUNITY STRUCTURE 


Overlapping with the resource mobilization theory, with its emphasis on the 
resource context, is the political process model, with its emphasis on the context or 
structure of political opportunities (Kriesi 1995; McAdam 1996; Tarrow 1994; Tilly 
1978). Although there is no single consensual definition of political opportunity 
structure, Tarrow’s conceptualization will suffice: “consistent . . . dimensions 
of the political environment that provide incentives for people to undertake 
collective action by affecting their expectations for success or failure” (1994:85). 
Underlying this and related conceptions are three interrelated observations: 
that political systems—be they local municipalities or state, regional, or federal 
governments—can vary considerably in how receptive (open) or unreceptive 
(closed) they are to organized protest and challenge; that this variability is 
signaled by or read from the system's ongoing functioning; and that these signals 
to or readings by social and political actors either encourage or discourage their 
mobilization into social movements. 

Given these observations, there are a number of issues and questions that have 
stimulated discussion and calls for empirical inquiry, one of which is especially 
relevant to our article. It concerns the dimensions of a political system that are 
most indicative of the context of political opportunity and can thus function to 
signal something about its receptivity to challenge. In his synthesis of a number 
of scholarly treatments of this issue, McAdam identified a "highly consensual 
list of [four] dimensions of political opportunity": (1) system accessibility or the 
degree to which a political system is open or closed to challenge, (2) the relative 
stability of the pattern of political alignments within a system, (3) the presence 
or absence of influential allies, and (4) the repressive capacity of the state or 
relevant political entity (1996:26-29). 

Presumably, the degree of political opportunity within a political entity 
can be indicated by any one of these dimensions or some combination of two 
or more. Thus, the structure of political opportunity within a system can be 
assessed empirically on any one of the above dimensions. Since our data set 
includes indicators of the first three dimensions (system accessibility, patterns of 
alignment, and the presence of allies), we assess the impact of those indicators on 
the frequency of homeless protest events across the 17 cities. We hypothesize that 
an open or favorable political opportunity structure in a city will be associated 
with greater frequency of homeless protest activity. 


Data, Procedures, and Concrete Hypotheses 


SAMPLE SELECTION AND PRIMARY DATA SOURCE 


We explore the extent to which the above theoretically derived propositions help 
to explain variation in the yearly count of homeless protest events across 17 U.S. 
cities from 1980 to 1990. We focus our analysis on this time period because of 
its temporal coincidence with both the growth of the homeless population (Burt 
1992; Rossi 1989; Wright 1989) and homeless protest (Rosenthal 1996; Wagner 
1993; Wright 1997) in the U.S. Our decision to begin our analysis of homeless 
protest activity with 1980 rather than earlier was based on the observation that 
homeless protest was almost nonexistent in 1980 and increased in its spread and 
frequency throughout the 1980s. Taking the 17 cities in our sample, for example, 
homeless protest events were noted in only two cities in 1980 but in 15 cities 
in 1987. Similarly, the number of protest events increased correspondingly from 
only 2 in 1980 to 73 in 1987 to a high of 109 the following year. 

Data on these homeless protest events were drawn from the major daily 
newspapers in the 17 sampled cities because of the local character of the 
movement, our interest in accumulating a longitudinal record of the occurrence 
of homeless protest events, and the failure of local social movement organizations 
to maintain an ongoing record of their protest activity. Since the preponderance 
of the country's homeless are concentrated in its largest metropolitan areas and 
particularly their central cities (Lee and Price-Spratlen 2004), we initially hoped 
to select a random sample of the country's 50 largest cities and use the indexes 
to the New York Times and News Bank to determine the incidence and intensity 
of homeless collective action across the sampled cities. However, pilot fieldwork 
in Minneapolis, Philadelphia, and Tucson, including a summer spent working 
with the National Union of the Homeless in Philadelphia, revealed that accounts 
of homeless protest were dramatically underreported in these two indexes. Upon 
discovering this, we checked with the daily newspapers in the country's 50 largest 
cities to ascertain which ones were indexed through the 1980s and found that 17 
dailies, each associated with a different city, met this criterion. 

Thus, the 17 cities included in our sample were selected on the basis of two 
criteria: first, each was one of the 50 largest cities in the U.S. at the time of our 
research, and second, each had a daily newspaper indexed throughout the 1980s 
and into the 1990s. The 17 cities are clustered in Table 2 according to population 
size and region of the country in which they are located. Also indicated are their 
approximate homeless rate and their exact population rank at the time the 
research was initiated. 

Even though newspapers are typically the only source of temporally 
comparative data on protest events, recent research has criticized newspapers 
as a data source. Some of the criticisms take issue with data collection schemes, 
while others point to potential biases in the selection of events and the description 
of events that are covered by newspapers (Earl et al. 2004). The most relevant 
finding for our research is that the use of local rather than national news sources 
appears to reduce some of the biases noted in the literature, such as the tendency 
for more “newsworthy” (e.g., larger, more violent) events to be covered (Barranco 
and Wisler 1999; McCarthy, McPhail, and Smith 1996). To the extent that these 
findings are accurate, our use of local newspapers may provide a less biased set 
of events (and a less biased description of these events) than would national 
newspapers. Nonetheless, using multiple newspapers from different cities could 
introduce another problem: Each of these 17 newspapers could have a different 
degree of selection bias or description bias (Earl et al. 2004). In an attempt 
to control for this possibility (and the possibility that there could be other 
unmeasured, idiosyncratic factors present in one or more of our cities), dummy 
variables for each city were included in the models we ran, with Washington, D.C., 
omitted for comparison. (We describe this procedure in more detail below.) 
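
As a rough illustration of the city dummy-variable scheme just described, the following minimal sketch builds one 0/1 indicator per city while omitting a reference city. The function name `city_dummies` and the sample observations are hypothetical and are not drawn from the authors' data or software; only the choice of Washington, D.C., as the omitted category follows the text.

```python
def city_dummies(cities, reference):
    """Build 0/1 indicator columns for each city except the reference city.

    Returns (levels, rows): `levels` names the indicator columns and
    `rows` holds one list of indicators per observation.
    """
    # The reference city gets no column, so its rows are all zeros.
    levels = sorted(set(cities) - {reference})
    rows = [[1 if city == level else 0 for level in levels] for city in cities]
    return levels, rows


# Hypothetical city-year observations (one city label per observation).
observations = ["Washington, D.C.", "Houston", "Tucson", "Houston"]
levels, rows = city_dummies(observations, reference="Washington, D.C.")
for city, row in zip(observations, rows):
    print(city, dict(zip(levels, row)))
```

With the reference city omitted, each city coefficient in a model would be read as a contrast with Washington, D.C.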

Table 3. Grievances/Issues Associated with Homeless Protest Events 

Grievances/Issues Articulated by Events            Proportion of Events for Which 
                                                   Issues Were Articulated (%) 
 1. General plight of homeless                           42.4 
 2. Housing                                              29.5 
 3. More shelters/shelter beds                           20.8 
 4. Release HUD housing                                  12.8 
 5. Homeless legislation                                 11.4 
 6. Antihomeless ordinances                              11.0 
 7. More low-cost housing                                 5.3 
 8. Work/jobs/minimum wage                                4.0 
 9. Police harassment/behavior                            3.4 
10. Welfare                                               3.0 
11. Political representation and participation            1.5 
12. Other                                                 2.8 

Note: Percentages do not sum to 100 because events often address multiple issues. 


DEPENDENT VARIABLE 


The dependent variable in our analysis is the yearly count of protest events 
between 1980 and 1990 in the 17 cities shown in Tables 1 and 2. Fifteen activities 
were identified in the data set as constituting homeless protest events, with the vast 
majority (95.2%) of these skewed in the direction of noninstitutional collective 
action (e.g., public rallies, marches, housing takeovers, blockades, encampments, 
hunger strikes and fasts, and picketing). It is important to note that some protest 
events actually consisted of several types of activities, but only the major activity 
was counted in such cases. For example, a protest event was coded as a single 
event whether it consisted of a rally in front of a federal building or a series of 
activities that began with a rally followed by a march to an abandoned house that 
was then taken over by the homeless (in this case the housing takeover was the 
major, focal activity of the event). In all, we coded 516 homeless protest events 
in the 17 cities during the 11-year period from 1980 to 1990. 
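
The construction of a yearly event count per city can be sketched as follows. This is only an illustrative sketch under stated assumptions: the event records, the function name `yearly_counts`, and the two-city example are hypothetical, and each record is assumed to represent one coded event (that is, multi-activity events have already been reduced to their major activity).

```python
from collections import Counter


def yearly_counts(events, cities, years):
    """Count coded events per (city, year), with zeros for empty cells."""
    tally = Counter((city, year) for city, year in events)
    # Enumerate every city-year cell so that years without protest appear as 0.
    return {(city, year): tally.get((city, year), 0)
            for city in cities for year in years}


# Hypothetical event records: one (city, year) pair per coded protest event.
events = [("Tucson", 1987), ("Tucson", 1987), ("Houston", 1988)]
counts = yearly_counts(events, cities=["Houston", "Tucson"], years=range(1980, 1991))
print(counts[("Tucson", 1987)], counts[("Houston", 1980)])
```

Filling in explicit zeros matters for a count outcome, since city-years with no reported events are observations rather than missing data.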

In the newspaper reports on these 516 events, there were data on organizational 
involvement for 223 of the events (43%). While this information 
cannot reliably be included in our regression models below, it is worthy of note 
that homeless-based social movement organizations (national and local) were 
reportedly present at more than 75% of these 223 events. Other organizations 
present at these 223 events included various religious organizations and service 
providers, but the most common type of organization mentioned at these events 
was the homeless-based social movement organization, a finding that is consistent 
with fieldwork on the homeless movement in the U.S. during this same period 
(Cress 1997; Cress and Snow 1996, 2000; Wright 1997). 

All the events shared three features. First, they were collective rather than 
individual actions; Table 1 includes a column for the average size of the events 
in each of the cities. It is not surprising that the average size of the events varied 
across cities, but they were by no means small events, as the average size ranged 
from about 40 (New Orleans) to nearly 300 (Washington, D.C.). 

Table 4. Targets of Homeless Protest Events 

Primary Targets                          Proportion of Events (%) 
1. Government or government agency             75.5 
2. Media                                       13.8 
3. Businesses                                   4.1 
4. Service providers                            3.5 
5. Other 

Second, they addressed issues directly affecting the homeless. In other words, 
the events represented the collective expression of shared grievances over various 
issues related to the presumed causes of homelessness and the factors perceived as 
affecting the quality of life on the streets. This relationship can be seen in Table 3, 
which lists the array of issues associated with homeless protest and the proportion 
of the 516 events for which the issues were articulated. The plurality of protest 
gatherings (47.6%) expressed concerns about the general problem of housing 
(rows 2, 4, and 7 in Table 3), 42% focused attention on the general plight of the 
homeless, and 21% protested the limited availability of shelters and shelter beds. 

Finally, the events were targeted or directed at other parties. This targeting can 
be seen in Table 4, which shows that the protest gatherings were sharply focused 
on the targets for their grievances—that is, those organizational entities thought 
to have the political or resource capability to attend to the issue or grievance. 
Three-quarters of the targets were governmental, including federal, state, and 
city governmental entities. 

Although the foregoing suggests that the vast majority of homeless protest 
events are rationalized in terms of the general plight of the homeless and various 
housing issues and are directed at governmental agencies or agents in hopes of 
securing relief with respect to these and other concerns indicated in Table 3, 
there is striking variation in the frequency of homeless protest both across the 
17 cities and over the 1980s. It is this variation that we seek to illuminate in 
this article. We turn now to our specific hypotheses and measurement of our 
explanatory variables. 


SPECIFIC HYPOTHESES AND INDEPENDENT VARIABLES 


As noted earlier, we draw on what are generally regarded as the three major 
theoretical perspectives on movement emergence (strain, resource mobilization, 
and political opportunity structure) in order to guide our examination of the 
broader contextual conditions that account for variation in the frequency of 
homeless protest events in 17 cities over the 11-year period. We hope to ascertain 
how well these theories are able to explain protest by a group that seems unlikely to 
mobilize. To do this, we run a set of three nested models to examine the influence 
of a number of different factors on the frequency of homeless protest. 

In all three of the models presented in Table 5, we include two control 
variables. First, as a proxy for the overall homeless population in each city, we 
include a measure of the number of shelter beds (per 10,000 population) available 
in a city. Because it is not possible to obtain actual rates of homelessness in each 
of our cities over this entire 11-year period, we follow Burt (1992) and argue 
that this is a reasonable proxy for the rate of homelessness. Second, we include 
a measure of the city population size as a control variable. We do so in part 
because our dependent variable is the count of homeless protest events in a city 
in a particular year, and in part because larger cities are likely to have more and 
larger events simply because there are more people therein, all else being equal. 

In addition to these control variables, model 1 includes five measures of the 
absolute deprivation variant of strain theory. The first variable is the ratio of 
median housing value to per capita income for a city. We use this ratio of median 
housing valuation to per capita income citywide because it is consistent with what 
has been argued about one set of structural conditions that breed homelessness: 
a widening gap between the cost of housing and available income or resources 
to meet that cost (Hopper and Hamberg 1986; Koegel, Burnam, and Baumohl 
1996). Thus, we hypothesize that the higher the ratio of housing costs to income, 
the greater the prospect of homeless protest. 

As a second indicator of strain, we include a measure of the unemployment 
rate in each city. We presume that when fewer people are gainfully employed, 
there should be higher rates of homelessness and more intense grievances, and 
thus greater frequency of protest. In other words, we hypothesize a positive 
relationship between unemployment and homeless protest. 

The next two strain variables we include are the change in the number of 
manufacturing jobs and the poverty rate in each city. We include the former 
because the rise in homelessness in the U.S. in the early 1980s appears to have 
been associated temporally with the acceleration of plant closings and the 
corresponding loss of manufacturing jobs in the second half of the 1970s and the 
first half of the 1980s. A study by the U.S. Department of Labor (1985) reports, for 
example, that nearly 12 million workers lost jobs between 1979 and 1985 because 
of plant closings and associated employment cutbacks. Given that these closings 
and cutbacks, often discussed under the rubric of “deindustrialization” (Bluestone 
and Harrison 1982; Perrucci et al. 1988), have been posited as precipitants of 
homelessness (Adams 1986; Hopper, Susser, and Conover 1985; Ropers 1988), we 
hypothesize that a decline in manufacturing jobs should be positively associated 
with greater frequency of homeless protest. We created a dummy variable that is 
coded 1 when there was a decline in the number of manufacturing jobs in a city 
from the previous year, and 0 when the number of manufacturing jobs remained the 
same or increased. Regarding the poverty rates, we hypothesize that higher rates, 
indicating greater economic strain, should be associated with greater frequency of 
protest. Finally, we include the number of families on public assistance (divided 
by the total number of families in the city). This measure of strain, like poverty 
rates, should be associated with greater frequency of protest. 
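
The coding rule for the manufacturing-decline dummy (1 for a year-over-year decline, 0 otherwise) can be sketched as below. The function name `decline_dummy` and the job figures are hypothetical and serve only to make the rule concrete; how the first year of a series was handled is not stated in the text, so this sketch simply leaves it uncoded.

```python
def decline_dummy(jobs_by_year):
    """Code 1 if manufacturing jobs fell from the previous year, else 0.

    `jobs_by_year` maps year -> number of jobs. Years with no prior-year
    value (such as the first year of the series) are left out, which is
    an assumption of this sketch.
    """
    dummy = {}
    for year in sorted(jobs_by_year):
        previous = jobs_by_year.get(year - 1)
        if previous is not None:
            # Unchanged or increased job counts are both coded 0.
            dummy[year] = 1 if jobs_by_year[year] < previous else 0
    return dummy


# Hypothetical job counts for one city.
jobs = {1980: 100_000, 1981: 95_000, 1982: 95_000, 1983: 101_000}
print(decline_dummy(jobs))
```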

In model 2 of Table 5, we include a measure of the overall resource pool 
within each city. As we noted earlier, according to resource mobilization theory, 
the amount of resources available to social movements is contingent on the 
amount of discretionary resources available in a system. Thus, larger resource 
pools lead to greater prospects for social movement mobilization (Edwards and 
McCarthy 2004; McCarthy and Zald 1977). Accordingly, we use per capita income 
as a system-level measure of resources for each of the 17 cities. We hypothesize 
that higher per capita income should be positively associated with higher rates 
of protest, since higher income should, all else being equal, increase the size of 
the resource pool for social causes and movements. 

In addition to our measure of the overall resources in the community, we 
also include a measure of the total transfer payments, per capita, in a city. 
This measure includes aid to families with dependent children (AFDC), social 
security, unemployment, and supplemental security income (SSI) payments, 
and is used as a general indicator of the variation in the amount of income 
support available to the poor across our 17 cities. Research on the homeless in 
various cities across the country shows that many homeless people do, in fact, 
access these programs (Burt, Aron, and Lee 2001; Rossi 1989; Wolch and Dear 
1993). Consistent with the resource mobilization approach, we expect a higher 
dollar volume of transfer payments to be associated with a greater frequency of 
homeless protest activity. 

A final and more direct indicator of a community's resource base for 
dealing with social issues is the volume of contributions to nonprofit social 
service agencies, such as the United Way. While numerous community-based 
organizations provide or facilitate the provision of social services, the United 
Way serves as a baseline institution for such support in most large communities, 
including the 17 cities in our sample. Thus, we expect the frequency of homeless 
protest events to be higher in communities with greater per capita contributions 
to United Way. 

Finally, in model 3 of Table 5, we include measures of the three dimensions 
of the political opportunity structure discussed above: accessibility or openness, 
stability or coherence of political alignments, and the presence or absence of elite 
allies. Considering first the accessibility dimension, and following Eisinger's (1973) 
assessment of the relationship between the structure of political opportunities in 
43 U.S. cities and the occurrence of protest activities in the late 1960s in those 
cities, and Knoke's (1982) parallel finding that mayor/council governments are 
more responsive to the demands of citizens, we hypothesize that cities with a 
mayor/council form of government should have a greater frequency of protest 
activity because elected officials are more directly accountable to the electorate 
than unelected, appointed officials. Also consistent with Eisinger's (1973) research, 
we hypothesize that the ward form of election should yield more protest because 
ward-elected officials are directly accountable to even more specific population 
groups. 

Considering the alignment stability dimension, we look at the percentage of 
the mayoral vote that went to a third party. We assume that an increase in the 
percentage of votes going to a third party is indicative of some degree of political 
fractionalization and should thus be positively associated with a greater frequency 
of homeless protest. We also assume that a smaller margin of victory in a mayoral 
election indicates some degree of instability in the existing city power structure 
and therefore ought to be associated with a greater frequency of protest. In other 
words, the relationship between the margin of victory and the frequency of protest 
events should be negative, with larger margins of victory being associated with 
more political stability, and thus a less frequent occurrence of protests. 

To assess the third dimension of the political opportunity structure, the 
presence or absence of elite allies, we look at the strength of the Democratic 
party in each city. Since Democrats are generally more sympathetic toward, and 
supportive of, the economically marginal (such as the homeless), we assume that 
the percentage vote for Democrats in various elections provides an indication 
of political support and thus allies. To measure this, we collected data on the 
percentage of registered voters in a city that voted for Democratic political 
candidates for the presidential, gubernatorial, and senatorial elections in all 
elections during the 11-year period. Since these elections did not occur yearly, we 
used linear interpolation to estimate a between-election year figure for Democratic 
support for each of these elected offices. Finally, we constructed an index of 
Democratic strength in a city by summing these components.11 Accordingly, 
we hypothesize that there should be a positive association between Democratic 
strength across the cities and the amount of homeless protest activity. 
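
The interpolation and index construction described above can be sketched as follows. This is an illustration only: the vote shares are hypothetical, the function name `interpolate_years` is an assumption, and the index is assumed to be a simple unweighted sum of the interpolated components.

```python
def interpolate_years(known, first_year, last_year):
    """Linearly interpolate yearly values between known election-year values."""
    years = sorted(known)
    series = {}
    for year in range(first_year, last_year + 1):
        if year in known:
            series[year] = known[year]
            continue
        earlier = [y for y in years if y < year]
        later = [y for y in years if y > year]
        if not earlier or not later:
            # No election on one side: leave the year missing, do not extrapolate.
            series[year] = None
            continue
        y0, y1 = earlier[-1], later[0]
        weight = (year - y0) / (y1 - y0)
        series[year] = known[y0] + weight * (known[y1] - known[y0])
    return series


# Hypothetical Democratic vote shares (percent) for one city.
presidential = interpolate_years({1980: 40.0, 1984: 44.0, 1988: 48.0}, 1980, 1988)
gubernatorial = interpolate_years({1982: 50.0, 1986: 54.0}, 1982, 1986)

# Index of Democratic strength for a year in which both components are available.
index_1983 = presidential[1983] + gubernatorial[1983]
```

Years outside the first and last observed election are left missing here; how the original analysis treated such years is not stated.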


DATA SOURCES FOR INDEPENDENT VARIABLES 


Data for the explanatory variables described above come from several sources. The 
source from which most of our variables were derived is the County Statistics File 
4 (COSTAT-4), a public-use compilation of statistical information from numerous 
governmental sources produced by the U.S. Census Bureau. We extracted annual 
information for the 17 counties in which the seat of city government or city 
hall is located for the 17 greater Metropolitan Statistical Areas (MSAs) that are 
of analytical interest. Many of these include multiple counties; however, given 
the theoretical importance assigned to protest activity directed at governmental 
authority and the fact that three-quarters of the protest events were directed at 
governmental agencies, we consider the county in which the city governmental 
authority is located as the most appropriate level of contextual analysis. Yearly 
values for variables with incomplete reporting were derived through linear 
interpolation. 

The COSTAT-4 file data include (1) income, welfare, housing, and population 
data (per capita income, poverty rates, total transfer payments, median 
housing valuation, and total city population) from U.S. Census tabulations; (2) 
employment data (unemployment rate, number of manufacturing jobs) from the 
Bureau of Labor Statistics; and (3) voting data (percentage voting for Democratic 
governor) from the National Election Survey. 

In addition to the COSTAT-4 data, we secured for each city United Way 
contribution data from the national office of the United Way, shelter-bed ratios 
from estimates generated by the Urban Institute's nationwide assessment of 
homeless shelter capacity during the time period in question (Burt 1992), and 
data on city governance structure and politics (mayor/council structure, ward 
structure, and third-party mayoral candidate) from telephone and field interviews 
with local officials in each city. Data on the number of families on public 
assistance come from Kasarda's (1992) Urban Underclass Database. 


ANALYTIC PROCEDURES 


Since the data on homeless protest events are in the form of yearly counts, we 
use negative binomial regression, a variant of Poisson regression, which is 
typically used to analyze event counts (Barron 1992; Land, McCall, and Nagin 
1996). We use negative binomial regression (rather than Poisson) to correct for 
overdispersion in the count of protest events, a condition that occurs when the 
variance of the count is greater than its mean. Negative binomial regression 
addresses overdispersion by adding a stochastic 
component to the model (Land, McCall, and Nagin 1996). 
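
A quick way to see why Poisson regression may be inadequate is to compare the variance of the counts with their mean. The sketch below uses hypothetical yearly counts, not the study's data. It also gives a method-of-moments estimate of the dispersion parameter, assuming the common form Var(y) = mu + alpha * mu^2; the source does not state which parameterization was estimated.

```python
from statistics import mean, variance

# Hypothetical yearly protest counts for one city (illustration only).
counts = [0, 0, 1, 0, 3, 0, 12, 1, 0, 5]

mean_count = mean(counts)
var_count = variance(counts)  # sample variance

# Under a Poisson model the variance equals the mean, so a ratio well
# above 1 signals overdispersion.
dispersion_ratio = var_count / mean_count

# Method-of-moments dispersion parameter under Var(y) = mu + alpha * mu**2.
alpha = (var_count - mean_count) / mean_count ** 2

print(f"mean={mean_count:.2f} variance={var_count:.2f} "
      f"ratio={dispersion_ratio:.2f} alpha={alpha:.2f}")
```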

In addition, because our data structure is a pooled cross-sectional time series, we 
use the method of Generalized Estimating Equations (GEE) developed by Liang 
and Zeger (1986) and Zeger and Liang (1986). As noted by recent users of this 
method (Baron, Hannan, and Burton 2001), this approach generalizes quasi-likelihood 
estimation (Barron 1992) to the sort of data structure we use. A 
common problem in this type of data structure is autocorrelation: yearly 
observations for the same city tend to be correlated. We addressed 
this problem by using Stata's extension of GEE, XTGEE, which allows a number 
of choices for the working correlation matrix (Stata Corp 2001, Version 7.0). We 
experimented with all of the choices and found that the models fit best when 
we specified first-order serial autocorrelation (AR1). Coefficient estimates were 
obtained through the XTGEE routine in Stata. Stata calculates robust standard 
errors (also referred to as Huber/White or sandwich estimates), thus allowing 
for more conservative estimation of these models. 
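
Under a first-order autoregressive working correlation, the assumed correlation between two yearly observations for the same city decays with the number of years separating them. A minimal sketch follows, assuming a hypothetical correlation parameter of 0.5. In GEE this parameter is estimated from the data, and the estimated value is not reported here.

```python
def ar1_working_correlation(n_periods, rho):
    """Return the AR(1) working correlation matrix: entry (i, j) is rho**|i - j|."""
    return [[rho ** abs(i - j) for j in range(n_periods)]
            for i in range(n_periods)]


# 11 yearly observations per city; rho = 0.5 is a hypothetical value.
R = ar1_working_correlation(11, 0.5)
```

Observations one year apart are assumed to correlate at rho, two years apart at rho squared, and so on, so a single parameter describes the whole within-city dependence.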

Finally, all models presented below were run including fixed effects for 
each city (with Washington, D.C., as the omitted category) to control for any 
unmeasured, idiosyncratic factors associated with a particular city. Because none 
of the dummy variables for any city was consistently significant and their removal 
did not alter the other results, we do not present those results. 
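
City fixed effects of this kind amount to one indicator variable per city, with a reference city left out. A minimal sketch is given below; the function name `city_dummies` and the city labels other than Washington, D.C., are hypothetical and used for illustration only.

```python
def city_dummies(cities, omitted):
    """Build 0/1 indicator columns for every city except the omitted category."""
    categories = sorted(set(cities) - {omitted})
    rows = [[1 if city == cat else 0 for cat in categories] for city in cities]
    return categories, rows


# One entry per city-year observation (hypothetical ordering and cities).
observations = ["Washington, D.C.", "Boston", "Denver", "Boston"]
columns, dummies = city_dummies(observations, omitted="Washington, D.C.")
```

Rows for the omitted city are all zeros, so each remaining coefficient is read as a difference from Washington, D.C.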


Results 


Table 5 presents the results of a nested set of models designed to assess our 
hypotheses. The first model includes our two control variables along with our five 
indicators of the absolute deprivation variant of strain, the second adds the three 
resource measures, and the third adds the five political opportunity measures.12 It 
is important to note that we ran all the models in Table 5 with and without the 
inclusion of Washington, D.C., because it had far more homeless-related protest 
events than any other city (see Table 3). However, the results were essentially 
the same either way we ran the models, so we include Washington, D.C., in the 
analyses below. 
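
The nested-model comparisons in Table 5 report a statistic with its degrees of freedom. Assuming that statistic is referred to a chi-square distribution (the source does not say whether it is a Wald or a likelihood-based test), its p-value can be sketched with the standard library alone. The function name `chi2_sf` is an assumption.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variate with integer df."""
    half = x / 2.0
    if df % 2 == 0:
        a, q = 1.0, math.exp(-half)
    else:
        a, q = 0.5, math.erfc(math.sqrt(half))
    # Recurrence for the regularized upper incomplete gamma function:
    # Q(a + 1, z) = Q(a, z) + z**a * exp(-z) / Gamma(a + 1)
    while a < df / 2.0:
        q += half ** a * math.exp(-half) / math.gamma(a + 1.0)
        a += 1.0
    return q


# Model 1 versus the controls-only model: 11.82 on 5 df, as read from Table 5.
p_model_1 = chi2_sf(11.82, 5)
```

The resulting p-value falls between .025 and .05, which agrees with the single asterisk attached to that comparison.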

Turning first to our two control variables, we find that the proxy for the size 
of the homeless population is positive and significant in the first two models but 
loses significance in the final model. Our measure of the city population is not 
significant in any of the three models. 

Turning next to our five measures congruent with the absolute deprivation 
variant of strain theory, we find that the first measure, the ratio of the median 
housing value to per capita income, is positive and significant in all three models. 
Even though the magnitude of association is quite low across the models, the 
finding does lend support to the hypothesis that the wider the gap between 
housing costs and income, the greater the occurrence of homeless protest, all else 
being equal. Also positive and significant in the three models is the coefficient on 
decline in the manufacturing sector, suggesting that homeless protest increases 
with the elimination of manufacturing jobs. The coefficients for the remaining 
two measures of strain, the unemployment and poverty rates, are in the predicted 
direction and are significant in all models, although only at the .10 level in some 
cases. 
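
To gauge how small an effect is, a coefficient from a count model can be exponentiated. This assumes the usual log link for negative binomial regression, which the source does not state explicitly, and the authors do not report rate ratios themselves. The helper name `rate_ratio` is an assumption.

```python
import math


def rate_ratio(coefficient):
    """Multiplicative change in the expected count per one-unit increase."""
    return math.exp(coefficient)


# Coefficient on the housing-value-to-income ratio in models 2 and 3 (Table 5).
housing_ratio_effect = rate_ratio(0.004)
percent_change = (housing_ratio_effect - 1.0) * 100.0
```

Under these assumptions, a one-unit increase in the ratio corresponds to an increase of roughly 0.4 percent in the expected yearly count, holding the other variables constant.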

Recall from Table 3 that 42.4% of the collective action events protested the 
general plight of the homeless, and around 48% expressed concern about various 
housing issues. Given this, the fact that higher ratios of median housing value to 
per capita income, declines in manufacturing jobs, and higher unemployment 
and poverty rates are associated with a greater frequency of homeless protest 
makes sense, as these variables are all consistent with the grievances associated 
with the actual protest events we coded. 

Next, turning to our measures of resources, which are included in models 
2 and 3, we find that the coefficients for per capita income and total transfer 
payments are both positive and significant, suggesting that such resources help 

Table 5. Negative Binomial Regression Models Predicting Yearly Counts of 
Homeless Protest Events in 17 U.S. Cities, 1980-1990 (N = 187) 

                                   Model 1            Model 2            Model 3
Rate of homelessness               .05 (.01)          [illeg.] (.01)     .004 (.02)
City population                    .26 (.16)          [illeg.] (.18)     .06 (.22)
Ratio of median housing value
  to income per capita             .005** (.001)      .004* (.001)       .004* (.001)
Unemployment rate                  [illeg.] (.02)     [illeg.] (.74)     1.35† (.74)
Decline in manufacturing           [illeg.] (.17)     [illeg.] (.18)     [illeg.] (.20)
Poverty rate                       .05† (.03)         [illeg.] (.04)     [illeg.] (.04)
Households on public assistance    -2.46 (4.40)       -1.47 (4.51)       -7.81 (6.27)
Per capita income                                     [illeg.] (.04)     [illeg.] (.06)
Total transfer payments                               1.06 ([illeg.])    .95*** (.18)
United Way contributions                              .01 (.03)          .01 (.05)
Mayor/council                                                            .06 (.62)
Ward                                                                     [illeg.] (.53)
Third party (mayoral race)                                               -1.22 (.80)
Margin of victory (mayor)                                                .18 (.54)
Democratic strength                                                      -1.06† (.64)
Constant                           -5.17* (2.67)      -11.88*** (3.30)   -7.23† (4.24)
Chi-square                         38.19              97.16              97.51
Model 1 versus model with
  only controls (5 df)             11.82*
Model 2 versus model 1 (3 df)                         58.97[illeg.]
Model 3 versus model 2 (5 df)                                            [illeg.]

Note: Standard errors in parentheses. [illeg.] marks an entry or significance 
marker that is unreadable in the source. 
†p<.10, *p<.05, **p<.01, ***p<.001 

facilitate homeless mobilization by enlarging the resource pool that might 
be drawn on for the purpose of collective action. We say “might be drawn 
on” because our data do not allow us to specify a direct link between these 
resource measures and homeless mobilization. In other words, we do not know 
whether or how expanding resources at the individual level are funneled to 
social movement organizations sponsoring and organizing homeless protest. 
But our findings do allow us to infer such a connection, especially since much 
theorized research associated with the resource mobilization theory posits and 
finds a positive and significant relationship between expanding resources and 
social movement emergence and mobilization (for a summary, see Edwards and 
McCarthy 2004). 

Finally, model 3 adds our five political opportunity structure variables. Looking 
first at the indicators of system accessibility, we find that neither a mayor/council 
form of government nor a ward system of election is associated with a greater 
frequency of protest among the homeless, as neither coefficient is significant, 
although both coefficients are in the expected positive direction. Turning to our 
two indicators of the stability of political alignment in a city, we find that the 
coefficient on the percentage of the vote that went to a third party in the mayoral 
race is significant but in the direction opposite of what we predicted. Contrary 
to our hypotheses derived from political opportunity theory, cities in which a 
lower percentage of the vote went to a third party have a greater frequency of 
homeless protest. Similarly, the coefficient on the size of the margin of victory 
in the mayoral election is in the opposite direction of what we hypothesized and 
fails to attain conventional significance as well. Finally, when we examine the 
effect of our measure of elite allies, the percentage vote for Democrats in each 
city, we see that the coefficient is significant (at the .10 level) but in the opposite 
direction of what we had predicted. 

In summary, our results indicate relatively strong support for strain theory and 
some support for resource mobilization theory. These accounts of the precipitants 
of mobilization help us to understand variation in the frequency of homeless 
protest across cities in the 1980s. It is interesting that our results show no support 
for the political opportunity theory when considering homeless mobilization. 
Our results are discussed in more detail in the next section. 


Discussion 


We began this article by seeking to identify whether variation in city-level 
contextual variables and their change over time affected variation in the 
frequency of homeless protest across 17 U.S. cities. We also asked whether the 
dominant theories of mobilization are general enough to account for protest 
among aggregations of individuals who appear to be unlikely candidates for 
mobilization. Our analysis indicates that a mix of theoretically relevant factors 
can best explain the yearly occurrence of these events. In particular, our analysis 
reveals that a combination of variables congruent with both strain and resource 
mobilization theories help to account for variation in the annual occurrence of 
homeless protest events across these 17 cities. 

According to our findings, the existence of various strains that provide the 
material base for the development of a set of mobilizing grievances increases 
the frequency of homeless protest. Here our findings revealed that most of our 
indicators of the absolute deprivation variant of strain theory—ratio of the 
median housing value to per capita income, unemployment rate, decline in 
manufacturing jobs, and poverty rate—were significantly and positively associated 
with the occurrence of protest events across the 17 cities. What makes these 
findings even more compelling is that they are consonant with what is known 
about the structural precipitants of homelessness and the plight and needs 
of the homeless as articulated by the protest events (see Table 3). Regarding 
structural precipitants, it generally can be argued that homelessness flourishes in 
“a widening gap for many households between the cost of their subsistence needs 
and the resources available to meet them” (Hopper and Hamberg 1986:14). The 
shortfall in affordable housing, as measured in terms of the widening gap between 
housing values and income, operates at the supply side of this dynamic, with 
unemployment, a decline in manufacturing jobs, and poverty operating at the 
resource end of the dynamic by contributing to an absence of resources sufficient 
to secure housing. Housing shortfalls are consistent with the fact that around 
two-thirds of the grievances/issues articulated by the homeless protest events 
were focused on housing-related matters (Table 3). And both the housing and 
job problems surface as being among the most salient concerns of the homeless 
when they are asked to list the services that would benefit them the most. As 
reported by one representative survey of nearly 1,800 homeless men and women 
in one of the 17 cities, “help finding low-rent housing” and “a full-time job” were 
named by the homeless as being among the 4 most important desired services, 
out of more than 20 possibilities (Snow and Shockey 1998). That the concerns of 
the homeless, expressed both collectively and individually, corresponded with our 
findings on the five structural indicators of strain provides even greater confidence 
in those findings, thus suggesting further that a number of strain-related factors 
were significant contributors to homeless protest in the 1980s. This finding is 
also consistent with the recent reexamination and reconfiguration of the strain 
theory on social movements (see Buechler 2004; Snow et al. 1998; Useem 1998; 
Van Dyke and Soule 2002). 

Our findings also suggest that the larger the monetary resource pool, as 
measured in terms of per capita income and transfer payments, the greater 
the prospect of homeless protest. Consonant with the resource mobilization 
theory, this finding suggests that as the resource pool increases, so does the 
likelihood that some portion of it will be channeled through other organizations 
to homeless social movement activists and organizations. While this presumed 
connection might be established in a more compelling fashion with data that 
specifies the paths and processes through which monetary resources are funneled 
to homeless movement organizations, our data do not enable us to pursue that 
analysis. But it is important to note that this shortcoming hardly differentiates 
our analysis from most other studies that note empirically the importance of 
the connection between expanding resource pools and movement mobilization 
in other contexts (e.g., McLaughlin and Marwan 2000; Soule et al. 1999; Wiest, 
Smith, and Eterovic 2002). Furthermore, these findings make sense in light of the 
severe resource deficits suffered by the homeless both individually and collectively 
and the fact that they are more impoverished on both these measures than most 
other marginalized individuals and groups (Cress and Snow 1996; Rossi 1989; 
Shinn and Gillespie 1994). Given the general resource impoverishment among 
the homeless, it is not surprising that the occurrence of homeless protest activity 
was greater in cities with larger resource bases or pools. 

We also examined five indicators of city political opportunity, but we found 
that none of these variables attained significance in the direction predicted. This 
is puzzling for several reasons. First, it has become almost an article of faith over 
the past several decades that aspects of political opportunity structures constitute 
at least one set of key determinants of movement emergence and protest activity 
(McAdam, McCarthy, and Zald 1988, 1996; Tarrow 1994). Second, three of our 
variables—mayor/council form of government, ward form of election, and 
percentage of the mayoral vote that went to a third party—are directly reflective 
of the local political context. And third, these measures are almost identical to 
those Eisinger (1973) used in what is generally regarded as one of the seminal 
political opportunity analyses. 

These considerations notwithstanding, we suspect the differences might be 
partly explained by a number of factors. First, Eisinger's (1973) analysis relied 
on correlations rather than regression models and did not include measures of 
strain and resources, as we do here. Second, it is possible that large cities were 
generally more open to protest in the 1980s than during the turbulent 1960s 
when Eisinger conducted his research, not so much because of more accessible 
structures or sympathetic political orientations on the part of city authorities but 
because of the increasing institutionalization and conventionalization of protest 
(see Della Porta, Fillieule, and Reiter 1998; McCarthy and McPhail 1998). Third, 
our failure to find support for the explanatory value of the political opportunity 
structure could result from differences in the type of protest and challenges we 
investigated. Finally, with respect to the lack of support for the hypothesis that 
Democratic elite allies facilitate protest, this finding may be better understood 
when we consider the fact that more than 95% of the protest events we analyze 
are noninstitutional in nature. Perhaps when protesters have allies in powerful 
positions, they are better able to use institutional channels and less likely to 
use noninstitutional channels, thus suggesting that when movements have elite 
allies, noninstitutional forms of protest decline. It is also worthy of note that 
early versions of political opportunity theory (Tilly 1978) argued that protest is 
stimulated not only by opportunity, but also by threat. It could be that elite allies 
do not stimulate protest; instead, the lack of such allies could present a threat, 
which stimulates protest (Goldstone and Tilly 2001; Van Dyke and Soule 2002). 
In other words, it could be that non-Democrats in city governments present a 
distinct threat to homeless interests, a threat that may be best addressed by protest. 
McAdam (2004), in reflecting on two decades of empirical and theoretical work 
in this area, agrees that the importance of threat as a stimulant to protest has 
been eclipsed by opportunity. He notes that for “polities where there is some 
expectation of state responsiveness and few formal barriers to mobilization, we 
should expect perceived threats to group interests to serve, along with expanded 
opportunities, as two distinct precipitants of collective action” (2004:205). 

Considered together, then, our findings and the above observations suggest 
that variation in the yearly occurrence of homeless protest events across our 17 
cities is largely the result of a mix of factors congruent with strain and resource 
explanations of mobilization, with political opportunity variables contributing 
no significant support. 

Having reconciled our significant findings with what we know about the 
homeless and previous research on homeless protest, and having elaborated 
how these findings help to account for variation in the rate of homeless protest 
across cities, we turn to the broader theoretical implications of our findings 
for understanding the conditions underlying the emergence and occurrence 
of social movement activity more generally. In particular, three implications 
are suggested by the findings. The first is that analyses that focus on a mix of 
theoretically relevant factors are likely to provide greater explanatory yield than 
those that focus on a single set of theorized conditions. From this vantage point, 
the question is not so much which theory is more accurate in accounting for the 
emergence and occurrence of social movement phenomena, but what is the mix 
of factors that best accounts for the variation observed? In the case of homeless 
protest in the 1980s, it appears that strain and resource mobilization theories 
importantly contribute to our understanding of the variation in the frequency 
of homeless protest. 

Second, in light of our findings, it is reasonable to ask about the relationship 
between sets of explanatory factors and types of movements and their contexts. 
Regarding the positive and significant association between our strain measures 
and the occurrence of homeless protest, for example, it might be argued that one 
or more forms of strain may be of greater relevance to the mobilization of groups 
or categories whose economic marginality is more acute, or felt more acutely, 
than their political marginality. Such groups, which include the homeless, are 
more likely to clamor for sustenance-related interventions and resources than 
for political rights or identity-related issues. Parenthetically, the fact that many of 
the movements analyzed over the past 30 years have focused primarily on rights 
and identities may account in part for the apparent irrelevance of strain theory 
in explaining the emergence of these movements. 

This suggests, in turn, a third general implication: that there may well be a 
kind of elective affinity between certain theoretical perspectives and the kinds 
of events or activities to which they appear to apply. However, this is not to 
suggest the general explanatory superiority of one theory to another. Rather, we 
contend that our findings raise questions about the fit between each theory's 
claims and the types of movements and levels of analysis to which they apply. It 
may well be that some theories apply more to some kinds of movements or to 
different aspects of the collective action problem. To date, however, applications 
and critiques of theoretical perspectives within the social movement arena have, 
with few exceptions, proceeded as if they apply in the same fashion to all kinds 
of movements and collective action problems, and this is particularly true of 
theoretical explanations for movement emergence and the occurrence of protest 
activity. Our findings, showing variation in the explanatory power of the three 
theories examined in accounting for the rate of homeless protest events, clearly 
call into question the presumed generalizability of the theories across social 
movement activity and collective action problems and thus call for more careful 
assessment of the actual empirical applicability of those claims by considering 
the array of scope conditions that apply to each of the theories. 


Notes 


1. Although reasonable questions have been raised about the assessment and likely 
overestimation of some disabilities among the homeless (see Snow, Anderson, and Koegel 
1994), there is no question that the homeless suffer from a high incidence of disability. 


2. Later in the article, we discuss in detail how we chose these cities for study. 


3. There are numerous analyses of relative deprivation within the collective-action arena 
that have attempted to examine its influence by assessing presumed structural indicators 
(e.g., Snyder and Tilly 1972; Spilerman 1970). But it is questionable whether such analyses 
provide the best assessment of relative deprivation. Seeman (1981) framed the problem well 
when he noted that 


the structuralists . . . do not typically derive indices of the sense of relative deprivation; 
they measure structural variables . . . that are taken to be indicators of the individual- 
level variables at issue. But as we should have guessed, the relationship between 
subjective evaluations of well being and external objective conditions is itself so 
filtered through individual circumstances that there is little evidence of a systematic 
effect of macroenvironmental conditions upon overall sense of well being. In short, 
it is difficult to get to relative deprivation from economic conditions alone, since 
structural variables are only weak indicators at best of the personal experience of 
relative deprivation. (396) 


4. Although people, money, and legitimacy are typically mentioned as major resources 
(Jenkins 1983), there continue to be efforts to clarify conceptually and operationally what 
constitutes an essential resource and to assess the relative importance of various resources 
(e.g., see Cress and Snow 1996; Edwards and McCarthy 2004). 
5. We should note that we face the same issue that other social scientists face: Are the best 
data available, however possibly flawed, usable for analysis? Using newspaper data does not 
deviate markedly from accepted standards in social science. For example, the coverage rate of 
newspapers is probably higher than the rate of reporting of criminal activity, yet criminologists 
and sociologists continue to study crime rates because these are the best data available. Or, 
in survey analysis, acceptable response rates compare favorably to the 78% reporting rate for 
rallies (Oliver and Myers 1999). Thus, like all data in the social sciences, newspaper data are 
subject to biases; this does not mean that these data are unusable, however. 


6. It is important to note that we did not double-code events. Obviously, large protest events 
that took place in Washington, D.C., were covered in several of our newspapers, as were 
a number of other, smaller events in other cities. Thus, when more than one daily paper 
covered a particular event, we chose to code only the report in the paper from the city in 
which the event occurred. 


7. Protest activities coded are as follows: public meetings, petitions, letter-writing campaigns, 
public hearings, rallies, marches, takeovers of housing units, takeovers of other facilities, 
blockades, encampments, hunger strikes or fasting, picketing, lawsuits, lobbying, and press 
conferences. 


8. Within the government, the executive (president, governor, and mayor) and legislative 
(senate, house, city council) branches were the predominant targets. 


9. We also operationalized the decrease in manufacturing jobs as the change in the raw 
number of manufacturing jobs per city size, but we report the results of the dummy variable 
because it makes more intuitive sense and is easier to convey. 


10. Some readers might contend that transfer payments are a more appropriate indicator of 
strain than of resources. We argue, contrariwise, that it is not the mere existence of transfer 
payments that indicates strain, but—per Piven and Cloward (1977) and the previously 
mentioned quotidian disruption thesis—it is either low or declining transfer payments 
that are likely to be associated with strain. Here we emphasize a greater dollar volume of 
transfer payments, arguing that it contributes to the broader resource base and should thus 
be associated with greater frequency of homeless protest. As well, it is important to note that 
the cost of living varies across these cities such that these payments buy more (or less) in 
some of these cities. In an attempt to control for this, we were able to find data on a cost of 
living index for 14 of the cities, but only for 1985. We created a standardized transfer payment 
measure by dividing our total transfer payments by the cost of living index (1985 value). 
The coefficient on this measure, when included in our models, is positive and significant. 
However, we do not report the measure in this article because we were able to find the cost 
of living index only for some of our cities, and not for the entire time period. Nonetheless, 
it is worthy of note that, when standardizing our measure for the cost of living, the effect of 
transfer payments remains the same. 


11. We also included the percentage of the vote for each Democratic candidate separately 
but, since these are fairly highly correlated (as expected), it made more sense to construct 
the index to tap the overall Democratic strength in a city. 


12. In models not shown, we included various dummy variables for time periods corresponding 
to changes in the U.S. presidency to measure for possible changes in the national context 
that might have affected the amount of homeless protest, net of our city-level characteristics. 
In most models, none of these dummy variables was significant and their removal did not 
change the overall fit of the models; thus we do not include them in Table 5. 
Abstract 


We describe a theory of legislative logic. This logic is based on the observation 
that each succeeding stage of the legislative process has increasingly stringent rules 
and becomes more consequential. This logic unevenly distributes the influence of 
social movements across the legislative process. Social movements should have less 
influence at later stages where stringent requirements are more likely to exhaust 
limited resources and where the consequentiality of action will cause legislators to 
revoke their support. We apply the theory to a study of state-level woman suffrage 
legislation. We find that legislators responded to suffragists by bringing the issue 
of woman suffrage to the legislative forum, but once suffrage bills reached the 
voting stage, differences in social movement tactics and organization did not have 
as great an impact. 


Scholars of U.S. political change characteristically see policy adoption as a 
single, discrete outcome. In reality, policy change takes place over a sequence of 
stages, which makes possible varying levels of policy success. Bills must first be 
introduced (a major feat on its own), come to a vote, be passed by two houses, 
and (if a constitutional amendment) be approved via referendum. As Burstein 
and Linton (2002) noted, most studies of policy change tend to focus on the final 
stage of legislation while ignoring other critical moments in the legislative process 
(although see Burstein, Bricher, and Einwohner 1995; Soule et al. 1999; McAdam 
and Su 2002). Whereas the usual method of inquiry depicts policy adoption 
as a dichotomous event, we assert that policy change might be alternatively 
conceptualized as a series of discrete outcomes. 

This alternative conception has theoretical and methodological consequences. 
The conception of policy adoption as a sequential process allows us to determine 
at which stages social movements are most likely to influence the progress of 
a legislative proposal. The “usual suspects” of influential variables on policy 
passage may vary in their influence across the different stages of the legislative 
process. For example, social movements may be more influential at one stage 
of the legislative process than at others. Political scientists argue that lobbying 
groups are most successful at the agenda setting phase of policy-making (Bauer, 
Pool, and Dexter 1963; Milbrath 1963; Scott and Hunt 1965) rather than directly 
affecting voting decisions. Social movement scholars have also suggested that 
movements may be more successful at attaining some types of legislation than 
others (Schumaker 1975; Burstein, Einwohner, and Hollander 1995; Andrews 
2001). 

We posit a theory of legislative logic in order to better understand the effect 
of social movements on legislative outcomes. A growing body of literature has 
accumulated in recent years looking at social movements' effects on legislative 
outcomes (Soule et al. 1999; McAdam and Su 2002). Indeed, many social 
movement theorists see policy change as a sign that movements have succeeded 
in attaining their goals (Giugni, McAdam, and Tilly 1999; McCammon et al. 
2001). But this research pays less attention to how movements are successful. This 
study is an attempt to look more carefully at the pathway to social movement 
success, recognizing that a logic of legislation conditions the influence of social 
movements at different stages. Enhancing our understanding of legislative 
logic will improve our understanding of how movements and other interest 
organizations penetrate the policy arena by delineating the moments in the 
legislative process when movement activities are most influential. 

Discrete time event history analysis is a common statistical method used by 
researchers to study policy adoption (see Burstein and Linton 2002 for a review). 
This method is extremely valuable for calculating the net effects of independent 
variables on a binary event that has a probability of occurring over a given 
span of time or spell (for examples in policy research see Soule and Zylan 1997; 
Grattet, Jenness, and Curry 1998). The event history analysis as applied to policy 
adoption tells us which independent variables have an influence on the rate of 
adoption and the estimated magnitude of those effects; however, it does not allow 
us to determine at which point in the legislative process certain factors are most 
likely to be influential. Alternatively, we disaggregate the legislative process into 
its constituent stages and determine the effects of social movement mobilization 
at each stage. We then compare the effects of social movement mobilization 
across stages to determine at which stage(s) movements are most likely to have 
an impact. In order to do this, we disaggregate the process into four separate 
outcomes: bill introduction, bill voted (a roll-call vote is taken), bill passage in 
one house, and bill passage in the second house. Although other studies have 
looked at various legislative stages (Burstein et al. 1995; Soule et al. 1999; and 
McAdam and Su 2002 all looked at the earlier, agenda-setting stages in Congress 
for particular movement-related issues), no study has contrasted the magnitude 
of these effects across stages nor has any previous study made assertions about 
the differentiated influence of movements across legislative stages. 

We use random effects sequential logit models to assess the effects of 
independent variables on succeeding stages of legislation. Following a theory 
of legislative logic, we contend that stages in the legislative process vary in 
rule stringency and consequentiality. This variation unevenly distributes the 
influence of social movements and other interest organizations on the policy- 
making process. We argue that social movements are more influential at earlier, 
less consequential stages of the process than at later, more consequential stages. 
Social movements may be more successful at raising the salience of an issue 
and getting a bill introduced than getting the bill passed. This differential level 
of influence is due to the fragmented nature of the legislative process and the 
corresponding logic of influence at each stage. The later stages of legislation may 
be more determined by political, structural, and cultural influences than by the 
direct action of social movements. 

We explore the legislation of state-level woman suffrage¹ amendments 
between 1848 and 1918. The movement to grant women the right to vote 
culminated in the Nineteenth Amendment, but throughout the movement's 
history, most of the action took place at the state level. In fact, before 1919 
fifteen states granted women full suffrage and several more adopted variants of 
partial suffrage (presidential, primary, municipal, etc.). Starting with Wyoming in 
1869, many western territories and states extended suffrage to women through 
constitutional amendments, thereby paving the way for legislation at the national 
level. Given the long history of successes and failures, this is an ideal case to test 
whether the logic of legislation fragments social movement influence on policy 
outcomes. Suffragists experienced multiple defeats and victories along the way to 
a constitutional amendment.² The majority of suffrage referenda resulted from 
the legislation of a suffrage amendment. McCammon, Campbell, Granberg, 
and Mowery’s (2001) event history analysis of factors influencing state-level 
suffrage adoption makes an analysis of the impact of movement activities on 
state-level suffrage adoption timely. We draw from their findings and use some 
of their data to examine the impact of the suffrage movement on the likelihood 
of success at the different stages of the legislative process. 


A Theory of Legislative Logic 


Legislatures do not adopt a policy in a single, discrete moment. Yet research 
on policy adoption treats outcomes as if they do (see for example Soule and 
Zylan 1997; Grattet, Jenness, and Curry 1998; McCammon, et al. 2001). In fact, 
the road to legislation is a fragmented one. In order to make policy changes or 
constitutional amendments through legislation, advocates must be successful at 
multiple stages. First, a bill must be introduced by at least one member of the 
state Senate or House of Representatives (also known as the Assembly). After 
introduction, the bill is read for the first time and then sent to committee. Before 
a second reading, the bill must be returned from committee. If the committee 
is unfriendly to the proposal, it may hold the bill until late in the legislative 
session, thus effectively defeating passage. Many woman suffrage bills were killed 
this way. If the bill is reported without recommendation, usually it disappears 
from the agenda. 

Following a second reading, a third reading and a roll-call vote are required. 
The rules are flexible enough that a speaker or chair can sit on the bill and not 
allow the bill to be read a third time. If it makes it to a vote, the bill passes only 
if the necessary constitutional majority votes for the bill. In the case of woman 
suffrage, because the bill proposed a constitutional amendment, the majority 
requirements for bill passage differed by state (51%, three-fifths, two-thirds, or 
three-fourths). If the bill passed by the required majority, it moved to the other 
house where it had to undergo the same process.³ 

Political outsiders' accessibility to the policy-making process is likely limited 
due to the fragmentation of legislation. The political process consists of multiple 
“veto points” (Immergut 1992) and distinctive stages with differing levels of 
political access. Political institutional scholars have traditionally argued that 
the fragmentation of the polity provides minority interests more opportunity 
and access (see Steinmo 1986; Steinmo, Thelen, and Longstreth 1992). On 
the other hand, perhaps the fragmentation of the legislative process hampers 
political access by distributing the impact of mobilization and creating multiple 
moments of legislator decision-making. 

How does the logic of legislation affect the influence of social movements 
on policy-making? Do social movements have less impact at some stages of 
the process than at others? Qualitative work in political science suggests that 
lobbies are most influential at the early, agenda-setting stages of legislation (see 
Baumgartner and Leech 1998 for a review). Bauer, Pool, and Dexter (1963) 
found that interest groups often play roles similar to congressional staff. They 
help frame issues, weigh the importance of information, and draft bills, but their 
influence on voting decisions is minimal. While social movements attempt to 
have a broader impact (using outsider tactics in addition to insider tactics such 
as lobbying), their legislative role may be similar to that of traditional interest 
groups. Perhaps social movements are also more likely to affect legislation by 
drawing attention to an issue, educating legislators, and provoking legislators to 
take some form of action. 

We contend that the legislative process consists of its own institutional logic 
and that this logic differentially distributes the impact of social movements on 
legislation. The legislative logic is based on the observation that each succeeding 
stage has increasingly stringent rules and becomes more consequential. Earlier stages 
are governed by less stringent rules and are less consequential for legislators 
who support the bill. Bill introduction proceeds with relative ease and has fewer 
consequences for legislators who support the bill. In contrast, passing a bill is a 
rather rare, difficult, and momentous accomplishment, particularly in the case of 
constitutional amendments. While it is the case that the difficulty of passing a bill 
in part results from the additive nature of getting from bill introduction to bill 
passage, it is also true that ensuring that a sufficient majority of legislators will vote 
in support of the bill is a much more difficult accomplishment than introducing 
a bill. Bill introduction has fewer requirements—only one legislator needs to 
be persuaded to introduce a bill. Bill introduction is also less consequential, 
garnering less attention from other legislators or the public. Moving a bill from 
committee to a roll-call vote requires support and action from more than one 
legislator. Passing a bill requires even more support. 

Although we argue that bill introduction is a less consequential outcome 
compared to legislative passage, its significance to movement success should 
not be underestimated. As Banaszak notes, bill introduction “was often a sign 
of progress, since it indicated that the issue had achieved a degree of legitimacy 
in the public debate” (1996:84). Others have argued that access, responsiveness, 
and agenda-setting are important outcomes, although not the final outcome of 
most movements (Schumaker 1975; Burstein, Einwohner, and Hollander 1995; 
Andrews 2001). Our main point is that legislators weigh the consequentiality 
of their decisions differently at each stage. 


Social Movements and the Theory of Legislative Logic 


DISTRIBUTION OF MOVEMENT INFLUENCE 


Given our theory of legislative logic, at which stage are social movements 
most likely to have an impact? We posit that social movements should have 
less influence at later stages where stringent requirements are more likely 
to exhaust limited resources and where the consequentiality of action will 
cause legislators to revoke their support. A legislator may feel relatively safe 
introducing a bill without necessarily being committed to the bill’s eventual 
success. Action in earlier stages of the legislative process may be more susceptible 
to symbolic gestures on the part of legislators trying to appease a broad-based 
constituency. 

The fragmented legislative process allowed legislators to decouple their support 
of woman suffrage at the introduction stage from any real intent to see it succeed. 
Thus, active suffragist organizations may have been able to persuade legislators 
to introduce suffrage bills but were unable to gather enough support to see the 
bills passed. Rather than determining the fate of potential suffrage amendments 
by their targeted activism, suffragists likely served an agenda-setting function (see 
Schumaker 1975; Burstein, Einwohner, and Hollander 1995; Andrews 2001). 

Suffragists were often frustrated by seemingly promising attempts at 
legislation that lost support in later stages. A leader of the California suffrage 
movement summed up the feeling of suffragists toward the insincere pledges 
of legislators: 


As none of the lawmakers believed that his party or personal interests 
could be served in any way by our success, neither ourselves nor our 
measure was taken seriously. But we were “nice ladies” . . . and had to 
be pleased, so these polite statesmen administered to us from time to 
time doses of “soothing syrup” of their own special legislative brand (as 
cited in Solomons 1915:8–9). 


The “soothing syrup” of bill introduction was not enough to assure bill passage. 
In fact, once bills reached a second or third reading and moved to the floor for 
a roll-call vote, legislators often abandoned amendments, leaving them “half- 
orphan(s)” without political allies (Solomons 1915). We test the hypothesis that 
movements should be most influential in the bill introduction and voting stages 
and less influential at the passage stages by looking at the effects of suffragist 
tactics and the organizational capacity of the movement across stages of the 
legislative process. 


TACTICS 


Social movements employ different tactics for affecting political outcomes. 
McCammon et al. (2001) discovered that neither insider nor outsider tactics 
affected state-level woman suffrage adoption. Banaszak (1996), however, found 
that political lobbying (an insider strategy) influenced legislative success. Neither 
study provided support for the idea that disruptive (or outsider) tactics helped 
achieve success, and there is mixed evidence that insider tactics convinced 
legislators to support woman suffrage. We focus our attention on the influence of 
three tactics, the first two of which are clearly insider tactics: political lobbying, 
candidate campaigning, and cultural framing. 

Suffragists developed a strong lobby outside of the traditional party system 
(Clemens 1997). The lobby may have been particularly useful in educating 
legislators about women’s political capabilities. Learning to “talk the language 
of men,” women were also educated through the use of the lobby (as cited in 
Clemens 1997:233). In addition to lobbying, some state suffrage organizations 
campaigned on behalf of candidates friendly to their cause in the hopes of 
gaining more political allies. 

Social movement scholars have theorized about the importance of cultural 
framing; in cultural framing, arguments are framed via dominant cultural 
ideologies in order to communicate effectively a group's political goals (see 
Snow and Benford 1988, 1992). McCammon et al. (2001) found that suffragists’ 
attempts to frame the necessity of woman suffrage in terms of men’s and 
women’s separate spheres positively influenced state-level woman suffrage 
adoption. By communicating the importance of woman suffrage in a way that 
resonated with extant cultural sensibilities, suffragists convinced legislators and 
staved off criticisms of being disruptive radicals. 

We expect that the influence of political lobbying, candidate campaigning, 
and cultural framing will be greater at the introduction stage of legislation and 
will be less influential at later stages. 


ORGANIZATIONAL CAPACITY 


Another predictor of social movement influence is the organizational capacity 
of a movement.⁴ In order to create the changes they desire, social movements 
must be strong enough in numbers to matter to policy makers. Gamson 
(1975) argues that mobilization success is correlated with the degree of 
bureaucratization and centralization within the movement. Movements that 
have a strong organizational structure are perhaps more capable of carrying 
out a policy campaign (see Goldstone 1980) and should be able to influence 
legislators’ perceptions regarding an issue. 

The organization of the suffrage movement mirrored the federalist structure 
of governance (Skocpol, Ganz, and Munson 2000) in that it was differentiated 
hierarchically into national, state, and local levels (Dubois 1978). In some states 
the local clubs operated autonomously. National organizations were also involved, 
providing money and expertise (see Earl 1976). Szymanski (2003) demonstrates 
that local movements coordinated through centralized national organizations 
formed the heart of the Prohibition movement and that this centralized yet local 
effort created the coordination necessary for their eventual success. Suffragists 
used a similar kind of organization. 

We use three indicators of social movement organizing. The number 
of suffrage organizations and the number of suffrage publications serve as 
indicators of local movement mobilization. In addition, the presence of a 
bureaucratized organization (organized with a hierarchical structure of offices 
and affiliated with a national organization) mobilized political resources at 
the local, grass-roots level and made resources from the national organization 
available. We expect that suffragists’ tactics and their movement’s organizational 
capacity should matter most at the introduction and vote stages of legislation 
but will have less influence at bill passage. 

It is possible that insider tactics and organizational capacity may differentially 
impact success at any given stage. For example, lobbying may have more of an 
impact on bill introduction, and cultural framing may have more of an impact 
on bringing the bill to a vote. Moreover, the timing and intensity of tactics 
may change over the course of the legislative process as activists respond to 
intermediate successes or the possibility of sudden defeat. However, in this 
analysis we are only interested in assessing the overall impact of social movement 
mobilization at various stages using available indicators of tactics and capacity. 
The temporal granularity of available data does not allow a dynamic analysis of 
the changes in tactics across the various stages. 


Modeling the Legislative Process 


ESTIMATION TECHNIQUES 


We model the effects of movement variables on outcomes in four legislative 
stages: bill introduction, bill voted (a roll-call vote is taken), bill passed in one 
house, and bill passed in two houses. The unit of analysis is state legislative 
session year. We estimate the likelihood that success will be attained at each 
legislative stage for any given year that the legislature is in session. A sequential 
logit analysis is an appropriate way to conceptualize the legislative stage process. 
In a sequential logit model, cases are only included in the analysis if they 
succeeded at previous legislative stages. For example, in order for a bill to be 
considered and voted on, it must first be introduced. Thus, after modeling bill 
introduction, we then estimate the likelihood that the legislature will vote on 
the bill selecting only the state legislative session years in which a bill has been 
introduced. Observations at each later stage are selected in the same way. N 
decreases at each stage as more cases are selected out due to failure at an earlier 
stage. Table 1 displays the outcomes from the different legislative stages across 
time. Note that reading down the columns of the table, each stage is a subset 
of the previous stage. 
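
The sample selection behind the sequential logit can be illustrated with a short sketch. It is a minimal illustration in Python, not the authors' estimation code: the field names (introduced, voted, passed_one_house, passed_two_houses) and the four example session records are hypothetical. The function only builds the nested samples on which a logit would then be estimated at each stage.

```python
# Hypothetical stage names; each record is one state legislative session year.
STAGES = ["introduced", "voted", "passed_one_house", "passed_two_houses"]


def stage_samples(sessions):
    """Return, for each stage, the sessions that enter that stage's model.

    A session enters the sample for a stage only if it succeeded at every
    earlier stage, so each sample is a subset of the one before it.
    """
    samples = {}
    at_risk = list(sessions)
    for stage in STAGES:
        samples[stage] = at_risk
        # Only sessions that succeeded here move on to the next stage.
        at_risk = [s for s in at_risk if s[stage]]
    return samples


# Invented records, for illustration only.
sessions = [
    {"state": "A", "year": 1890, "introduced": 1, "voted": 1,
     "passed_one_house": 1, "passed_two_houses": 0},
    {"state": "A", "year": 1892, "introduced": 1, "voted": 0,
     "passed_one_house": 0, "passed_two_houses": 0},
    {"state": "B", "year": 1890, "introduced": 0, "voted": 0,
     "passed_one_house": 0, "passed_two_houses": 0},
    {"state": "B", "year": 1892, "introduced": 1, "voted": 1,
     "passed_one_house": 0, "passed_two_houses": 0},
]

samples = stage_samples(sessions)
for stage in STAGES:
    print(stage, len(samples[stage]))
```

Printing the sample sizes shows N shrinking from stage to stage, which mirrors the nesting visible when reading down the columns of Table 1.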

Only full suffrage bills (bills granting women full voting rights) are considered 
in the analysis. Partial suffrage bills (presidential, primary, etc.) are excluded from 
the analysis due to lack of equivalence. Logit models regress the binary outcomes 
on a set of social movement variables and a set of control variables. 

This study uses cross-sectional time-series data. All sessions are prior to 
adoption of full woman suffrage in the state.⁵ Although the first suffrage bill 
was introduced in 1854, we exclude cases from years earlier than 1866 due to 
missing data for some of the variables. Only sessions prior to 1919 are analyzed 
so that any effects resulting from the passage of the Nineteenth Amendment do 
not confound the results. 

Table 1. Summary of Success at Different Legislative Stages Across Time 

                                                              Time Periods
Legislative stage                                    1860-1879  1880-1899  1900-1919     Totals
Legislative sessions                                       576        504        435       1515
Sessions in which bills are introduced                      55        104        181        290
Sessions in which a roll call vote was held                 31         70        103        204
Sessions in which bills passed at least one house           16         39         60        115
Sessions in which bills passed two houses                    6         22         39         67

Note: Totals do not match up perfectly with the number of observations in statistical models because 
of missing values in independent variables. 


Estimation techniques must take into account the time invariant unobserved 
characteristics of states that affect legislative outcomes and therefore lead to 
correlation of errors within states over time (Stimson 1985). We use a random 
effects logit model that takes into account panel-level variance and adjusts 
standard errors appropriately. Random effects models treat unobserved time 
invariant heterogeneity across states as a random variable (Petersen 1993). The 
advantage to using a random effects model, compared to a fixed effects technique, 
is that it allows us to take into account variation both over time and between 
states. We assessed all of the models with variance inflation factors (VIF) to 
detect multicollinearity in the data. Collinearity was not a problem with the 
regression models presented here. Because we expect that over time states were 
more likely to introduce, vote on, and pass suffrage bills, we control for temporal 
effects by including a year variable. 
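
For readers who want the estimation strategy in symbols, one conventional way to write a random effects logit with a linear time trend is given below. The notation is an assumption introduced here for illustration and is not taken from the authors: y_it is the outcome at a given legislative stage for state i in session year t, x_it collects the movement and control variables, and u_i is the state-level random effect. The second expression is the standard definition of the variance inflation factor, where R_j^2 comes from regressing predictor j on the remaining predictors.

```latex
\[
\operatorname{logit}\,\Pr(y_{it} = 1 \mid \mathbf{x}_{it}, u_i)
  = \alpha + \mathbf{x}_{it}'\boldsymbol{\beta}
    + \gamma\,\mathrm{year}_{t} + u_i,
\qquad u_i \sim N(0, \sigma_u^{2})
\]
\[
\mathrm{VIF}_j = \frac{1}{1 - R_j^{2}}
\]
```

Under this formulation the random effect u_i absorbs time-invariant state characteristics, which is what induces the within-state correlation of errors described above.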


DATA AND MEASUREMENTS 


The mobilization variables were collected by Holly McCammon and her associates 
from primary and secondary historical sources (see McCammon et al. 2001) 
except for the suffrage publication variable, which was collected by the authors 
using secondary and primary historical sources.⁶ The authors collected the 
legislative data directly from primary sources. Researchers visited the legislative 
archives or state law libraries in each state and conducted primary historical 
research of legislative records,⁷ providing more complete information on the 
legislative history of woman suffrage bills than would have been the case if we 
had only relied on secondary histories.⁸ 


Social Movement Variables 


We used six social movement variables. Suffrage lobby is a dichotomous variable 
measuring the presence of a suffrage political lobby. Candidate campaigning is 
a dichotomous variable measuring whether a suffrage organization campaigned 
specifically for candidates who were sympathetic to the suffrage cause. Separate 
spheres is a dichotomous variable measuring whether suffragists framed their 
argument in terms of the distinct attributes possessed by women that make their 
contribution to politics needed. Number of suffrage organizations measures the 
degree of mobilization at the local level for woman suffrage in any given state for 
a given year. Suffrage publication indicates the total number of woman suffrage 
newspapers or other regular publications printed in the state. The presence of 
a bureaucratic state suffrage organization is a dichotomous variable indicating 
whether an organization is present that has a hierarchical leadership structure 
and is affiliated with the national organization (indicating that the organization 
is both hierarchical and centralized). 


Control Variables 


Four sets of control variables account for other explanatory factors of woman 
suffrage adoption. The first set measures the presence of potential and actual 
political allies. Total number of parties measures the extent to which party diversity 
existed in a state and captures the presence of third parties. Historians have noted 
that the Progressive and Populist parties were important allies of the suffragists 
since they were often more progressive and reform oriented than the Democrats 
or Republicans. Third-party populists are credited for legislative success in at 
least one state—Colorado (Marilley 1996). However, McCammon et al. (2001) 
found no effect for presence of third party in their analysis of state-level suffrage 
adoption. Using information from Gubernatorial Elections (1998), we calculated 
a count of the total number of parties registering votes during an election. The 
variable not only measures the presence of third parties in the political process, 
but also the extent to which a plurality of political views received electoral 
support. Party endorsement is a dichotomous variable indicating whether at 
least one of the major parties endorsed woman suffrage in their platform.⁹ This 
variable was constructed by Holly McCammon and her associates from primary 
and secondary sources. 

The second set of control variables measures state structure. Two measures 
serve as indicators of the institutionalized rules of the legislative process. Legislative 
majority is an ordinal variable measuring variation in the percentage of legislator 
votes needed to pass a constitutional amendment. This variable ranges from 1 to 4, 
with 4 being the highest percentage (75%) and 1 being the lowest (51%). Sessions 
required measures the number of sessions required to move a passed bill on to the 
ballot for referendum. In some states, bills proposing constitutional amendments 
had to be passed by two consecutive legislative sessions. Often a suffrage bill was 
passed in one legislative session but failed in the second legislative session. Political 
openness measures a latent cultural willingness to expand democratic rights (see 
Berry 1994; Savage 1978; Huber, Ragin, and Stephens 1993). The measure is based 
on the political openness score constructed by Clemens (1997) and was derived 
from several sources to construct a time-varying variable (Kettleborough 1923; 
Merriam and Overacker 1928; Ranney 1978; Heckelman 1995).¹⁰ 

The need to control for such variables is supported by our own analysis 
(Cornwall et al. 2004), McCammon and Campbell’s research on state-level woman 
suffrage adoption (2001), and the social movement literature theorizing about the 
importance of political opportunity structures for movement mobilization and 
success (see Amenta, Carruthers, and Zylan 1992; Tarrow 1994; McAdam 1996; 
Soule et al. 1999; McCammon et al. 2001; Soule and Olzak 2004). 

The last set of variables controls for different aspects of cultural change. Two 
measure changing gender practices and norms as an indicator of the tension 
between the cultural environment and extant policy. These variables are included 
to reflect differences in the “gendered opportunity structure” as discussed by 
McCammon et al. (2001). Following their lead, we assume that as women moved 
into “traditionally male domains, and the social order between the sexes began a 
transformation,” assumptions about women’s place in society broke down, paving 
the way for suffrage adoption (2001:53). Percentage of women in nonagricultural 
employment measures the extent to which women engage in occupations more 
common to a public setting.¹¹ Women employed in professional occupations as a 
percentage of women in nonagricultural employment measures the extent to which 
women are employed in higher-status, more autonomous jobs. Both of these 
variables were collected from census reports ranging from 1870 to 1920.¹² They 
are linearly interpolated to fill in measurement gaps. Other scholars studying 
policy adoptions have found that policy makers tend to mimic the actions of 
proximate legislatures (Grattet et al. 1998; Soule and Zylan 1997; Zylan and 
Soule 2000). Percentage of states in a region with woman suffrage measures the 
potential for the diffusion of suffrage ideals. This measure varies over time and 
is lagged yearly.¹³ 
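
The linear interpolation applied to the decennial census measures can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' procedure: the function name interpolate_census and the percentages are invented for the example, and years outside the observed range are rejected rather than extrapolated.

```python
def interpolate_census(observed, year):
    """Linearly interpolate a value for `year` from decennial observations.

    `observed` maps census years to measured values. Years outside the
    observed range are not extrapolated; a ValueError is raised instead.
    """
    if year in observed:
        return observed[year]
    years = sorted(observed)
    if year < years[0] or year > years[-1]:
        raise ValueError("year lies outside the observed census range")
    # Find the two census years that bracket the requested year.
    for lo, hi in zip(years, years[1:]):
        if lo < year < hi:
            weight = (year - lo) / (hi - lo)
            return observed[lo] + weight * (observed[hi] - observed[lo])


# Made-up percentages for one state, for illustration only.
pct_nonagricultural = {1870: 10.0, 1880: 14.0, 1890: 20.0}
print(interpolate_census(pct_nonagricultural, 1875))
```

A session year halfway between two censuses receives the midpoint of the two census values, so the interpolated series changes at a constant rate within each decade.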

We also controlled for opportunities for success at each legislative stage. A bill 
is more likely to make it through committee if there are many woman suffrage bills 
introduced in that session. As the number of suffrage bills introduced increases, 
we would expect that the likelihood that a bill passes out of committee is also 
greater. In some instances where a bill was introduced in each house, both bills 
were voted on separately. The likelihood that a bill will pass is also a function of 
the total number of bills that make it to a roll-call vote. We therefore included 
two variables to control for this endogenous effect. The number of woman suffrage 
bills introduced should positively affect the likelihood of success at the committee 
stage, and the number of woman suffrage bills voted should increase the likelihood 
of success at the passage stage.¹⁴ 


Results 


Models 1 and 2 (see Table 2) report the results of a logit model estimating the 
effects of the independent variables on bill introduction. Model 1 is the full 
model and contains both the control variables and the social movement variables 
of interest. Of the movement variables, three (total number of organizations, 
suffrage publication, and separate spheres framing) are not significant. The other 
three (bureaucratized organization, political lobby, and candidate campaigning) 
are significant and positive, when controlling for other important explanatory 
factors. The likelihood ratio test comparing Model 2 (the reduced model) to 
Model 1 (the full model) reveals that including the social movement variables 
improves the fit of the model. 
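
The likelihood ratio comparison of a reduced and a full model can be sketched in a few lines. This is an illustration only: the log-likelihood values are invented and do not come from Table 2, and the function name likelihood_ratio_test is hypothetical. The degrees of freedom are set to six on the assumption that the full model adds the six social movement variables; the closed-form chi-square tail used here is valid only for even degrees of freedom.

```python
import math


def likelihood_ratio_test(loglik_reduced, loglik_full, df):
    """Return the LR statistic and its chi-square p-value.

    The closed-form survival function used here holds only for even
    degrees of freedom, which covers a block of six added variables.
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this sketch supports positive even df only")
    stat = 2.0 * (loglik_full - loglik_reduced)
    half = stat / 2.0
    # For df = 2k: P(X > x) = exp(-x/2) * sum_{i<k} (x/2)^i / i!
    tail = math.exp(-half) * sum(
        half ** i / math.factorial(i) for i in range(df // 2)
    )
    return stat, tail


# Invented log-likelihoods, not values from the article's tables.
stat, p_value = likelihood_ratio_test(
    loglik_reduced=-110.0, loglik_full=-100.0, df=6
)
print(round(stat, 2), round(p_value, 4))
```

A small p-value indicates that the block of added variables improves fit beyond what chance would produce, which is the sense in which the full model is preferred to the reduced one.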

Models 3 and 4 report the results of a logit model estimating effects on roll- 
call vote selecting for state session years in which a bill was introduced. Note that 
the original N of 1143 has been reduced to an N of 306. In Model 3, the only 
social movement variables that are statistically significant are the presence of a 
bureaucratic movement organization and a political lobby. The likelihood ratio 
test comparing the full model to the reduced model indicates that including the 
movement variables in the analysis improves the fit of the model. This finding 
suggests that once a bill had been introduced, the presence of a bureaucratic 
suffrage organization and political lobbying increased the likelihood that the bill 
would come to a vote. 

Models 5 and 6 show the effects of independent variables on bill passage in 
one house. None of the social movement variables were statistically significant 
at this stage, and a likelihood ratio test confirms that the inclusion of movement 
variables does not significantly improve the fit of the model. Finally, in Models 
7 and 8 we regress the odds of passing a bill in two houses on the independent 
variables. Similar to the previous stage, we find that none of the social movement 
variables are significant (nor do they improve the overall fit of the model), 
although the effect of political lobbying increases greatly. It appears that social 
movement variables do not have significant effects on the passage stages of the 
legislative process. 

Direct interpretation of the statistical significance as an indication of the 
difference of effects between stages is tricky because it is possible that the lack 
of significance at a later stage is merely due to loss of statistical power in the 
final models. Therefore, we also compare the size of social movement coefficients 
with the confidence intervals of earlier stages (those that had a larger N) to assess 
to what extent the lack of significance is due to loss of statistical power. In this 
approach, we should expect that $b_{\text{later stage}} \notin b_{\text{earlier stage}} \pm (1.96 \times SE)$ if an effect genuinely differs between stages. Table 3 shows 
comparisons of coefficients to confidence intervals of preceding stages. Three 
social movement variables have significantly different effects at later stages. The 
effect of having a suffrage publication is more negative at the voting and passage 
stages compared to its effect at the introduction stage. The effect of political 
lobby is also significantly different at the voting and passage stages compared to 
the introduction stage. The effect of candidate campaigning is only significantly 
different at the final two passage stages. The results from Table 3 temper the 
earlier findings and thus suggest that only some of the suffrage movements’ efforts 
(publishing newsletters, lobbying, and candidate campaigning) were less likely to 
impact success at later stages. We cannot determine, however, that the effect of a 
bureaucratic organizational form changed over the legislative process. 


Table 3. Comparisons of Coefficient Size across Stages of the Legislative Process 

Social Movement Variable Coefficients versus Confidence Interval of an Earlier Model 
Comparisons reported: Model 3 versus Model 1; Model 5 versus Model 1; Model 5 versus Model 3; 
Model 7 versus Model 1; Model 7 versus Model 3; Model 7 versus Model 5 

Political lobby              Yes  Yes  Yes 
Candidate campaign           Yes  Yes 
Separate spheres frame 
Number of organizations 
Suffrage publication         Yes  Yes  Yes 
Bureaucratic organization 

Note: Yes indicates that the coefficient was significantly different from the earlier-stage coefficient, 
using the 95% confidence interval as the criterion. 
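The comparison rule behind Table 3 can be sketched in a few lines: a later-stage coefficient counts as different when it falls outside the 95% confidence interval of the earlier-stage estimate. The function name and the numbers below are hypothetical and serve only to illustrate the rule.

```python
def outside_earlier_interval(b_later, b_earlier, se_earlier, z=1.96):
    """True if the later-stage coefficient lies outside the earlier stage's 95% interval."""
    lower = b_earlier - z * se_earlier
    upper = b_earlier + z * se_earlier
    return not (lower <= b_later <= upper)


# Hypothetical coefficients and standard error, for illustration only.
# Later estimate well below the earlier interval:
print(outside_earlier_interval(b_later=-0.90, b_earlier=0.40, se_earlier=0.25))
# Later estimate inside the earlier interval:
print(outside_earlier_interval(b_later=0.55, b_earlier=0.40, se_earlier=0.25))
```

Because the interval comes from the earlier stage, which has the larger N, the check does not depend on the reduced statistical power of the later models.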


Discussion 


The results presented here indicate that the effect of the woman suffrage 
movement on legislative action was greater in earlier stages of the legislative 
process. Suffragists appear to have been most successful at increasing the salience 
of the issue through the use of a political lobby and candidate campaigning. 
Legislators were willing to respond to organized suffragists by bringing the issue 
of woman suffrage to the legislative forum, but once suffrage bills reached the 
point at which they were to be voted, differences in social movement tactics did 
not have as great an impact. At this point, the effect of the movement itself was 
diminished in favor of other influential variables. 

These findings support the theory of legislative logic and are consistent with 
qualitative research in political science on the effects of lobbies on legislation. 
Earlier stages (namely the bill introduction and roll-call vote stage) have less 
stringent rules and are less consequential to legislators. As rules become more 
stringent, support from more legislators is needed to move a bill on to the next 
stage. As stages become more consequential, legislators weigh their actions 
more seriously and are less likely to respond to outsider groups (such as social 
movements) than they are to other political and cultural factors. The result of 
legislative fragmentation is that movements and other interest groups find more 
access, or their voice is more audible, when the stakes are small. Legislators may 
attempt to pacify movements by introducing bills that will likely never receive 
strong support in a roll-call vote. 

If movements are successful at getting an issue on the agenda but less 
successful at changing the opinions of legislators to vote in a desired way, what 
other factors are important in legislative voting patterns? Given that our theory 
only pertains to social movement access, we can only speculate regarding the size 
and direction of the effects of the political opportunity structure (POS) variables 
and cultural indicators. A few findings do strike us as interesting, however, and 
are worthy of some post hoc explanations. 

Of the political ally variables, we found that both the total number of 
parties and party endorsement positively affected the odds of introducing a 
bill. However, neither was influential in bringing a bill to vote, and only the 
total number of parties influenced the odds of passing a bill. The effect (or lack 
of effect) of party endorsement is not strange given the extremely apathetic 
view most parties held towards woman suffrage. The historical context of 
woman suffrage legislation was one of patronage politics and of distributive, 
rather than redistributive or ideology-oriented, party strategies (Skocpol 1992). 
Because party agendas were set with local interests and leaders in mind, woman 
suffragists never secured the long-term endorsement of a major party (at 
least not for a significant period of time), and thus suffrage was rarely able to 
receive bloc support in legislation. When Democrats or Republicans endorsed 
woman suffrage in their platforms, they may have done so in fear of alienating 
their progressive constituents, and therefore neither party became ideologically 
unified on the issue. After experiencing several disappointing bill defeats in a 
legislature dominated by a Republican majority that officially endorsed the 
cause, suffragists in Massachusetts decided that seeking party endorsement 
was no longer a fruitful use of resources. The state association advised that it 
was “worse than useless to ‘stand by the Republicans’ or any other party while 
we are deprived of the only means of enforcing a political opinion” (Stanton 
et al. 1985:280). 

State structure did not have a significant effect at the introduction stage; only 
sessions required significantly influenced the voting stage; and only political 
openness affected the bill passage stage. The effect of sessions required is positive 
and reflects the fact that once a bill passed one session (in states that required 
consecutive passage), a bill was much more likely to be voted on the following 
year. The effect of political openness is not as straightforward. Political openness, 
as a measure of a legislative legacy in democratic progressiveness, is a very strong 
indicator of legislative willingness to pass woman suffrage bills. Democratically 
progressive states were much more likely to pass woman suffrage bills, holding 
constant other controls. However, political openness does not predict bill 
introductions or committee passage. It is possible that political openness works 
harmoniously with the activity of social movements. After social movements did 
the work of increasing sensitivity to the issue, states that were more democratically 
progressive were the most likely actually to do something for suffragists. 

Of the cultural variables, the most influential variable was the percentage of 
working women in nonagricultural occupations. We find it interesting that this 
was the only variable to be influential at all stages of legislation. This finding 
offers strong evidence that variation in gender relations, norms, and rules had a 
strong impact on legislators' decision-making. Not only did it increase awareness 
of woman suffrage, but it also increased the likelihood of support at the passage 
stage. 

The women's occupation variable may be a latent indicator of public opinion 
regarding women's role in society. Although we cannot measure opinion on the 
suffrage issue directly (given the lack of data on public opinion during this time 
period), we speculate that, as women gained more access to work outside of the 
domestic sphere, public opinion shifted in favor of giving women more public 
presence. Although we do not think this means that women necessarily achieved a 
significant level of equality at this time due to public employment, their presence 
certainly challenged the previously dominant logic of separate spheres (Cott 
1977). Insofar as changes in the gendered opportunity structure represent shifts 
in opinion (McCammon et al. 2001), our findings suggest that public opinion 
has a steady effect on decision-making throughout the legislative process, even as 
social movement influence declines. This would support theoretical claims that 
public opinion is an important determinant of policy outcomes (see Burstein 
1999; Burstein and Linton 2002). Future research should consider this important 
theoretical claim in more detail. 

It is also interesting that states in regions with other previous adopters of 
woman suffrage were more likely to introduce bills but not more likely to bring 
these bills to a vote or passage. Although very little is empirically known about 
how ideas are transmitted from state to state, theorists often see emulation as a 
kind of adaptive strategy that maintains legitimacy and cultural fit (Strang and 
Meyer 1993; Strang and Soule 1998). We find that the effect of diffusion may 
create more salience on an issue and not directly affect the vote of state legislators. 
Hence, legislators are not more likely to vote on a bill just because other states in 
their region have done so, but the high level of salience on a policy issue that is 
produced by regional adoptions may create an environment more suitable for its 
discussion. Given the post hoc nature of this explanation, future theoretical work 
could take seriously the idea that diffusion may occur differentially throughout 
the legislative process. 
Conclusion 


We have suggested a theoretical perspective and offered a methodological tool for 
understanding the effects of social movements on policy change. Statistical results 
support our contention that social movement access to the legislative arena is 
differentially distributed according to the stringency of rules and consequentiality 
of the corresponding legislative stage. The findings suggest several implications 
for future research on policy change and social movement outcomes. 

First, we believe sequential analysis of legislation provides insights not 
apparent when using other quantitative methods. In the study of movement 
outcomes, quantitative studies usually track the rate of passage over time and 
estimate the effects of various factors on that rate. While this approach helps us 
understand the long-term dynamics of policy change, it may oversimplify the 
causal relationships. For example, findings may indicate an association between 
social movement activity and movement outcome but offer little insight into 
the actual ordering of social movement activity and the legislative process. 
Though the association is generally robust, the finding tells us very little about 
how social movement mobilization or other interest group behavior actually 
produced the outcome. An analogy demonstrates our point. When making a 
cake one should not only know which ingredients to add to the mixture but also 
the specific order in which ingredients should be added. Without knowledge of 
both the ingredients and of the sequence in which they should be added, the 
final product would probably be less than desirable. Similarly, understanding 
the legislative process in more detail demystifies policy change and more clearly 
specifies the mechanisms whereby change occurs. 

As we seek to specify the mechanisms whereby change occurs, we need also 
to consider the "processes and mechanisms of [movements' political] impact" 
(Andrews 2001:90). For example, movement scholars have suggested that 
more attention should be given to agenda-setting or the agenda responsiveness 
function of movements (Schumaker 1975; Burstein, Einwohner, and Hollander 
1995; Andrews 2001). The theory of legislative logic suggests that more attention 
should be given to the intermediary legislative steps as well. We were unable 
to explore the differential impact of tactics and organizational capacity at any 
given stage, which leaves an important question unanswered. We need a clearer 
picture of how well movement organizers are able to read the political climate, 
respond to potential defeat, and adjust their tactics to the legislative process. 

But we offer more than a methodology. The theory of legislative logic 
conceptualizes the legislative process as fragmented by multiple stages of decision 
making. Because these stages vary in their level of stringency and consequentiality, 
the underlying factors that contribute to legislative outcomes will differ by 
stage. Later stages require increasingly more legislative support or the reform 
cannot survive and move on to the next stage. As the degree of support needed 
increases, social movements may find that their voices are drowned out by more 
decisive structural or cultural elements. Succeeding stages of legislation are also 
more consequential to lawmakers. Legislators may be more willing to appease 
mobilized interests at early stages of legislation but back off in their support 
when the stakes are high, leaving bills “half-orphaned.” At more consequential 
stages of legislation, political, structural, and cultural forces have greater impact 
on moving bills forward. 

Together these two factors, stringent rules and consequentiality, form the 
logic of the legislative process—a process that is more fragmented than social 
movement scholars have heretofore conceptualized. Rather than increasing 
political access, this kind of fragmentation may limit the access of political 
outsiders and contribute to the inertia of political institutions. Moreover, policy 
reform or constitutional revision may also depend upon favorable cultural and 
structural conditions. 

The theoretical assumptions made by policy and social movement theorists 
are not quite as straightforward as they seem. Our findings call for more careful 
interpretations of the relationship between movements and policy outcomes. 
The findings mirror Rosenberg’s (1991) contentions that judicial decisions are 
less influential than previously thought and that courts only make significant 
reforms when other barriers—cultural or structural—have been removed. 
Courts then are simply following the trend or “officially recognizing the evolving 
state of affairs” (1991:338). As with courts, legislators may be more likely to pass 
legislation once cultural practices appear to be well ahead of previous policy. 
Legislation is enacted to make sense of culture. 

Suffragists were “outsiders” to the political process. Does outsider/insider 
status account for the results reported here? Political scientists have argued that 
the main function of interest group lobbyists is to help legislative decision 
makers set the policy agenda (see Baumgartner and Leech 1998; Bauer, 
Pool, and Dexter 1965). Perhaps social movements and interest groups are 
more similar in this respect than previously thought. Future research should 
investigate the extent to which the influence of social movements and interest 
groups is differentially distributed across the legislative process. Conceivably, 
the “insider” nature of interest groups, especially business interests, may afford 
greater influence across legislative stages. 

This study also enhances our understanding of the ways in which movements 
most effectively influence legislative decision-making. We have found that some 
forms of movement organization may be more effective than others. The number 
of movement organizations in a state was not associated with success at any stage. 
Instead, bureaucratic movement organizations and movements using lobbying 
and candidate campaigning were associated with success at the introduction 
stage. In many cases, local grassroots organizing may be insufficient to produce 
positive legislative outcomes. Interests that organize following the example of 
other more established political players may be better equipped to effect change. 
The results indicate that suffrage organizing produced positive outcomes and 
that the influence of suffrage organizing was integral to the eventual success of 
the movement. 

While woman suffrage legislation may have been exceptional in some ways, 
the long road to success has provided a rich source of information about the 
many difficulties and pitfalls of movement organizing and legislative defeat. 
The suffrage movement serves as an interesting case against which others 
can be compared. As a methodology, sequential analysis may be helpful in 
understanding change in other policy arenas. Research on one of the most closely 
studied policy areas, welfare state legislation, has not paid sufficient attention to the 
variation in successes and failures across the legislative process at the state level. 
While poring over archival data and extracting all of the bills introduced on a 
subject is a painstaking (and expensive) project, the richness in detail produced 
allows more careful analysis of the legislative processes whereby political 
transformation occurs. 


Notes 


1. We use the common language of the day, woman suffrage, rather than women's 
suffrage. 


2. State-level suffrage adoption was achieved via multiple paths: (1) legislating a suffrage 
amendment (24 states legislated a suffrage amendment, but only 11 of the referenda were 
successful); (2) passing an amendment at a state constitutional convention (6 states held 
referenda as a result of amendments proposed at constitutional conventions; only 2 referenda 
were successful); and (3) placing an amendment on the ballot via voter initiative (6 states 
held referenda as a result of voter initiatives; only 2 were successful). Some states attempted 
to grant women the right to vote via multiple methods. A total of 54 referenda were held 
in the various states. 


3. Once a bill proposing a constitutional amendment passes both houses, the amendment 
must be approved by the voters in a referendum, unless, of course, the Governor vetoes the 
bill. Obviously, our theory of legislative logic will not apply beyond the legislative process. 
However, in future analysis we hope to explore the factors that influence voter support and 
referenda success (see also McDonagh and Price 1985). 


4. Although McDonagh and Price (1985) did not find that organizational capacity 
affected adoption of referendums, McCammon et al. (2001) showed that social movement 
organizations' capacities to raise funds for the campaign significantly affected adoption of 
state-level suffrage. 


5. Territorial bills are excluded from the analysis due to differences in legislative process and 
incomplete data. Bills introduced in territories did not require referendum success and thus 
may have affected other legislative outcomes. We also left out legislative sessions where the 
constitution specifically denied legislatures constitutional revision capabilities. In most of 
these cases, amendments could only be made during state constitutional conventions. 


6. We also performed a Hausman specification test in order to determine whether the random 
effects model was a better fit than the fixed-effects model. 


7. Most of the information about suffrage newspapers came from the American Newspaper 
Directory (1871, 1886) and the American Newspaper Annual and Directory (1913, 1916). A 
complete list of sources can be obtained by request from the authors. 


8. We located a total of 582 full suffrage bills introduced in territorial and state legislatures 
between 1848 and 1918. Banaszak (1996) reports 418 full suffrage bills between 1870 and 
1920. She culls the bills from historical records of the suffrage movement. We visited state 
archives and libraries in the 48 contiguous states and searched the legislative journals for both 
the House (Assembly) and Senate to identify both partial and full suffrage bills. 


9. Note that party endorsements are not equivalent to suffragist endorsements of parties. 
While suffragists may have been wary of endorsing any particular party (in fear of partisan 
retaliation), they gladly accepted endorsements from any of the major political parties and 
saw these endorsements as a visible sign of the growing acceptance of suffragists in the body 
of institutionalized politics. We thank one of the anonymous reviewers for drawing our 
attention to this distinction. 


10. We only use three of the four indicators used in the original political openness measure 
proposed by Clemens (the Australian ballot, mandatory direct primary, and popular initiative 
and/or referendum). Clemens included whether or not the state had granted women the 
right to vote. 


11. This is measured as the difference between the number of women working in agriculture 
and the total number of working women, divided by the total number of women over the 
age of 10. 


12. This variable is based on 1890 and later census categories. Data from 1870 to 1880 were 
recoded to be consistent with later classifications. Professional occupations included teachers 
and college professors, musicians and artists, trained nurses, authors, scientific and literary 
persons, clergy and charitable workers, journalists, physicians, and surgeons. For a full report 
of the changing employment patterns of women between 1870 and 1920, see Hill (1929). 


13. We considered a diffusion measure based on bill introduction, roll-call vote, and bill 
passage activity in the surrounding states. However, our search of primary sources suggested 
that legislators were not very aware of the intermediary suffrage activity in other states, 
especially in the early years of the suffrage movement. The records kept by national suffragists 
were not very accurate, as suggested by our own logging of bills recorded in the legislative 
journals. Newspapers did not report the day-to-day workings of the legislature. Newspaper 
accounts and debates were more likely to reference the success or failure of a referendum 
than the success or failure of any given bill. 


14. One reviewer also noted that this effect could indicate the extent to which woman 
suffrage was gaining legitimacy among lawmakers. Many bills introduced in a single session 
suggest that lawmakers are already predisposed to vote favorably for the woman suffrage 
amendment. 
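The measure defined in note 11 can be written out as a short calculation. This is a minimal sketch under the definition given there; the function name and the counts in the example are hypothetical, not census figures.

```python
def nonagricultural_share(total_working_women, women_in_agriculture, women_over_10):
    """Share of women over age 10 who work outside agriculture (definition in note 11)."""
    if women_over_10 <= 0:
        raise ValueError("women_over_10 must be positive")
    return (total_working_women - women_in_agriculture) / women_over_10


# Hypothetical counts for one state-year, for illustration only.
share = nonagricultural_share(
    total_working_women=20_000,
    women_in_agriculture=5_000,
    women_over_10=100_000,
)
print(f"{share:.2%}")
```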
Finding a Job in the Internet Age* 


CHRISTINE FOUNTAIN, University of Washington 





Abstract 


Internet job searching offers an empirical setting in which to explore the trade-off 
between quality and quantity of information. As both firms and workers turn to 
the internet for unprecedented amounts of labor market information, it is unclear 
whether this makes the matching process more efficient. Using longitudinal data on 
two samples of unemployed job searchers in 1998 and 2000, I establish the rise of 
the internet as a job-search strategy and assess whether searching online increased 
the short-term probability of finding a job. Results suggest the internet’s contribution 
to an unemployed searcher’s information pool may afford a small advantage only 
to the extent that other job searchers are not using it, perhaps due to an inverse 
relationship between search and screening costs in a two-sided market. 


Disparities in labor market outcomes are due not only to the individual and 
human capital characteristics of workers and the requirements and rewards 
of jobs but also to the process through which workers are matched with jobs 
(Granovetter 1981; Sorensen and Kalleberg 1981). This matching process depends 
crucially on the availability of information, which is costly but vital for both 
workers and employers. Since neither party has perfect information, much 
theoretical (Granovetter 1973, 1991; Montgomery 1991, 1992) and empirical 
(Granovetter [1974] 1995; Lin, Ensel, and Vaughn 1981; Mouw 2003) work has 
focused on examining the way that workers and employers find out about one 
another. In sociology, much of the research has pointed to the importance of 
social networks in solving this information problem. 

The proliferation of thousands of internet sites containing millions of job 
postings and resumés, not to mention online classified advertisements and job 
postings on corporate Web sites, indicates that perhaps the internet is a new social 
space through which job information can flow. Since electronic technology makes 
it cheap and easy for employers to target a larger number of potential applicants 
and allows job searchers to find and apply for a larger number of vacant positions 
than with traditional methods, neoclassical economic theory suggests that this 
additional low-cost information should better prepare actors to transact in the 
market by more closely approximating the ideal of perfect information. The 
question is: Does online job searching help workers and employers make faster 
and better matches? 

There is reason to believe that the expansion of online job information may 
bring with it new problems. The predicament of having too much information is 
well known; just as the job searcher must devote a substantial amount of time and 
effort to finding the handful of promising job leads from the Sunday classifieds, so 
must the employer invest in identifying the qualified applicants from the stack of 
applications. Finding the balance between casting a wide net and snaring qualified 
applicants may be particularly important in order for employers to hire a qualified 
candidate while minimizing recruiting costs. Recruiting more applicants may put 
the firm in a better position to choose a good worker but may also raise the cost 
of screening out those applicants who will not be hired. 
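The tension between a larger applicant pool and higher screening costs can be illustrated with a stylized calculation. This is not a model from the article: it assumes that applicant quality is independent and uniform on [0, 1], so that the expected quality of the best of n applicants is n / (n + 1), and that each application costs a fixed amount to screen. Both function names and the cost values are hypothetical.

```python
def expected_net_payoff(n_applicants, screening_cost):
    """Expected quality of the best of n applicants minus total screening cost.

    Assumes applicant quality is independent and uniform on [0, 1], so the
    expected maximum of n draws is n / (n + 1).
    """
    if n_applicants < 1:
        raise ValueError("n_applicants must be at least 1")
    return n_applicants / (n_applicants + 1) - screening_cost * n_applicants


def best_pool_size(screening_cost, max_applicants=1000):
    """Pool size between 1 and max_applicants with the highest expected net payoff."""
    return max(
        range(1, max_applicants + 1),
        key=lambda n: expected_net_payoff(n, screening_cost),
    )


# Hypothetical per-application screening costs.
for cost in (0.01, 0.001):
    print(cost, best_pool_size(cost))
```

Under these assumptions the payoff rises and then falls with pool size, and cheaper screening shifts the preferred pool upward; the gain from each additional applicant shrinks while the cost of screening does not.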

Using the empirical example of internet job searching, this article explores 
the problem of too much information in the labor market. In this study, I use 
samples of unemployed workers in 1998 and 2000 to (1) assess the degree to 
which using the internet is associated with increased likelihood of finding a job 
and (2) consider how this relationship changed as internet use became widespread. 
Linking longitudinal microdata files from the Current Population Survey (CPS) 
on internet usage and labor force participation, I examine evidence about the 
extent to which workers use the internet to search for jobs. After establishing 
a general rapid increase in use of the internet, I examine how the internet fits 
into unemployed job searchers' strategies, how unemployed internet users and 
searchers differ from nonusers, and how this has changed over a short period of 
time. I finally consider whether unemployed individuals who use the internet are 
advantaged in the short-term probability of finding a job and discuss how this 
might be related to information constraints for both workers and employers. 


Background 


Information, the crucial link between workers and jobs in a labor market, is 
neither uniformly distributed nor costless. Searchers need to manage carefully 
their strategies, given their particular mix of preferences, skills, resources, time, 
and social/human capital (Halaby 1988). Individuals read, answer, and place 
advertisements; consult firms' vacant job postings; use public and private 
employment agencies; attend job fairs; “pound the pavement” filling out 
applications and delivering unsolicited resumés; use school, professional, or union 
employment resources; ask for help from friends, family, and acquaintances; 
and keep their ears open for attractive offers. Like workers, employers need to 
transmit and gather information in order to fill vacancies in their firms. Firms 
publicize their vacancies and signal characteristics of the vacant job, such as wages, 
working conditions, responsibilities, and opportunities for advancement, while 
at the same time collecting information about available workers. Increasingly, 
workers and employers are turning to the internet, both as an information 
source in itself and as a way to access and transmit information that could also 
be found offline. The two-sidedness of a labor market means that the strategies 
and effectiveness of workers' and firms' searching are quite interdependent and 
should not be examined in isolation. 


THE ECONOMICS OF INFORMATION 


A simple economic model of labor market searching suggests that if individuals 
have cheap access to more information about vacant jobs, they will be able to 
consider more potential jobs and thus find a better job or be more likely to 
find a job. Productivity should rise, along with wages, and workers should have 
less incentive to leave their jobs. Thus, as search costs are reduced, benefits to 
individuals and to society should accrue as workers are hired for jobs for which 
they are better matched (McCall 1970). 

In a two-sided model of a labor market, however, both workers and employers 
are gathering and evaluating information; workers are rarely able to pick and 
choose from a set of job offers without first making their productivity and 
availability known to employers. High productivity workers would like to inform 
employers of their quality and use signals such as educational credentials and 
resumés to transmit the evidence. When feasible, it may be profitable for low- 
productivity workers to adopt the signal as well to encourage employers to hire 
them at higher wages (Spence 1973). When this occurs, the signal becomes 
meaningless to the employer because it no longer helps discriminate high- from 
low-productivity workers. 

For employers, the decision to hire is risky; they are in essence buying a lottery 
ticket. With limited information on which to judge the productivity of a new 
worker, the employer may find they have made a right or wrong choice; either 
way, the often-substantial expenses of screening, hiring, training, and evaluating 
are sunk (Spence 1973, 1974). Thus, neoclassical economic theory would suggest 
that the more information the employer can gather about potential workers 
before the hiring decision, the more accurate the decisions will be. Further, this 
implies that in order to reduce risks, employers would prefer to hire workers 
about whom they have more information. 

In a world where information is costless, employers who can gather the 
most information about potential workers should be best able to identify the 
best match. However, both the acquisition and interpretation of information 
can be quite costly, and crucial information with the depth and detail required 
to choose the "best" candidate might be difficult to convey through resumés 
and application forms. In addition to minimum qualifications, employers 
are interested in knowing a worker's reliability, ability to work with existing 
employees, personality, and many other things that are not easily quantifiable 
on paper and can only be determined through interviews, if at all. Resumés and 
applications can serve as initial screens, but choosing an appropriate candidate 
from among those who have the minimum, easily observable qualifications 
requires much more intensive, and expensive, scrutiny. Some of this information 
can be gathered through informants, trusted by the employer, who are familiar 
with the applicant's suitability for the job. In this case, employers might prefer 
candidates who are referred to them because some relatively reliable information 
can be gathered through personal contacts, reducing the need to conduct screens 
and thus lowering the cost of recruiting. Without this rich information, a labor 
market is a classic “lemon” market, since workers know more about their own 
quality than does the employer (Akerlof 1970). 


SOCIAL SOLUTIONS TO THE INFORMATION PROBLEM 


Granovetter's influential 1974 study of the way that male professional, technical, 
and managerial (PTM) workers get jobs outlines a way that job seekers might find 
the balance between quality and quantity of information. His research showed 
that informal methods of job searching (e.g., word of mouth) were much more 
successful than formal methods (e.g., classified advertisements, employment 
agencies), and led to jobs with higher pay, better working conditions, and greater 
job satisfaction (Granovetter [1974] 1995). Social ties lead to better jobs not only 
because friends and family have a strong incentive to help, but also because the 
personal relationships may signal additional information or influence to employers. 
Perhaps surprisingly, informal searches that use weak ties such as acquaintances 
rather than strong ties such as close friends and family were the most effective 
because they gathered a larger amount of information. The crucial insight here 
is: given that the information transmitted through social connections is better 
than through formal job postings, weak ties maximize the amount of personal 
information that can be gathered. 

Several studies (e.g., Bridges and Villemez 1986; Marsden and Hurlbert 1988; 
Mouw 2003) have shown empirically that the strong/weak nature of the tie leading 
to the actual job match is not decisively correlated with occupational attainment 
measured as wages. Although this might seem to contradict the basic strength- 
of-weak-ties argument, Montgomery (1992) showed analytically that having a 
wide network of weak ties predicts higher earnings and labor market success 
even if the accepted job offer comes through a strong tie. Hence, the proper 
focus of analysis is the social network as a resource, rather than the specific path a 
particular piece of job information flowed through. By extension, if the internet
contributes a net advantage (or disadvantage) to a worker's overall information 
pool, then this should affect the worker's labor market outcomes regardless of 
whether that particular worker finds his or her job online. 

Many employers also exploit the social capital of their current workers by 
offering bonuses for referring applicants (Marsden 2001; Marsden and Gorman 
2001). Fernandez, Castilla, and Moore (2000) investigate this practice in a 
customer service phone center for a large financial organization. They show that 
although there is little evidence that referral applicants make better post-hire 
employees than nonreferrals, they are significantly more likely to make it past the 
initial screen to the interview stage and to be offered a job. This practice results 
in significant cost-savings to the firm in recruitment costs, net of the financial 
bonus given to the referring employee, because fewer interviews are required to 
produce an additional hire at the same quality level. Although these results may 
not be generalizable to other employment settings, they do provide evidence of 
the organizational processes that might give firms a financial incentive to prefer 
applicants who have been referred. 


INFORMATION SCARCITY AND THE MATCHING PROCESS 


Against this background, we can interpret the frequently observed reliance on 
personal ties and contact networks for job searching and hiring as a response 
to the scarcity of good information. Workers would like to use the method 
that brings them the most information that other workers do not have access to. 
Although Granovetter and subsequent researchers have focused on the breadth 
of the pool of information, an important facet of information gained through 
social ties is its exclusiveness: it is not available to everyone who might be able 
to make use of it. One could find a large amount of information about available 
jobs by browsing classified ads and employment bulletins, but one's application 
would then be in competition with every other person who had seen this ad 
(potentially every job searcher, since it is publicly available information) and 
applied for the job. 

The true advantage of searching through social ties (in terms of information, 
not influence) is that it might reduce the overall cost of searching by (1) screening 
out useless or extraneous information and (2) reducing competition. Since one’s
friends and acquaintances presumably know something about one’s qualifications 
and occupational aspirations, they are more likely to share information 
about jobs that one might actually apply and be hired for. In addition, since 
information gathered in this way might be spread through contact networks 
rather than broadcast to everyone, a smaller number of people have access to the 
information, and so one may have a better chance of being hired (although the 
other implication of this is that the smaller number of competitors might actually 
be more qualified). Further, the existence of a social tie between the potential 
worker and the job or firm may be signaling to the employer that the quality of
information about the worker might be better or more reliable. 

Similarly, the utility for an employer of recruiting through social ties comes as 
a result of reducing the cost and risks of hiring. Fernandez, Castilla, and Moore 
(2000) showed that the cost savings from recruiting through current employees’ 
social networks came not from increasing the volume of applicants (this might 
actually be more expensive) but rather by raising the average quality (or simply 
reducing the variance) of applicants so that fewer applications have to be screened 
in order to hire an acceptable worker. Homophilous social networks, in addition 
to containing relationships of trust, obligation, and concern for reputation, result 
in referral applicants being overall “better matches” than nonreferrals. Social 
networks further reduce the risks faced by employers by giving them a basis 
on which to judge the quality of the information since they can learn about an 
applicant from a third party, or at least know that the applicant is known to 
someone they know. 

Thus, workers and jobs may face a trade-off between quality and quantity 
of information. Methods such as advertisements might bring in large amounts 
of lower-quality information for both searchers and employers, while personal 
contacts bring in fewer potential matches but more trustworthy information. 
Being able to reject more inferior alternatives comes at higher screening costs, 
while relying on higher quality information leaves fewer choices but lower overall 
screening cost. 


THE RISE OF THE INTERNET


In recent years, the internet has been rapidly incorporated into the information 
gathering and signaling strategies of workers and employers. Figure 1 depicts the 
growth in internet use among the U.S. population age three and above between 
1997 and 2001. Internet users doubled as a proportion of this population 
between 1997 and 2000 and became a majority by 2001. Internet use is even 
more common among adults of working age. In August 2000, over 40% of U.S.
households had internet access, and 55% of working-age adults used the internet
(U.S. Department of Commerce 2000). 

A major way Americans use the internet is to search for job information. A 
recent Bureau of Labor Statistics paper evaluated not only the extent to which 
Americans use online job-search resources but also the correlates of internet 
job searching with data from the December 1998 Computer and Internet Use 
Supplement to the Current Population Survey (Kuhn and Skuterud 2000). 
Interestingly, 13% of unemployed and 7% of employed Americans used the
internet to search for jobs. This indicates that the internet is becoming an
increasingly important source of information about jobs, at least as important 
as any single offline method, for those with internet access. Furthermore, Kuhn 
and Skuterud (2000) found that unemployed job seekers who use the internet 
are also more likely to use traditional job-search strategies, suggesting that people
who use the internet to search for jobs are a selected sample of those who search
more intensely than others.

Figure 1. U.S. Internet Use Among Persons Age Three +, 1997-2001

Between the end of 1998 and mid-2000, use of the internet for any reason, 
especially for job searching, grew dramatically. This increase is evident across age 
levels, sex, racial groupings, as well as occupational categories and employment 
status. Figure 2 depicts the increase in use of the internet, e-mail, and online 
job searching for all, employed, and unemployed workers age 25-65. Internet
and e-mail are used more frequently by the employed than the unemployed, but 
the unemployed are more likely than the employed to use the internet to search 
for jobs. During the 20 months between these two surveys, internet use among 
the unemployed increased by over 50%, from 26.2% of unemployed workers in 
December 1998 to 40.7% of the unemployed in August 2000. Significantly, the 
unemployed increased their online job searching more rapidly than they increased 
internet use and e-mail use in general. In 1998, 8% of employed workers and 
13% of unemployed workers were using the internet to search for jobs, while in 
2000 these figures had risen to 12% and 25%. Use of the internet to search for
jobs almost doubled during the 20 months between these surveys, which is quite
a dramatic rise for a short period of time.

Figure 2. Proportion of Workers Using the Internet for Selected Purposes
by Employment


ONLINE JOB-SEARCH RESOURCES 


There are thousands of internet sites designed to mediate the process of matching 
workers with jobs, not including online classified ads and job postings on 
corporate Web sites (Kuhn and Skuterud 2000). Many of the largest and most
successful of these sites (e.g., hotjobs.com, monster.com) are for-profit internet 
firms that provide both information to job searchers as well as full-service 
recruiting and screening services for employers. Individual job searchers can 
search online databases of job postings to find appropriate openings and submit 
resumés in a standard format either to specific vacancies or the general resumé 
bank. This standardized format allows the sites' client firms to search and evaluate 
many applicants, and also allows applicants to apply for many jobs online with 
little additional effort. 

In addition to Web sites specifically designed to match workers with jobs, there 
are also many other ways to find job information online. Most major newspapers 
publish their classified advertisements online and often include tools that make 
it easy to search and screen out jobs for which the worker is not qualified or 
interested. Large firms and organizations frequently post job openings on their 
corporate Web sites, including online contact information so that resumés or 
applications can be submitted via e-mail. This type of information is available
but less easily obtained offline and once would have required a physical trip to a 
company’s offices to view the job listings. In addition to actual vacancy listings, the 
internet can be used to gather useful information about potential jobs, including 
firms' corporate atmosphere, financial stability, benefits, and job characteristics 
from sources such as the business press or the firm's own Web site. Finally, just as 
they do offline, both workers and employers can use their online social contacts 
to exchange labor market information. 


ECONOMICS OF INFORMATION AND THE INTERNET 


Searching for a job on the internet gives workers and employers cheap access to 
and the ability to sort through unprecedented amounts of information about 
each other. It may also serve as a way to signal potential employers that a worker 
is proficient in using the internet or has other desirable qualities thought to be 
associated with internet proficiency. If internet skills are in demand, the very act 
of finding and applying for a job online might give an employer information 
that would otherwise be difficult to obtain before the hiring decision—that the 
applicant is a competent user of computers and the internet. Even for jobs that 
do not require internet use, this might serve as a proxy signal for other qualities 
that might be associated with internet use (at least in its early stages), such as 
education, resourcefulness, or technical skills. A National Bureau of Economic
Research study, in which false resumés were tailored to real job vacancies in order 
to assess the effect of names and addresses with racial associations, found that 
net of other variables, resumés containing the applicant's e-mail address were 
more likely to elicit callbacks, suggesting that employers screen in part on the 
basis of this signal (Bertrand and Mullainathan 2003). Of course, as the internet
becomes broadly used by more segments of the population, its cost declines and 
it becomes a worse proxy signal for qualities held by only a few. 

The downside of reducing the cost of information for workers is that it could 
result in excess applications for employers. The same characteristics that make 
the internet attractive to searchers—low cost of information—may also make it 
less trustworthy for screeners. The fact that applying for jobs is so quick and easy 
makes it possible for applicants to apply, with just a mouse-click, for jobs for 
which they are not fully qualified, jobs that they have no intention of accepting, 
and even to falsify information on their resumés. Autor (2001) points to this 
difficulty and cites evidence that some employers report receiving many excess 
applications from online job posts, including redundant resumés and those from 
underqualified and overqualified candidates. In addition, the rich information 
that is crucial to the screening process but is not available through resumés is also 
absent from online applications. Thus, the jobseeker's abundance of information 
paradoxically creates a scarcity of reliable information for the employers. 


HYPOTHESES


There are several reasons to expect that using the internet might give job searchers 
an advantage over nonusers. First, any selectivity of the user pool might make 
internet users more employable in some way than nonusers. Second, the internet 
may reduce the search costs, increasing the searchers' job opportunities. Third, 
this additional information may be more valuable if only internet users, and not 
the general population, have access to it. Fourth, use of the internet to find a job 
may serve as a signal to employers that the applicant has useful skills or other 
characteristics thought to be associated with internet use. 

However, these advantages are all dependent on the fact that only a minority 
of job searchers are using the internet. As internet job searching becomes the rule 
rather than the exception, there are also good reasons to expect these advantages to 
decline. When the user pool becomes less “selected,” the employability of internet 
users should more closely resemble the general population (although perhaps the 
pool of nonusers might become more adversely selected). This line of reasoning 
suggests the following hypotheses, which test the extent to which internet job 
searchers are a small and select segment of the job-searching population, and
how this changes as internet use becomes more common: 


Hypothesis 1: Unemployed internet users will have more (measurable) 
human capital than nonusers. 


Hypothesis 2: The differences (in observable characteristics) between these 
groups will diminish as internet use becomes more widespread. 


Further, although users will still have access to large amounts of information, 
this information will be less valuable since it will be less exclusive. Although early 
adopters of the internet may derive a job-search advantage, this advantage should 
decline as internet job searching becomes widespread. In fact, firms’ problem of 
judging information quality might even place internet searchers at a disadvantage. 
If this is the case, with time and internet penetration, any positive impact of 
internet searching should decrease, and any negative impact should increase. 

Hypothesis 3: When internet use is uncommon, unemployed people who
search online should have an advantage in finding jobs.


Hypothesis 4: As use of the internet for job searching increases, any net 
advantage for users over nonusers should disappear. 


Analysis 


Although the theory from which my hypotheses are derived treats the labor 
market as a two-sided matching process, in this study I examine only the labor 
market outcomes of job seekers. However, the logic of this analysis is based on a 
model of a two-sided market, which implies empirical predictions for job seekers
that depend in part on the information constraints on firms. That is, if not for the 
screening problems that emerge when employers must sort through multitudes 
of applications, the internet should be uniformly useful to job searchers. 

Aside from any direct effect of using the internet to search for jobs, it is 
possible that internet users and job searchers differ in their employability in some 
way from nonusers. In this case, we might expect that any positive or negative 
association between internet use and the probability of finding a job is spurious 
due to both variables' correlations with worker characteristics. In order to assess 
selectivity into internet job searching in both time points (hypotheses 1 and 2), 
the first step in this analysis estimates models of the log odds of using the internet 
and using the internet to search for jobs. In order to control for the observable 
characteristics that may be associated with both using the internet and finding 
a job, the significant variables in these models are included in the final models 
predicting the log odds of finding jobs. 

Next, to establish whether or not searching online affords an advantage in 
the odds of finding a job within four months, I estimate two logistic regression 
models on unemployed workers in both samples. An additive model includes all 
individual-level characteristics and job-searching activities, characteristics of the 
unemployment spell, and characteristics of the overall economic environment, as 
well as a term for sample year, which is intended to capture the effects of time/ 
internet penetration. In addition, I estimate a second model with two additional 
interaction terms. These interactions allow me to test whether the effect of using 
the internet to search for jobs had a different payoff in 1998 than in 2000, net of 
the control variables (hypotheses 3 and 4). Finally, I use this interactive model to 
illustrate the practical payoff in probability of finding a job for a typical worker in 
each year. All of these analyses model the likelihood that an unemployed worker 
will find any job within four months; the natural next step would be to assess the 
quality of those jobs. Unfortunately, the data used for this study are insufficient 
to address this question satisfactorily; however, a brief analysis of the wages of 
workers who found jobs can be found in Appendix B. 
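
The interactive specification described above can be sketched in a few lines of Python. This is only an illustration of how the interaction terms enter the log odds, not the code used for the analysis; the function names, the dictionary keys, and the example coefficients are all assumptions made for exposition, and the control variables are collapsed into a single term.

```python
import math

def linear_predictor(coefs, use_net, search_net, year2000, controls=0.0):
    """Log odds of finding a job under an interactive logit specification.

    coefs: dict with hypothetical keys 'const', 'year2000', 'use', 'search',
           'use_x_2000', and 'search_x_2000'.
    use_net, search_net, year2000: 0/1 dummy variables.
    controls: summed contribution of all other covariates (assumed given).
    """
    return (coefs["const"]
            + coefs["year2000"] * year2000
            + coefs["use"] * use_net
            + coefs["search"] * search_net
            # Interaction terms: each internet dummy multiplied by the year dummy.
            + coefs["use_x_2000"] * use_net * year2000
            + coefs["search_x_2000"] * search_net * year2000
            + controls)

def probability(log_odds):
    """Convert log odds to a probability with the logistic function."""
    return 1.0 / (1.0 + math.exp(-log_odds))

# Made-up coefficients for illustration only (not estimates from this study).
example = {"const": 0.0, "year2000": 0.1, "use": -0.5, "search": 1.0,
           "use_x_2000": 0.5, "search_x_2000": -1.0}

# Effect of searching online on the log odds, by year, for an internet user.
effect_1998 = linear_predictor(example, 1, 1, 0) - linear_predictor(example, 1, 0, 0)
effect_2000 = linear_predictor(example, 1, 1, 1) - linear_predictor(example, 1, 0, 1)
```

In this sketch the payoff to online searching in 1998 is the main-effect coefficient alone, while the payoff in 2000 is the main effect plus the interaction coefficient, which is the comparison that hypotheses 3 and 4 turn on.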


THE SAMPLE 


The Current Population Survey (CPS) is a large, representative sample of 
about 50,000 U.S. households conducted monthly by the U.S. Census Bureau 
for the Bureau of Labor Statistics. The CPS contains demographic data on all 
household members and employment status and labor market information 
for all household members over age 15. An underused feature of the CPS is its 
short-term longitudinal component, which makes it possible to track the labor 
force transitions and employment status changes for a representative sample of 
the U.S. work force over four consecutive months. I match microdata files from 
the December 1998 and August 2000 Basic Monthly Survey and the Computer 
and Internet Usage Supplements (which gathered data on internet access
and usage) with individual labor force participation data from the three months
following the Internet supplements (Bureau of Labor Statistics 1998, 2000a). I
create two representative samples of unemployed workers age 25-65 in December
1998 (N=316) and August 2000 (N=346) who participated in an Internet Use
Supplement survey in either the first or fifth month of their CPS panel and were 
tracked for the entire subsequent four-month period. The sample used here is 
fairly small in spite of the large CPS sample size mainly because the rotating 
panel structure requires the exclusion of three quarters of the cases (not counting 
attrition) for the longitudinal match, and because the unemployment rate was so 
low during this period. See Appendix A for a description of the sample selection 
procedure and discussion of representativeness. 


VARIABLES AND MEASUREMENT 


Finding Jobs 


The main outcome of interest is whether the respondent found a job within the 
three months following the baseline observation. Fifty-two percent of the 1998
sample and 61% of the 2000 unemployed sample found jobs during the three 
months following the first observation. I include several categories of independent 
variables in my analysis of employment transitions for unemployed CPS 
respondents: internet penetration, characteristics (and activities) of individuals, 
characteristics of the unemployment spell, and characteristics of the overall 
economic environment in which the transitions are made. Table 1 provides 
descriptive data. 


Internet Penetration 


I use year in my models (2000 versus 1998) as a proxy for degree of internet 
penetration. By comparing these two periods I am able to explore how the 
impact of the internet on job searching changed over time as the internet became 
pervasive.


Characteristics and Activities of Individuals 


I include several standard demographic variables as controls, including dummy 
variables for race (non-Hispanic whites versus all others) and sex. Age in years
is modeled as a quadratic formulation. I include two separate indicators of 
education in order to capture both linear and threshold effects of education 
on the probability of using the internet and on job-search success. Household 
income, in $10,000 bands, is also included in order to control for the effect of 
economic resources on access to computers and the internet. About 13% of the
cases were missing data on this variable, so the mean has been imputed and an
indicator variable is also included in the model.

Table 1. Description of Key Variables

                                                       Mean (S.D.)
Variable description                           1998 (N=316)      2000 (N=346)
Age (years)                                    40.07 (9.82)      40.84 (9.96)
Nonwhite (or Hispanic white)                     .38 (.49)         .39 (.49)
Female                                           .52 (.50)         .54 (.50)
Years of education                             12.41 (2.92)      13.00 (2.67)
Has at least four-year college degree            .14 (.35)         .21 (.41)
Household income (median category)           $30,000-$39,999   $30,000-$39,999
MSA/state unemployment rate for month           4.45 (2.45)       4.19 (2.25)
  one of observation period
On layoff                                        .18 (.39)         .20 (.40)
Weeks unemployed at month one                  16.94 (24.36)     13.48 (20.56)
Searched for job on school, professional,        .04 (.20)         .04 (.20)
  or union register
Consulted a public or private job agency         .24 (.43)         .29 (.42)
Sent application or resumé directly to an        .70 (.46)         .69 (.46)
  employer or made direct contact
Asked friends and/or family                      .14 (.35)         .13 (.34)
Placed or answered an ad                         .15 (.36)         dii (.32)
Read ads without applying                        .25 (.43)         2g (.41)
Other methods                                    .06 (.23)         .05 (.22)
Search intensity: total number of (offline)     1.90 (1.36)       1.77 (1.40)
  methods used
Used internet or e-mail for any reason           .26 (.44)         .41 (.49)
Used internet to search for jobs                 .13 (.34)         .25 (.44)
Found a job within four months                   .52 (.50)         .61 (.49)

I have created seven dummy variables from CPS items to represent meaningful 
categories of search strategies for these job searchers: using school or professional 
employment bulletins, using public or private employment agencies, contacting 
the employer directly, asking friends or family, placing or answering classified 
ads, reading but not answering ads, and "other" search strategies. An additional 
variable summarizes the intensity of the search by summing the number of 
different strategies the respondent claims to have used in the past month. To 
examine the independent effects of internet use and internet job searching, I 
use two dummy variables that indicate whether the respondent uses the internet 
for any purpose and whether the respondent uses the internet specifically to 
look for information about jobs. Both are included in order to disentangle the
independent effects coming from any actual advantage due to using the internet 
in a job search from those due to simply being a person with computer and 
internet skills. Note that while everyone who searches online necessarily uses the 
internet, not everyone who uses the internet uses it to search for jobs, so there 
is some overlap between these variables. 
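
As a rough illustration of this coding scheme, the following Python sketch builds the seven strategy dummies, the offline search-intensity count, and the two internet dummies for a single respondent. The labels in `OFFLINE_METHODS` and the function name `code_search_variables` are hypothetical stand-ins, not the actual CPS item names, and the sketch assumes the respondent's reported strategies are already available as a set of labels.

```python
# Hypothetical labels for the seven offline search strategies.
OFFLINE_METHODS = ["bulletin", "agency", "direct_contact", "friends_family",
                   "placed_or_answered_ads", "read_ads", "other"]

def code_search_variables(methods_used, uses_internet, searches_online):
    """Return dummy variables and the offline search-intensity count.

    methods_used: set of labels from OFFLINE_METHODS reported by a respondent.
    uses_internet, searches_online: booleans from the internet supplement.
    """
    if searches_online and not uses_internet:
        # Online searchers are by definition internet users.
        raise ValueError("searching online implies using the internet")
    # One 0/1 dummy per offline strategy.
    row = {m: int(m in methods_used) for m in OFFLINE_METHODS}
    # Search intensity: number of distinct offline strategies reported.
    row["search_intensity"] = sum(row[m] for m in OFFLINE_METHODS)
    row["internet_use"] = int(uses_internet)
    row["internet_search"] = int(searches_online)
    return row

# A respondent who used an agency and read ads, and uses the internet
# but not for job searching.
respondent = code_search_variables({"agency", "read_ads"}, True, False)
```

Keeping the two internet dummies separate, as in the text, is what lets a model distinguish being an internet user from actually searching for jobs online.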


Economic Environment 


To measure local labor market opportunity, I use the CPS geographic codes to 
match respondents to the unemployment rate in their metropolitan area for the 
month and year of their baseline observation. Respondents who live outside of 
a metropolitan area are considered to be subject to their state unemployment 
rate.


Characteristics of Unemployment Spells 


For unemployed respondents, CPS collects data on the number of weeks 
unemployed. Since this distribution is very right-skewed, in regressions I use 
the natural log of weeks unemployed. I also include an indicator of whether 
the respondent was on layoff, since these unemployed workers may have some 
reason to believe that they will get their old job back and so be both less likely to 
search online and more likely to become employed again during the observation 
period. 


Results 


DO UNEMPLOYED INTERNET USERS DIFFER FROM NONUSERS? 


Table 2 displays the results of logistic regression models predicting the log 
odds of using the internet and the log odds of using the internet specifically to 
search for jobs stratified by sample year. Since the coefficients for each model 
are estimated on different data, they should not be directly compared. However, 
the variables are measured in the same way, so we can interpret any changes in 
the direction, size, and significance patterns, which show that internet users are 
a select group, and to a lesser extent that the predictors of using and searching 
online have changed during the period in question.

These models show that internet users and job searchers do indeed differ 
in important, measurable ways from nonusers. In particular, internet users and 
searchers have more education and are more likely to have a college degree, 
probably in part because these people are more likely to have used the internet at 
their previous jobs. This shows some support for hypothesis 1: that people who
use the internet have more human capital than those who do not. People with
higher incomes are more likely to use the internet and search online, probably 
because they can better afford internet access and computers. An especially 
strong finding shows that even holding education and income constant, whites 
are more likely to use the internet than nonwhites. Those who search more 
intensely are more likely to search online, suggesting that these unemployed are 
either more resourceful or more desperate, or both. In 1998, age had an inverted
U-shaped relationship with internet job searching, which was not significant in 
2000. Although this analysis provides little information about the magnitude 
of the changes, it does provide limited support for hypothesis 2—that between 
1998 and 2000 the pool of internet users and job searchers became less selective 
and more like the general population. In addition, the analysis provides evidence 
about which variables should be included in the substantive model of finding 
jobs in order to control as much as possible for selection effects. 


DOES THE INTERNET HELP PEOPLE FIND JOBS? 


Model 2 in Table 3 displays the results of an interactive model to test whether 
the effect of searching online depended on the year in which the search was 
conducted. Although the interactive model does not fit significantly better than 
the additive model in the first column according to a likelihood ratio test with two
degrees of freedom (p = 0.11), I interpret these estimates because the coefficient
for the interaction between year and searching online shows that the payoff to 
internet searching was significantly different (p=.04) in 2000 than in 1998. 
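
The likelihood ratio comparison can be checked from the -2 log likelihood values reported in Table 3. The sketch below is an illustration rather than the original computation; the function name `lr_test_two_df` is an assumption, and it relies on the fact that for two degrees of freedom the chi-square upper-tail probability has the closed form exp(-x/2).

```python
import math

def lr_test_two_df(neg2ll_restricted, neg2ll_full):
    """Likelihood ratio test for nested models that differ by two parameters.

    For 2 degrees of freedom the chi-square survival function has the closed
    form exp(-x / 2), so no statistics library is needed.
    """
    statistic = neg2ll_restricted - neg2ll_full
    p_value = math.exp(-statistic / 2.0)
    return statistic, p_value

# -2 log likelihoods reported in Table 3 for the additive and interactive models.
stat, p = lr_test_two_df(812.45, 807.95)
```

With these inputs the test statistic is 4.50 and the p-value rounds to 0.11, in line with the value given in the text.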

In 1998, the odds of finding a job for a person who searched online were 164% 
greater than for a person who did not search online, holding the other variables 
constant. In contrast, in 2000, the odds of finding a job for an internet searcher 
were 28% less than for a noninternet searcher, net of the control variables. This 
provides evidence in support of hypothesis 4, since it is apparent that the effect 
of internet searching has changed in both magnitude and direction during this 
time. 
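
The percentage differences in odds quoted above follow from exponentiating the logit coefficients. The short Python sketch below shows the arithmetic using the rounded Model 2 coefficients from Table 3; the function and variable names are illustrative assumptions.

```python
import math

def percent_change_in_odds(log_odds_coefficient):
    """Percent change in the odds implied by a logit coefficient."""
    return (math.exp(log_odds_coefficient) - 1.0) * 100.0

# Model 2 coefficients as reported (rounded) in Table 3.
b_search = 0.97          # internet search: payoff in 1998
b_search_x_2000 = -1.29  # change in the payoff to internet search in 2000

# 1998: main effect only.
change_1998 = percent_change_in_odds(b_search)
# 2000: main effect plus the interaction with year.
change_2000 = percent_change_in_odds(b_search + b_search_x_2000)
```

The 1998 figure comes out at roughly 164% greater odds. With the rounded coefficients the 2000 figure is a reduction of roughly 27%, close to the 28% reported; the small gap presumably reflects rounding in the published estimates.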

Other results show that for noninternet searchers, there was no difference by 
year in the log odds of finding a job, although nonsearchers were more likely to 
succeed in 1998 than in 2000. People who are on layoff are more likely to find a job 
than people who are not, net of the other variables. Further, the log of the number 
of weeks unemployed is significant and negatively predictive of finding a job; each 
additional log week is associated with a 23% reduction in the odds of finding a 
job. None of the individual-level demographic variables are significant, suggesting
that, holding other things equal, age, race, sex, and education do not affect the
odds of finding a job.¹⁰ The local unemployment rate is highly significant and
negative. All else equal, each whole-point increase in the unemployment rate cuts
the odds of finding a job by about 12%. Apparently, the amount of opportunity
available in a local labor market is much more important than demographic
characteristics when it comes to finding a job if one is unemployed.


Table 3. Logistic Regression Models Predicting Odds of Finding a Job
Within Three Months for Workers Unemployed in December 1998
and August 2000

                                         Model 1                                Model 2
Parameter                   MLE          (S.E.)       p            MLE          (S.E.)       p
2000                        .22          [illegible]  [illegible]  [illegible]  (.21)        .42
On layoff                   [illegible]  [illegible]  <.01         [illegible]  (.38)        <.01
Log weeks unemployed        -.26*        (.06)        <.001        -.27*        (.06)        <.001
Age                         .09          (.07)        .19          .07          (.07)        .28
Age²                        .00          (.00)        [illegible]  .00          (.00)        .25
Female                      -.04         (.17)        .81          -.05         (.17)        .78
Nonwhite                    -.15         (.19)        .45          -.15         (.19)        .43
Education (years)           .02          (.04)        .64          .02          (.04)        .59
Four-year college degree    .23          (.32)        .47          [illegible]  (.32)        .65
Household income            .04          (.05)        .34          .05          (.05)        [illegible]
Income imputed              -.41         (.36)        .26          -.40         (.36)        .27
Local unemployment rate     -.13*        (.04)        <.01         -.12*        (.04)        <.01
Search methods
  Employment bulletin       .64          (.47)        .17          .60          (.47)        .20
  Employment agency         [illegible]  (.30)        .09          .49          (.30)        .10
  Direct contact            .47          (.39)        [illegible]  [illegible]  (.40)        .25
  Friend/family             -.04         (.39)        .91          -.05         (.39)        .88
  Placed or answered ads    .81*         (.33)        .01          [illegible]  (.33)        .03
  Passively read ads        .10          (.28)        [illegible]  .05          (.28)        .85
  Other method              .62          (.46)        .18          .69          (.46)        .14
Search intensity            -.25         (.19)        .18          -.23         (.19)        .22
Internet variables
  Internet use              -.43         (.27)        [illegible]  -.90*        (.38)        .02
  Internet search           .19          (.30)        .53          .97*         (.49)        .05
  Internet*2000                                                    .87          (.51)        .09
  Netsearch*2000                                                   -1.29*       (.63)        .04
Constant                    -1.11        (1.53)       .47          -.85         (1.54)       .58
Model LR χ² (df)            95.99        (22)         <.001        98.49        (24)         <.001
-2 Log likelihood           812.45                                 807.95

N = 662
* p < .05
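
The percentage statements in this section follow from exponentiating the logit coefficients in Table 3. The short sketch below is an illustration of that arithmetic, not part of the original analysis. It assumes the main effects are read from Model 1 and the internet-search effect for 2000 from Model 2 (main effect plus year interaction); the function and variable names are hypothetical. Because the published coefficients are rounded, the results only approximate the figures quoted in the text (for example, about 27% rather than 28% lower odds for internet searchers in 2000).

```python
import math


def pct_change_in_odds(log_odds_coefficient: float) -> float:
    """Percent change in the odds implied by a logit coefficient."""
    return (math.exp(log_odds_coefficient) - 1.0) * 100.0


# Coefficients as printed in Table 3 (Model 1; an assumption about which
# column the text relies on for these main effects).
log_weeks_unemployed = -0.26
local_unemployment_rate = -0.13
placed_or_answered_ads = 0.81

# Model 2: internet search main effect and its interaction with year 2000.
internet_search = 0.97
netsearch_x_2000 = -1.29

# Effect of internet searching in 2000 = main effect + interaction.
internet_search_2000 = internet_search + netsearch_x_2000

print("Log weeks unemployed:", round(pct_change_in_odds(log_weeks_unemployed), 1), "%")
print("Local unemployment rate:", round(pct_change_in_odds(local_unemployment_rate), 1), "%")
print("Placed/answered ads, odds ratio:", round(math.exp(placed_or_answered_ads), 2))
print("Internet search in 2000:", round(pct_change_in_odds(internet_search_2000), 1), "%")
```

The same function can be applied to any other coefficient in the table; a value of zero returns a 0% change, which is a quick sanity check on the conversion.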

Although certain specific methods seem to increase the odds of finding a 
job, the sheer number of search methods does not help, possibly because of 
adverse selection (i.e., those with the most difficulty finding employment are 
searching harder). There is a significant positive effect of placing or answering 
advertisements. In fact, this effect is quite large: placing or answering ads more
than doubles the odds of finding a job. Although much of the literature suggests
that people who must resort to these sorts of "formal" search methods are at a 
disadvantage in the labor market, members of this sample who use this method 
are more likely to find a job, controlling for other factors. Of course, this says
nothing about the quality of the match or the rewards of the jobs; we cannot
tell from this analysis if the jobs are low-paying or temporary. The coefficient for
asking friends and relatives for help is nonsignificant and negative, which runs
counter to the findings of Granovetter (1995)¹¹ and others who show that job
searchers who rely on personal contacts rather than "formal" searching methods
find better jobs than those who do not, but is similar to the results of Mouw
(2003) showing that those who use contacts to search for jobs do not have higher
wages or shorter unemployment spells.

To illustrate the practical implication of the effects of using the internet to 
find a job, Figure 3 presents the fitted probabilities of finding a job with and 
without internet job searching, calculated for an unemployed, white, high school 
graduate male who is at the typical or baseline value for all model characteristics 
but varied in internet job searching at each time point. This shows that in 1998, 
internet searchers actually had a higher probability (.59) of finding a job than 
did nonsearchers (.54). Thus, searching online gave the unemployed a slight
(about 5 percentage point) boost in probability of finding a job over those who did not. Although
the difference is small, this provides limited support for hypothesis 3, that at least 
in December 1998, using the internet to search for jobs might have given one 
a slight advantage in probability of finding a job. In contrast, in 2000, internet 
searchers were actually nearly 8% less likely to find a job than noninternet 
searchers. This suggests that the internet may have afforded a small advantage 
in odds of finding a job in 1998, but a small disadvantage in 2000 (see Appendix 
B for a brief discussion of the effects of using the internet in these years on the 
logged weekly wages of those who found jobs). 
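
As a rough illustration of how the fitted probabilities reported above relate to odds, the sketch below converts the two 1998 probabilities from the text (.59 and .54) into a percentage-point gap and an odds ratio, and shows the inverse-logit transformation that turns a linear predictor into a fitted probability. It is not the author's code; the function names are hypothetical, and because the "nonsearcher" probability is a weighted average across two groups, the implied odds ratio is not expected to match any single coefficient in Table 3.

```python
import math


def logit(p: float) -> float:
    """Log odds corresponding to a probability."""
    return math.log(p / (1.0 - p))


def inverse_logit(x: float) -> float:
    """Fitted probability corresponding to a linear predictor (log odds)."""
    return 1.0 / (1.0 + math.exp(-x))


def odds(p: float) -> float:
    """Odds corresponding to a probability."""
    return p / (1.0 - p)


# Fitted probabilities for 1998 as reported in the text.
p_search_1998 = 0.59
p_nonsearch_1998 = 0.54

# Gap in percentage points, and the odds ratio implied by the two values.
gap_in_points = (p_search_1998 - p_nonsearch_1998) * 100.0
implied_odds_ratio = odds(p_search_1998) / odds(p_nonsearch_1998)

print("Gap (percentage points):", round(gap_in_points, 1))
print("Implied odds ratio:", round(implied_odds_ratio, 2))
print("Round trip:", round(inverse_logit(logit(p_search_1998)), 2))
```

The distinction matters for reading the results: a gap of five percentage points in probability is not the same quantity as a 5% change in the odds.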


Discussion 


This article investigates the utility of the internet-based job search in finding a 
job among the unemployed. I find that in 1998 people who searched for jobs on 
the internet were a small and select segment of all unemployed job searchers. Not 
many people had access to the internet, and even fewer used it to aid in their job 
search. Those who did were more educated and more likely to be white than other 
unemployed people. Even net of these human capital advantages, they were more 
likely to find a job within the four-month observation period than noninternet 
searchers, suggesting that using the internet may have increased their chances 
of finding a job. However, by mid-2000, many more unemployed job searchers
had incorporated the internet into their search, and these people, although still
a select group, were much more diverse and representative of the unemployed
population at large than had previously been true. Yet as the internet became
more frequently used, its effectiveness declined.


Figure 3: Predicted Probabilities (from the Model in Table 4) for Finding a
Job With and Without Searching Online, 1998 and 2000

Note: These probabilities are calculated for white male high school graduates in the median income
range who searched using ads and friends/relatives, with other covariate values at the pooled sample
mean or reference value for all variables except sample year and internet searching. The probability
for the “nonsearcher” group was calculated as a weighted average of probabilities for those who do
not use the internet at all, and those who use it but not for job searching for that year.


I have proposed two main explanations for why internet searching may have 
afforded advantages in 1998 that had all but disappeared by 2000. First, if only 
a small proportion of all unemployed people were searching online, then any 
information that could only be found there was limited in its dissemination, 
and so those with access to it had an advantage in being hired for those jobs. 
This is an advantage that comes from having access to more information that 
others, possibly just as qualified, do not have. Second, when only a minority 
of unemployed workers uses the internet, the fact that they applied through 
information found online might be a signal to employers that these potential 
hires are particularly technologically savvy and/or more resourceful than other 
applicants. This advantage would come as a result of the additional information 
available to the employer about the perceived productivity of the worker. In both 
explanations, the relatively small proportion of internet searchers is the key to the 
advantage. By August of 2000, however, the proportion of unemployed people 
who searched for job information online had almost doubled (see Table 1 or 
Figure 2), and so the source of this advantage may have disappeared.

A third potential explanation, of course, is that there is some unobserved
quality making workers more hirable that was more correlated with internet 
searching in 1998 than in 2000, net of the control variables. This quality could 
be considered productivity or resourcefulness, but it may be that workers who 
conduct better searches and are more capable or clever were among the early 
adopters of the internet, and that it is their productivity that caused them to 
be hired at greater rates, not searching online per se. Even in this case, internet 
searching may be serving as a signal to employers of this quality, which is difficult 
to judge in prospective employees. And as the internet began to be used by most 
unemployed searchers, not just the most capable and clever, this signal conveyed 
less information to employers. A similar impact could occur if the group of 
people searching online were more adversely selected compared to nonsearchers 
in 2000 than they were in 1998. In any event, since the association between 
internet searching and finding a job was weak at best, it seems unlikely that this 
hypothetical unobserved quality is playing a large role here. 


THE STRENGTH OF ONLINE TIES 


The results of this study suggest another set of questions that may warrant future 
investigation. Evidence from the job-search literature suggests that a substantial 
portion of the best and most efficient matches do not occur when searchers 
apply for advertised vacant positions, but when searchers and employers find 
information about one another through a chain of acquaintances. This brings us 
back to Granovetter's strength-of-weak-ties hypothesis (1973), which is a useful 
lens through which to view the online job-searching phenomenon. If, as he and 
others have shown, having a wide network of structurally “weak” ties provides 
greater quantity and quality of information with which to find a job, perhaps 
this is true of the internet as well. Although all types of relationships can be 
maintained all or in part online, a characteristic of internet communication is that 
it reduces the cost of transmitting information and can provide links to distant 
and otherwise inaccessible parts of the social structure created and maintained 
for purposes other than searching for jobs. 

Although online ties are not necessarily weak in an affective sense, they 
may provide a cheap way for people to maintain the relatively fragile ties to 
people outside of one's primary group, such as former colleagues or co-workers, 
acquaintances, friends of friends, and classmates who may serve as important 
bridges to a wide network of information. The internet also might provide a
mechanism through which individuals can form new weak ties by associating 
with others on the internet who share specific interests, thus forming new 
bridges between local networks and providing access to otherwise unavailable 
information. One implication of this is that if Granovetter's thesis can be applied 
to online relationships, then, all else equal, formal methods of job searching 
online are unlikely to be more successful than they are offline. In fact, there is
reason to believe that they may be less effective if formal online recruitment raises
the screening costs for employers by bringing in greater numbers of applicants.
A second implication is that use of the internet to pursue hobbies and interests, to obtain information
from other internet users, and to maintain connections with friends and 
acquaintances should expand networks of “weak” ties and contribute to a more 
successful job search. The significance of the strength-of-weak-ties hypothesis 
for online contacts is that networks established and/or maintained online can 
contribute to the success of job searches by widening the available pool of 
information and the ease with which it can be conveyed. Thus, although using 
the internet specifically to search for jobs may not hold any advantage, access to 
the internet could possibly contribute to networks of the types of ties that have 
been shown to be most effective in transmitting job information. 

If the strength of weak ties is supported online as well as offline, then perhaps 
the internet is indeed a labor market intermediary, but not in the conventional 
sense. Instead, this new communication technology might be helping people to 
find jobs not because it provides access to larger numbers of job advertisements, 
but because it facilitates the personal communications between friends and 
acquaintances that often provide information about jobs. In particular, online 
communication may facilitate exactly the sort of structurally weak ties that 
Granovetter and others have identified as the most useful for information 
gathering. Obtaining information through the internet via job boards and 
postings is thus analytically distinct from the process of getting information 
through two-way communication with other internet users. In this sense, the 
internet may be a useful tool to help workers and employers find each other 
because it contributes to the social capital that can be drawn on by actors in 
the labor market.¹² This may have implications for unemployed workers finding
jobs as well as for employed workers moving within (Podolny and Baron 1997) 
and between firms. 


Conclusion 


Recently, there has been a sizeable hullabaloo over the internet "revolution": it
has transformed access to information and forever changed many aspects of our 
lives, including communicating with friends and loved ones, buying and selling 
goods and services, accessing entertainment, getting jobs, and even finding mates. 
It is true that the internet has changed the way we do many things, and the rapid 
rise in use of the internet to search for information about jobs among all segments 
of the population is a case in point. However, the internet may have changed 
these things in form rather than substance. Workers and employers are still faced 
with the need to uncover good information about each other, and although the 
internet gives us new ways to find this information, the central problem, the
difficulty of evaluating that information to produce a good match, remains the
same. The large number of unemployed people searching for jobs on the internet
provides a timely empirical situation in which the tension between quality and
quantity of information is revealed. This analysis shows that, at least in the case
of internet job searching, more information is not necessarily better.


1256 / Social Forces 83:3, March 2005


Notes 


1. Of course, an additional benefit is the possibility of a good reference, which is a signal that 
not all job applicants are equally able to produce. References may be valuable for workers 
as a signal and for employers as a screen but do little to aid in the initial information search. 
In addition, there is some evidence that, at least for entry-level jobs, employers rarely check 
references even when they are available (Miller and Rosenbaum 1997; Pager 2002). 


2. These data are my calculations based on CPS respondents age 25-65 in the December 1998
and August 2000 survey months, from the sample used for this analysis. 


3. In addition to selectivity on observable characteristics, there may also be selectivity 
on unobservable qualities. Whether it is productivity, resourcefulness, or some other 
characteristic, this selectivity should also diminish as internet use becomes more widespread, 
although this hypothesis cannot be directly tested here. 


4. I exclude all workers under the age of 25 because I believe the processes studied here may 
operate differently for the youngest workers. Younger workers tend to have more frequent 
bouts of unemployment and change jobs more often than older workers as they try out 
new careers (Meisenheimer and Ilg 2000). In addition, the CPS basic monthly survey only asks
respondents under age 25 about their school attendance. This means that I cannot exclude 
students consistently across age groups if I include the younger workers, which is an especially 
worrisome problem when the August 2000 sample period begins during summer vacation. 


5. Not all of these cases are necessarily employed in the last observation, as a small number of
the job seekers found jobs but subsequently became unemployed again during the observation 
period. Although these searchers have not necessarily been “successful” in terms of finding 
long-term stable employment, their job search did result in a job, albeit short-lived, so they 
will be considered job finders for this analysis. 


6. This measure also captures those unmeasured factors that may have changed between 
the two years. Since I control for economic conditions in the regression models, differences 
cannot be attributed to changes in the demand for workers. 


7. Alternative models were estimated with indicators for specific racial groups; however, the 
effects for all groups were consistent in their directions, and the cell frequencies for most 
categories were quite small. As a result, I decided to collapse the racial groups into two 
categories. 


8. In order to preserve confidentiality, the CPS does not record a geographic code smaller than 
the state level for those respondents who do not live in metropolitan areas. Unfortunately, 
this leaves nearly a quarter of the sample for whom the local labor market could not be 
identified. For these respondents, I matched their state unemployment rate instead. Although 
it is not ideal to include these different levels of precision in this variable, I ran all regression 
models with an interaction between the unemployment rate (UNEMPRT) and a dummy 
for metropolitan status (METSTAT), and the interaction coefficient was not significant at 
any acceptable alpha level, so I am satisfied that this variable fits nonmetropolitan workers
as well as metropolitan workers. It is possible that, although the local unemployment rate
is a good measure of the local opportunity for those in cities and surrounding suburbs, for
those in more rural areas, the state unemployment rate may be a better measure since workers there
are more under the influence of the prevailing state unemployment rate.


9. Although it would be desirable to be able to test directly for differences in coefficients by 
modeling internet use and searching with a pooled sample and interaction, I have elected to 
analyze the data in this way because the interpretation is simpler and the pattern much clearer. 
Because I am interested in changes over time, a pooled model would require interactions 
between every variable and the time variable. The additional interaction terms would 
make this table very confusing and are unnecessary since I only wish to examine changes in 
the predictors of internet use rather than interpret the precise slopes. Nevertheless, a table 
analyzing this model with pooled data is available by request from the author. 


10. In order to determine whether the internet might have a differential effect on finding a 
job depending on the human capital of the job searcher, I estimated another model with an 
additional interaction between having a college degree and searching online. This model 
tests whether the internet might be more useful in helping educated workers, who are more 
likely to have computer skills or use computers at work, to find jobs. This model did not fit 
significantly better according to a likelihood ratio test, and the coefficient for the interaction 
was nonsignificant, so I infer that the relationship between internet searching and finding a job 
does not differ by college degree. These results are available by request from the author. 


11. Note that this analysis does not assess the quality of the match, only the likelihood of 
finding one. Granovetter's study was conducted on a sample selected for employment (all 
PTM workers), which has a quite different character than the group of interest here. His 
job searchers all conducted successful searches by definition, while this study is a snapshot of 
search strategies and a binary indicator of their success (or lack thereof) over a short period 
of time. We do not know how many unemployed would-be PTM workers never had the 
chance to be selected for Granovetter’s sample because they were not able to find a job, or 
found a job in a different occupation. Similarly, the group under analysis here by definition 
does not include those who may have been looking for a job while employed in the baseline 
month and who may or may not have been successful. The quality of an
existing match (which Granovetter examines) and the short-term odds of a match (which is
studied here) are analytically distinct and seem to have different mechanisms. Furthermore,
the inherent differences between the groups of workers being studied make this analysis
fundamentally different from that of Granovetter and others following him. In any case, it
seems that since unemployed workers are selected specifically for the fact that they do not 
have a job, they tend to be disadvantaged in some of the areas in which Granovetter’s sample 
was relatively superior. Thus, their friends and relatives may be less than useful when it 
comes to finding a new job, and the main source of reliable information about vacancies 
comes from advertisements. 


12. All of this is speculation, unexamined in the present study. Little is known about online 
communication networks, as the data requirements are quite daunting and no detailed study 
has been conducted of the use of online personal contacts in a job search. Like an offline
study, an online study of this type would pose theoretical and methodological problems
of its own, as it is impossible to know until afterward which interactions and pieces of 
information may or may not lead to a job. If, as Granovetter notes, it is the relationships 
cultivated for reasons unrelated to job searching that may prove to be most important, 
then the study must include all social interactions, not just those intended to garner job 
information.


Appendix A. Sample Selection and Representativeness


One-quarter of each monthly CPS sample are tracked across all four months and constitute a 
representative sample of the labor force before attrition. Since the CPS is a household survey, 
an additional group is lost as a result of attrition because members of the group move out of 
the household, refuse to participate, or cannot be contacted. The sample is further reduced 
when workers under age 25 and employed workers are eliminated. Since the unemployment 
rate during these years was the lowest it has been in decades, and younger workers are more 
likely to be unemployed, the vast majority of the labor force is eliminated. This process yields 
final samples of 316 unemployed workers in December 1998 and 346 in August 2000 who 
were surveyed for four consecutive months, as shown in Table A1. 


Table A1. CPS Sample Selection Details

                               Civilian       Entering 4-Month    Remaining           Age       Unemployed
                               Labor Force    Sample Rotationᵃ    after Attritionᵇ    25-65     at Baseline
December 1998-March 1999       60,709         14,966              11,362              10,970    316
August-November 2000           61,379         15,069              11,231              10,483    346
Total                          122,088        30,035              22,593              21,453    662
Percent of previous column     100            25                  75                  95        3
Percent of total sample        100            25                  19                  18        1


ᵃ CPS respondents are interviewed for four consecutive months, are out of the sample for the subsequent
eight months, and then are interviewed for another four consecutive months. Thus, in any given month,
about one-fourth of the sample are entering their four-month rotation and are eligible for my sampling
procedure.

ᵇ Respondents who were not interviewed in any of the three months following their entrance into the
sampling rotation were excluded.
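
The totals and percentage rows of Table A1 can be checked directly from the two sample-period rows. The following sketch is only a verification of the table's arithmetic; the list and variable names are hypothetical, and percentages are rounded to whole numbers as in the table.

```python
# Counts at each selection stage, as printed in Table A1.
stages = [
    "Civilian labor force",
    "Entering 4-month sample rotation",
    "Remaining after attrition",
    "Age 25-65",
    "Unemployed at baseline",
]
dec_1998 = [60709, 14966, 11362, 10970, 316]
aug_2000 = [61379, 15069, 11231, 10483, 346]

# Pooled counts across the two sample periods.
totals = [a + b for a, b in zip(dec_1998, aug_2000)]

# Each stage as a percent of the stage before it (the first stage is 100 by definition).
pct_of_previous = [100] + [
    round(100 * current / previous)
    for previous, current in zip(totals, totals[1:])
]

# Each stage as a percent of the full civilian labor force sample.
pct_of_total = [round(100 * count / totals[0]) for count in totals]

for stage, total, prev, overall in zip(stages, totals, pct_of_previous, pct_of_total):
    print(f"{stage:35s} {total:>8,d} {prev:>4d} {overall:>4d}")
```

The final analytic sample of 662 is thus about 3% of the age-eligible respondents who remained after attrition and under 1% of the pooled civilian labor force sample.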


Since attrition and nonresponse may be associated with changes in employment status, the 
selection process may have some association with the dependent variable. As a consequence, the 
representativeness of the sample, as well as the reliability of the estimates, may be threatened. 
However, because the CPS is conducted by highly trained interviewers and weighted in order 
to adjust for sources of attrition, the data are the most representative national data available 
on unemployed workers. Since the sample as a weighted percent of the total workforce closely 
resembles the national unemployment rate in these months, it can, with some caution, be 
considered fairly representative. However, the sample size is fairly small, so the chance remains 
that any findings are due to sampling error.


Appendix B: Does the Internet Help People Find Better Jobs?


If using the internet may affect one's odds of finding a job, the next question is whether it 
has an impact on the quality of those jobs. One measure, albeit an imperfect one, of job 
quality is wages. Here, I use data on reported weekly earnings, in 2001 dollars, for those who 
found jobs to estimate a simple log-wage model. These data have a substantial and almost 
certainly nonrandom nonresponse problem beyond what can normally be expected for an 
earnings variable (23% missing). Three hundred and sixty-nine workers found jobs, but only
285 reported wages. I expect that since these are workers who have just found new jobs, they 
are more likely than workers in general either not to know their weekly earnings or to be 
sensitive about answering this question. Since these data are unlikely to be missing at random,
these findings should be taken with a grain of salt. In a separate analysis, I modeled the log 
odds of having missing data for this variable. Although the pattern is not entirely clear, men 
were more likely than women to be missing; in 1998 internet searchers were more likely to 
be missing than nonsearchers, but in 2000 there was no difference. (A table showing these 
results is available by request from the author.) For this reason, I do not develop this model 
fully but present a basic outline of some patterns in Table A2. Given the bias introduced by
nonrandomly missing data, any findings should be interpreted with extreme caution, and I
prefer not to make inferences outside of the specific sample.


Table A2. Results of OLS Regression of Logged Weekly Earnings for
Unemployed Workers Who Found a Job within Three Months

                            b            (S.E.)    t        p
2000                        -.41*        (.11)     -3.67    <.001
Logged weeks unemployed     -.07*        (.02)     -3.05    <.01
Age                         [illegible]  (.04)     2.97     <.01
Age²                        .00*         (.00)     -3.01    <.01
Female                      -.34*        (.09)     -3.66    <.001
Nonwhite                    [illegible]  (.10)     1.52     .19
Education (years)           .06*         (.03)     2.24     .03
Four-year college degree    .36*         (.16)     2.22     .03
Netsearch                   .51          (.29)     1.79     .07
Internet                    -.38         (.26)     -1.50    .14
Netsearch*2000              -.68*        (.34)     -1.99    .05
Internet*2000               .70*         (.30)     2.36     .02
Constant                    3.25         (.85)
R²                          .24
F (12, 273)                 7.09                            <.001

N = 285
* p < .05

In 1998, people who searched online had logged weekly earnings about .514 higher 
than those who did not, and in 2000 about .166 lower, controlling for the other variables. 
The interaction terms for both internet use and searching by year show that these slopes are 
significantly different from each other, although neither may be significantly different from 
zero. In addition, longer unemployment spells lead to lower wages (possibly due to selectivity 
and/or a decline in the reservation wage), women earn less than men, more educated people 
earn more, and older people earn more (at a decreasing rate). In general, at first cut the effect of 
internet searching on wages appears similar to its effect on the odds of finding a job, although 
the analysis presented here is insufficient to draw many conclusions. 
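
The wage figures in this appendix can be traced with a few lines of arithmetic. The sketch below is illustrative only and should be read with the same caution as the estimates themselves: it recomputes the share of job finders with missing earnings, recovers the year interaction implied by the two gaps quoted in the text (.514 and -.166), and converts those log-point gaps into approximate percentage differences in weekly earnings. The percentage conversion is an added interpretation that does not appear in the original, and all names are hypothetical.

```python
import math

# Nonresponse on the earnings item, from the counts given in the text.
found_jobs = 369
reported_wages = 285
missing_share = (found_jobs - reported_wages) / found_jobs

# Gaps in logged weekly earnings for internet searchers, as quoted in the text.
netsearch_gap_1998 = 0.514
netsearch_gap_2000 = -0.166

# The 2000 gap is the 1998 gap plus the Netsearch*2000 interaction,
# so the interaction is the difference between the two gaps.
implied_interaction = netsearch_gap_2000 - netsearch_gap_1998


def log_points_to_pct(log_gap: float) -> float:
    """Approximate percent difference in earnings implied by a log-earnings gap."""
    return (math.exp(log_gap) - 1.0) * 100.0


print("Share missing earnings:", round(missing_share * 100), "%")
print("Implied Netsearch*2000 coefficient:", round(implied_interaction, 2))
print("1998 gap in percent:", round(log_points_to_pct(netsearch_gap_1998), 1))
print("2000 gap in percent:", round(log_points_to_pct(netsearch_gap_2000), 1))
```

The implied interaction agrees with the Netsearch*2000 coefficient printed in Table A2, which is a useful consistency check given how poorly the table reproduces.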


After Developmentalism and Globalization, 
What?* 


IMMANUEL WALLERSTEIN, Yale University 


In 1900, in preparation for the Exposition Universelle in Paris, the French Ministry 
of Colonies asked Camille Guy, the head of its geographical service, to produce a 
book entitled Les colonies françaises: la mise en valeur de notre domaine coloniale. 
A literal translation of mise en valeur is “making into value.” The dictionary, 
however, translates “mise en valeur” as “development.” At the time, this expression 
was preferred, when talking about economic phenomena in the colonies, to the 
perfectly acceptable French word, “développement.” If one then goes to Les Usuels
de Robert: Dictionnaire des Expressions et Locutions figurées (1979) to learn more 
about the meaning of the expression “mettre en valeur,” one finds the explanation 
that it is used as a metaphor meaning “to exploit, draw profit from.” 

Basically, this was the view of the pan-European world during the colonial era 
concerning economic development in the rest of the world. Development was a 
set of concrete actions effectuated by Europeans to exploit and draw profit from 
the resources of the non-European world. There were a number of assumptions in 
this view: Non-Europeans would not be able or perhaps even willing to “develop” 
their resources without the active intrusion of the pan-European world. But such 
development represented a material and moral good for the world. It was therefore 
the moral and political duty of the pan-Europeans to exploit the resources of 
these countries. There was consequently nothing wrong with the fact that, as a 
reward, the pan-Europeans who exploited the resources drew profit from them, 
since a secondary advantage would go to the persons whose resources were being 
exploited in this way. 

This rationale of course completely omitted discussion of the cost in life and 
limb to the local people of such exploitation. The conventional calculus was 
that these costs were, as we would say in today’s euphemisms, the necessary and 
inevitable “collateral damage” of Europe’s “civilizing mission.” 

The tone of the discussion began to change after 1945, primarily as a result
of the strength of anticolonial sentiments and movements in Asia and Africa,
and a new sense of collective assertiveness in Latin America. It is at this point
that “development” came to be used as a code word for the belief that it was
possible for the countries of the South to “develop” themselves, as opposed to
“being developed” by the North. The new assumption was that, if the countries
of the South would only adopt the proper policies, they would one day, some
time in the future, become as technologically modern and as wealthy as the
countries of the North.


* Keynote address at conference, “Development Challenges for the 21st Century,” Cornell University,
Oct. 1, 2004.

At some point in the post-1945 period, Latin American authors began to 
call this new ideology “desarrollismo” or “developmentalism.” The ideology of
developmentalism took a number of different forms. The Soviet Union called it 
instituting “socialism,” which became defined as the last stage before “communism.” 
The United States called it “economic development.” Ideologues in the South often 
used the two terms interchangeably. Amidst this worldwide consensus, all the 
states of the North—the United States, the Soviet Union (and its East European 
satellites), the West European colonial (now becoming ex-colonial) powers, and 
the Nordic countries plus Canada—began to offer “aid” and advice concerning 
this development that everyone favored. The Economic Commission for Latin 
America (CEPAL) developed a new language of “core-periphery” relations, used 
primarily to justify a program of “import-substitution industrialization.” And 
more radical Latin American (and other) intellectuals developed a language 
about “dependency,” which, they said, needed to be fought against and overcome 
in order that dependent countries be in a position to develop. 

The terminology may have differed but the one thing that was agreed 
upon by everyone was that development was indeed possible, if only . . . When
therefore the United Nations declared that the 1970s would be the “decade of
development,” the term and the objective seemed virtually a piety. Yet, as we
know, the 1970s turned out to be a very bad decade for most of the countries of 
the South. It was the decade of the two successive oil price increases instituted by 
OPEC and of stagflation in the North. The consequent rise in the cost of imports 
for countries in the South combined with a sharp decline in the value of their 
exports because of the stagnation in the world-economy created acute balance 
of payments difficulties for just about every one of these countries (including 
those in the so-called socialist bloc), with the sole exception of those which were 
oil-exporting states. 

The oil-exporting states acquired incredibly large surpluses, a large part of 
which they deposited in banks in the United States and Germany, who thereupon 
needed to find a remunerative use for this extra capital. They found it in loans to 
states with acute balance of payments difficulties. These loans, actively promoted 
by the banks themselves, solved both problems: finding an outlet for the surplus 
money in the accounts of the banks of the North and solving the liquidity 
problems of the virtually insolvent states of the South. But, alas, the loans led 
to cumulative interest payments which, by 1980, had led to even greater balance 
of payments difficulties in these states. Loans unfortunately are supposed to 
be repaid. The world thus arrived at the suddenly discovered so-called debt
crisis—Poland in 1980, Mexico in 1982, and then all over the place. 

It was easy enough to find the villain in the piece. The finger was pointed at 
developmentalism, so universally praised just a decade before. Import-substitution 
industrialization was now perceived as corrupt protectionism. State-building was 
deconstructed as feeding a bloated bureaucracy. Financial aid was now analyzed 
as money poured down a sink, if not a gutter. And parastatal structures, far 
from being virtuous efforts at pulling oneself up by one's own bootstraps, were 
exposed as deadening barriers to fruitful entrepreneurial achievement. It was 
decided that loans to states in distress, to be beneficial, needed to be hedged by 
requirements that these states cut wasteful state expenditures on such deferrable 
items as schools and health. It was further proclaimed that state enterprises were 
almost by definition inefficient and should be privatized as rapidly as possible, 
since private enterprises were again almost by definition responsive to the 
"market" and therefore maximally efficient. Or at least that was the consensus 
in Washington. 

Academic buzz words and fads are fickle and usually last but a decade or two. 
Development was suddenly out. Globalization arrived in its wake. University 
professors, foundation executives, book publishers, and op-ed columnists all saw 
the light. To be sure, the optic, or better said the remedies, had changed. Now, the 
way to move forward was not to import-substitute but to export-orient productive 
activities. Down not only with nationalized industries but with capital transfer 
controls; up with transparent, unhindered flows of capital. In place of one-party 
regimes, let us all together study governance (a new word, splendidly erudite and 
quite inscrutable, if not meaningless). Above all, let us face Mecca five times a 
day and intone Allahu Akhbar TINA— There is No Alternative. 

The new dogmas took root in the 1980s amidst the decaying rot of 
developmentalist dreams. They flourished in the 1990s bathed by the sparkle of 
the "new economy" in which the United States and eastern Asia were supposed
to be leading the world to its economic glory. But alas, the sheen began to 
tarnish. The currency crisis in East and Southeast Asia in 1997 (which spread to 
Russia and Brazil), the slide downward of the World Trade Organization from 
Seattle to Cancun, the fading of Davos and the spectacular rise of Porto Alegre, 
al-Qaeda and September 11, followed by the Bush fiasco in Iraq and the current 
accounts crisis of the United States—all this and more leads one to suspect that 
globalization as rhetoric may be going quickly the way of developmentalism. And 
hence our question—After Developmentalism and Globalization, What? 

Let us not be too acerbic about faded theorizing. The whole discussion from 
1945 to today has indeed been one long effort to take seriously the reality that 
the world-system is not only polarized but polarizing, and that this reality is both 
morally and politically intolerable. For the countries at the bottom, there seemed 
nothing more urgent than figuring out how to improve their situation, and first 
of all economically. After all, all these people had to do was see a movie and they 
would know that there were other people and places in the world that were better
off, far better off, than they were. As for the countries at the top, they realized,
however dimly, that the "huddled masses yearning to breathe free" represented
a permanent danger to world order and their own prosperity, and that therefore 
something, somehow had to be done to dampen the tinderbox. 

So, the intellectual analyses and the derived policy efforts represented by the 
discussion about development and globalization were serious and respectable, 
if in retrospect quite misguided in many ways. The first question we need to ask 
now is, is it at all possible for every part of the world to attain—one day in a 
plausibly not too remote future—the standard of living of say Denmark (and 
perhaps also similar political and cultural institutions)? The second question is, 
if it is not, is it possible for the present lopsided and highly inegalitarian world- 
system to persist, more or less as such? And the third question is, if it is not, what 
kinds of alternatives present themselves to all of us now? 


I


Is it at all possible for every part of the world to attain—one day in a 
plausibly not too remote future—the standard of living of say Denmark 
(and perhaps also similar political and cultural institutions)? 


There is no question that Denmark—and most OECD (Organisation for 
Economic Co-operation and Development) countries—have a quite decent 
standard of living for a substantial proportion of their population. The standard 
measure of internal variation of income, the Gini curve, shows quite low numbers 
for most OECD countries, and by world standards reasonably good ones for 
all of them (e.g., see Atkinson, Rainwater, and Smeeding). To be sure, there are 
many poor people in these countries, but compared to almost any country of the 
South, far fewer. So, of course, people in these poorer countries aspire to be as 
rich as people in Denmark. In the last few years, the world economic press has 
been full of stories about the remarkable rates of growth of China—a country 
which not too long ago was considered to be one of the poorest—along with 
much speculation about whether or not and to what degree these rates of growth 
can continue in the future and thereby transform China into a relatively wealthy 
country in terms of GDP per capita. 

Let us leave aside the fact that many, many other countries have shown 
remarkable growth spurts for as much as up to 20-30 years, which rates then 
nonetheless petered out. There are, for example, the recent cases of the Soviet 
Union and Yugoslavia. Let us also leave out of the equation the long list of 
countries whose GDP was better in the further past than in the present. Let us 
assume for a moment that China’s economic growth continues unhindered for 
another twenty years, and that China’s GDP per capita approaches, let us say, if 
not that of Denmark at least that of Portugal or even Italy. Let us even speculate
that up to 50% of its population benefits significantly from this growth spurt,
which is then reflected in their real income. 

Is it credible to hold everything else constant, and to assume that, at the very 
least, everyone else remains where they are today in terms of standard of living? 
Where is the surplus value to come from that would permit 50% of China's
population to consume at the level of 50% of Italy's population, while all the
rest of the world consumes at a level at least as high as at present? Is this all 
supposed to come from the so-called greater productivity of world (or Chinese) 
production? It is clear that the skilled workers of Ohio and the Ruhr valley do 
not think so. They think they would pay for it, that they are already paying for 
it, by significantly reduced standards of living. Are they really so wrong? Has this 
not been happening in the past decade? 

The first piece of evidence is the entire past history of the capitalist world- 
economy. In over five hundred years of its existence, the gap between the top 
and the bottom, the core and the periphery, has never gotten smaller, always 
larger. What is there in the present situation that should lead us to assume that 
this pattern would not continue? Of course, over those five hundred years, there 
is no question that some countries have improved their relative standing in the 
distribution of wealth in the world-system. Thus, it could be claimed that these 
countries had "developed" in some sense. But it is also true that other countries 
are lower in relative wealth rankings than earlier, some of them spectacularly so. 
And, although our statistical data is of even minimal quality only for
the last 75–100 years, such comparative studies as we have do show a constant
trimodal distribution of wealth in the world-system, with a few countries moving
from one category to another.

The second piece of evidence is that high levels of profit, and therefore of
the possibility of accumulating surplus value, correlate directly with the relative
degree of monopolization of productive activity. What we have been calling
development for the last fifty years or so is basically the ability of some countries 
to erect productive enterprises of a type considered to be highly profitable. To 
the extent that they succeed in doing this, they thereby reduce the degree of 
monopolization of production in this particular arena and hence reduce the 
degree of profitability of such production. The historic pattern of successive so- 
called leading industries—from textiles to steel and automobiles to electronics to 
computer technology—is clear evidence of this. The U.S. pharmaceutical industry 
is right now fighting a rear-guard battle against just such decline in potential 
profitability. Can Boeing and Airbus maintain their present profit levels in the 
face of competition by a putative Chinese aircraft construction industry twenty 
or thirty years from now? 

So, basically, of two things one. Either the rising, so-called newly-developing 
countries will be crushed by some highly destructive process—warfare, plague, 
or civil war. And in this case, the existing economic centers of accumulation will 
remain on top, and the polarization will be still more acute. Or the rising, newly-
developing countries will be able to reproduce some of the major productive
processes of the present centers. And in this case, either the polarization will 
simply be inverted (which is unlikely) or there will be a flattening of the curve. But 
in this latter case, the ability to accumulate surplus value in the world-economy 
taken as a whole will diminish severely, and the raison d'être of a capitalist world-
economy will be undermined. In none of these scenarios does every country
become a Denmark. 

If there has come to be a general morosity about economic development and 
the positive benefits of globalization, it is, I would argue, because the sense that 
we are in a cul-de-sac has begun to creep in on more and more people—scholars, 
politicians, and above all ordinary workers. The optimism of the 1950s and 1960s, 
which was momentarily revived in the 1990s, is no longer with us. 

I personally can see no way in which, within the framework of a capitalist 
world-economy, we can approach a general equalization of the distribution of 
wealth in the world, and even less an equalization that would have everyone 
consume at the level of the modal Danish consumer. I say this, taking into 
account all possible technological advances as well as increases in that elusive 
concept, productivity. 


II 


If it is not [possible for all countries to achieve a Danish standard of 
living within the framework of the world-system in which we live], is it 
possible for the present lopsided and highly inegalitarian world-system 
to persist, more or less as such? 


I doubt it. But of course we must be careful here, since predictions of dramatic 
structural change have been made so frequently over the past two centuries and 
have turned out to be inaccurate over a medium term because some crucial 
elements were left out of the analyses. 

The major explanation of purported prospective fundamental structural 
change has been dissatisfaction of the exploited and oppressed. As conditions 
worsened, the people at the bottom, or some very large group, were destined—it 
was argued—to rebel. There would be what has usually been called a revolution. 
I shall not resume the arguments and counterarguments, which are no doubt 
quite familiar to almost anyone who has been seriously studying the history of 
the modern world-system. 

The twentieth century was, among other things, the moment of a long series 
of national uprisings and social movements which proclaimed their revolutionary 
intents and which achieved state power in one form or another. The high point of 
these movements was the period 1945-1970, the period precisely of the flourishing 
of developmentalism, which was in some sense the credo of these movements.
But we also know that the period 1970-2000 saw the downfall of most of these 
movements in power, or at least a drastic revision in their policies. This was the 
period of the flourishing of globalization, whose logic these movements—those 
still in power or those now seeking to play a role of parliamentary opposition— 
sullenly accepted. So, we have the era of triumphalism followed by the era of 
disillusionment. 

Some of the cadres of these movements adjusted to what were thought to 
be the new realities and others jumped ship, either into passive withdrawal or 
into joining actively the erstwhile enemy. In the 1980s and until the mid-1990s, 
antisystemic movements worldwide were in a bad way. By 1995, however, the 
momentary sheen of neoliberalism had begun to wear off and there ensued a 
worldwide search for new antisystemic strategies. The story from Chiapas to 
Seattle to Porto Alegre has been that of the emergence of a new kind of world 
antisystemic movement, sometimes called these days altermondialisme. My name 
for it is the spirit of Porto Alegre and I think it is going to be an important 
element in the world political struggles of the next 25–50 years. I shall return to
it in my discussion of real alternatives now. 

However, I do not believe that a new version of revolutionary movement is 
the fundamental factor in what I see as the structural collapse of the capitalist 
world-economy. Systems collapse not primarily because of rebellion from below 
but because of the weaknesses of the dominant classes and the impossibility of 
their maintaining their level of gain and privilege. It is only when the existing 
system is weakened in terms of its own logic that the push from below can 
possibly be effective. 

The basic strength of capitalism as a system has been twofold. On the one 
hand, it has demonstrated an ability to ensure, against all odds, the endless 
accumulation of capital. And on the other hand, it has put into place political 
structures that have made it possible to guarantee this endless accumulation of 
capital without being dethroned by the rash and dissatisfied "dangerous classes." 
The basic weakness of capitalism as an historical system today is that success is 
leading to failure (as Schumpeter taught us normally happens). As a consequence 
today, both the ability to guarantee the endless accumulation of capital and the 
political structures that have kept the dangerous classes in line are collapsing 
simultaneously. 

The success of capitalism in ensuring the endless accumulation of capital has 
been in its ability to keep the three basic costs of production—costs of personnel, 
costs of inputs, and taxation—from escalating too fast. However, it has done 
this by mechanisms that have been exhausting themselves over historical time. 
The system has now begun to reach a point where these costs are dramatically 
too high to make production an adequate source of capital accumulation. The 
capitalist strata have turned to financial speculation as a substitute. Financial 
speculation, however, is intrinsically a transitory mechanism, since it is dependent 
on confidence, and confidence in the medium run is undermined by the very
speculation itself. Allow me to illustrate each of these points. 

The costs of personnel are a function of the ongoing, never-ending class 
struggle. What the workers have on their side is the concentration of production 
(for reasons of efficiency) and hence their ability over time to organize themselves 
in both the work place and the political arena to put pressures on the employers 
to increase their remuneration. To be sure, employers always fight back by playing 
one set of workers off against another. But there are limits to doing this within the 
framework of a single country or a single local area, since there are political means 
by which the workers can encrust their advantages (legally and/or culturally). 

Whenever we are in a Kondratieff A-phase, employers, faced with militant 
worker demands, usually prefer to allow remuneration to rise somewhat, since 
work stoppages do them more immediate damage than concessions. But as soon 
as we are in a Kondratieff B-phase, it becomes imperative for an employer who 
hopes to survive the bad times to reduce the remuneration package, since there is 
acute price competition. It is at this point that employers have historically resorted 
to relocation—the "runaway factory"—transferring their production to zones that 
have "historically"-lower rates of remuneration. But exactly what history accounts 
for these historically-lower rates? The answer is rather simple—the existence of a 
large pool of rural labor, for whom urban, waged employment, at whatever level 
of remuneration, represents a net increase in real income for the household. So, 
as remuneration goes up, more or less permanently, in one area of the world- 
economy, it is compensated in terms of the world-economy as a whole by the 
appearance of new cohorts of workers who will accept lower remuneration for 
the identical work, holding of course efficiency constant. 

The problem with this solution to the regularly repeated problem of the 
owner/producers is that after 25–50 years the workers in this new zone of
production are able to overcome their initial urban disorientation and political 
ignorance and proceed down the same path of class struggle as did others 
previously in other areas of the world. The zone in question thereupon ceases to 
be a zone of historically-lower remuneration, or at least not to the same degree. 
Sooner or later, the employers are required, in their self-interest, to flee again, 
relocating to yet another zone. This constant geographical shift of the zones of 
production has worked quite well over the centuries but does have an Achilles 
heel. The world is running out of new zones into which to relocate. This is what 
we mean by the deruralization of the world, which is going on apace, and at a 
very accelerated rate since 1945. The proportion of world population that lives 
in cities went from 30% to 60% between 1950 and 2000 (see Neubauer). The
capitalist world-economy should run out of such zones entirely within 25 years at 
the most. There are already too few. And with modern means of communication, 
the time period for new zones to learn the lessons of how to organize has been 
drastically reduced. Hence, the ability of employers to keep remuneration in 
check has been drastically curtailed.

The cost of inputs depends on what percentage of the inputs the employer
is required to pay for. To the extent that he can get inputs free, his costs remain low.
The major mechanism by which employers have over the centuries been able 
to avoid payment for inputs is by shifting the cost to others. This is called the 
externalization of costs. The three principal costs that have been externalized are 
detoxification, renewal of primary resources, and infrastructure. 

Detoxification is easy to handle in the beginning. One dumps waste somewhere 
that is public or unoccupied. This costs next to nothing. The costs are usually 
not immediate, but delayed. The eventual difficulties become the problem of 
the "public"—either as individuals or collectively as governments. Clean-up, 
when it is undertaken, is seldom paid for by the original user. In premodern 
times, rulers moved to different castles as they ran out of sewage dumps. In the 
capitalist world-economy, producers do more or less the same. The problem 
here is identical to the problem of runaway factories and remuneration levels. 
We are running out of new prospective dumps. In addition, the collective cost of 
toxification has caught up with us, or at least we are more aware of it because of 
scientific advances. Hence, the world seeks to detoxify waste. This is called concern 
with the ecology. And as concern mounts, the question of who pays comes to 
the forefront. There is increasing pressure to make the user of the resources who 
leaves toxic waste pay the costs of detoxification. This is called internalization of 
costs. To the extent that governments impose such internalization of costs, the 
overall costs of production rise, sometimes quite steeply. 

The issue of the renewal of primary resources is basically analogous. If forests 
are cut down, they may renew themselves via natural processes, but often slowly. 
And the faster forests are cut down (because of increased world production), the 
harder it is for the natural renewal process to take place in meaningful time. So 
here too, as the ecological concerns have come to the fore, both governments 
and social actors have put pressure on users either to restrain use or to invest 
in renewal. And to the extent that governments impose internalization of these 
costs, the costs of production rise. 

Finally, the same is true of infrastructure. Infrastructure, almost by definition, 
is expenditure on costly activities that cannot be attributed to a single producer— 
for example, constructing public roadways over which transportation of goods 
takes place. But the fact that these costs cannot be considered the costs of a 
single producer does not mean that they cannot be considered the costs of a 
multitude of producers. Furthermore, the cost of such infrastructure has escalated 
geometrically. Yes, they are public goods, but the public can be specified up to 
a point. And once again, to the extent that governments impose even partial 
internalization of such costs, the costs of production rise. 

The third basic cost of production is taxation. Any comparison of the total 
level of taxation in the world, or in any part of the world, with the world of 
a century ago reveals that everyone is paying higher taxes today, whatever the 
oscillation of the rates. What accounts for this? There are three major expenditures 
of all governments—the costs of collective security (armies, police, etc.), the costs
of all kinds of public welfare, and the costs of administration (most importantly, 
the costs of collecting the taxes). Why have these costs of government risen so 
steeply? 

The costs of security have risen simply as a result of technological advance. 
The toys security forces use are every day in every way more expensive. After all, 
security is a game in which all sides always try to have more than their opponents. 
It is like an endless auction in which the bids are always being raised. Perhaps if 
we had a generalized nuclear holocaust, and the surviving world went back to 
bows and arrows, these costs would go down. But in the wake of anything less, 
I see no way to expect such a reduction. 

In addition, the costs of welfare have been going up steadily and nothing is 
slowing them down, despite all the hoopla about doing that. They are going up 
for three reasons. The first is that the politics of the capitalist world-economy have 
pushed the dominant strata to make concessions to the dangerous classes, who 
have been demanding three things—education, health services, and guarantees 
of lifelong income. Furthermore, the level of the demands has been going up 
steadily and becoming more geographically extensive. In addition, people are 
living longer (partly the consequence of precisely these welfare measures), and 
hence the collective costs have increased because of the increase in the number 
of beneficiaries. The second reason is that advances in technology in education 
and health have increased the costs of providing the appropriate machinery (just 
as in the case of expenditures on security). And finally, the producers in each 
of these domains have taken advantage of this government-subsidized public 
demand to take a big cut of the pie. 

Welfare, as the conservative complaint has said, has become an entitlement. 
And it is difficult to see how any government could survive a truly significant 
cutback in these expenditures. But of course, someone must pay for this. And 
producers in the end pay, either directly or via their employees who demand 
higher remuneration precisely to pay these costs. 

We do not have good data on the steady increase of all these costs, but they 
are considerable. On the other hand, we cannot have a rise in the sales price 
of world goods to match the increase of production costs precisely because of 
the enormous expansion of world production which has reduced the multiple 
monopolizations and increased world competition. So the bottom line is that the 
costs of production have risen faster than the sales prices of production, and this 
means a profit squeeze, which translates into difficulties in accumulating capital 
through production. This squeeze has been evident overall for some thirty years 
already, which accounts for the speculative rage that has encompassed world 
capitalists since the 1970s and which shows no signs of letting up. But bubbles 
burst. Balloons cannot be infinitely expanded. 

To be sure, capitalists collectively fight back. This is what neoliberal 
globalization is all about—a massive political attempt to roll back remuneration 
costs, to counter demands for internalization of costs, and of course to reduce
levels of taxation. As has happened with every previous such counteroffensive 
against rising costs, it has succeeded partially, but only very partially. Even after all 
the cutbacks by the most reactionary regimes, the costs of production in the first 
decade of the twenty-first century are markedly higher than they were in 1945. 
I think of this as the ratchet effect—two steps forward and one step backward 
add up to a secular rising curve. 
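
The arithmetic of the ratchet can be made concrete with a minimal sketch. The numbers below (a starting cost index of 100, a rise of two units and a rollback of one unit per cycle) are purely illustrative assumptions, not data from any source, and the function name `ratchet` is hypothetical.

```python
def ratchet(start, up, down, cycles):
    """Return the cost index after each cycle of a rise followed by a partial rollback."""
    level = start
    path = [level]
    for _ in range(cycles):
        level += up    # costs rise during the period of concessions
        level -= down  # the counteroffensive rolls back only part of the rise
        path.append(level)
    return path


# Illustrative values only: two steps forward, one step back, five times over.
path = ratchet(start=100, up=2, down=1, cycles=5)
print(path)  # [100, 101, 102, 103, 104, 105]
```

As long as each rollback is smaller than the rise that preceded it, the index ends every cycle higher than it began, which is all that the image of a secular rising curve requires.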

As the underlying economic structures of the capitalist world-economy 
have been moving in the direction of reaching an asymptote which makes it 
increasingly difficult to accumulate capital, the political structures that have been 
holding the dangerous classes in check have also run into trouble. 

The period of developmentalism, 1945-1970, was also the period of the 
triumph of the historic antisystemic movements, which came into power 
in one form or another almost everywhere. Their biggest promise had been 
the developmentalist dream. When that failed, the support of their followers 
disintegrated. The movements, whether they called themselves communist or 
social-democrat or national liberation movements, fell from power almost 
everywhere. The period of globalization, 1970–2000, was the period of deep
disillusionment with the historic antisystemic movements. They fell from grace 
and are unlikely to attract the deep loyalty of the mass of the populations again. 
They may be supported electorally as better than the other guys, but they no 
longer are deemed worthy of the faith they represented for a golden future. 

The decline of these movements—the so-called Old Left—is not in fact a 
plus for the smooth functioning of the capitalist world-economy. While these 
movements were antisystemic in their goals, they were disciplined structures 
which controlled the spontaneous radical impulses of their followers. They 
mobilized for specific actions, but they also demobilized followers, especially when 
they were in government, insisting on the benefits in a distant future, as opposed 
to untrammeled disturbances in the present. The collapse of these movements 
represents the collapse of constraints on the dangerous classes, who thereby
become dangerous again. The spreading anarchy of the twenty-first century is 
the clear reflection of this shift. 

The capitalist world-economy is today a very unstable structure. It has never 
been more so. It is very vulnerable to sudden and swift destructive currents. 


III 


If it is not, what kinds of alternatives present themselves to all of us 
now?


It is not very comforting to anyone in countries of the South to say that the
present world-system is in structural crisis and that we are in a transition from
it to some other world-system over the next 25–50 years. They will want to know
what happens in the meantime, and what if anything they can or should do to
improve the lot of the populations of these countries right now. People tend to 
live in the present, as indeed they should. On the other hand, it is important 
to know what are the constraints of the present in order that our actions be 
maximally useful, in the sense that they further the objectives we seek in some 
meaningful way. So let me indicate what I think is the scenario over the next 
25–50 years, and what that implies for the immediate present.

The scenario over the next 25–50 years is twofold. On the one hand, the
collapse of our existing historical system is most likely for all the reasons I laid 
out just previously. On the other hand, what will replace the existing system 
is completely uncertain, inherently unpredictable, although all of us can have 
input into that uncertain outcome. It is inherently uncertain because, whenever 
we are in a systemic bifurcation, there is no way of knowing in advance which 
fork in the road we shall collectively take. This is the message of the sciences of 
complexity (see Prigogine). 

On the other hand, precisely because this is a period of transition in which 
the existing system is far from equilibrium, with wild and chaotic oscillations 
in all domains, the pressures to return to equilibrium are extremely weak. This 
means that, in effect, we are in the domain of "free will" and therefore our actions, 
individual and collective, have a direct and large impact on the historical choices 
with which the world is faced. In a sense, to translate this into our concerns, we 
may say that the objectives of "development" which countries and scholars have
been pursuing for some fifty years now are far more realizable in the next 25–50
years than they ever were up to now. But of course there is no guarantee, for the 
outcome is uncertain. 

In the larger geopolitical arena, there are presently three principal cleavages. 
There is first the triadic struggle between the United States, western Europe, and 
Japan/East Asia to be the principal locus of capital accumulation in the capitalist 
world-economy. There is secondly the long-standing struggle between North 
and South for distribution of the world surplus. And there is the new struggle 
that revolves around the structural crisis of the capitalist world-economy and 
centers on which of the two possible forks the world will take in completing the 
transition to a new system. 

The first two struggles are traditional within the framework of the modern 
world-system. The so-called triad are roughly equal contestants in the attempt 
to reorganize the world-system's production and financial systems. As with all 
such triadic struggles, there is pressure to reduce the triad to a dyad, which may 
occur in the next decade or so. I have long argued that the most likely pair is 
the United States and Japan/East Asia against western Europe/Russia (e.g., see 
Wallerstein). But I shall not repeat this argument here, since I consider this 
struggle secondary to the issue of overcoming the polarization of the existing 
system, that is, permitting what we have called "development" throughout the 
world-system. 

The second struggle, that between North and South, has of course been a 
central focus of development issues for the last fifty years. Indeed, the great 
difference between the era of developmentalism and the era of globalization 
has been the relative strength of the two sides. While in the first era, the South 
seemed to be improving its position, if only slightly, the second period has been 
one of a triumphant pushback by the North. But this pushback has now come to 
a close, with the deadlock in the World Trade Organization and the split among 
the spokesmen of the North about the wisdom of the Washington consensus. 
I think here of the increasingly open dissent of such figures as Joseph Stiglitz, 
Jeffrey Sachs, and George Soros, among many others, and the remarkable softening 
of the rigidities of the International Monetary Fund in the post-2000 period. I 
do not expect that in the coming decades there will be much push off-center in 
this contest. 

It is the third cleavage which reflects the new situation, that of the structural 
crisis with its consequent chaos in the world-system and the bifurcation that is 
occurring. This is the split between the spirit of Davos and the spirit of Porto 
Alegre, which I mentioned previously. I should explain what I think are the 
central issues here. The struggle is not about whether or not we are in favor of 
capitalism as a world-system. The struggle is about what should replace it, given 
the implosion of the present world-system. The two replacement possibilities have 
no real names and have no detailed outlines. What is in question is essentially 
whether the replacement system will be hierarchical and polarizing (that is, 
like the present system, or worse) or will be instead relatively democratic and 
egalitarian. These are basic moral choices, and being on one side or the other 
dictates our politics. 

The contours of the actual political players are still uncertain. The side of 
the spirit of Davos is split between those whose vision of the future involves an 
unremitting harshness of strategy and institution-building and those who insist 
that such a vision would create an untenable system, which could not last. At 
the moment, it is a very divided camp. The side of the spirit of Porto Alegre has 
other problems. They constitute politically merely a loose alliance of variegated 
movements all over the world which, today at least, meet together within the 
framework of the World Social Forum (WSF). Collectively, they have no clear 
strategy as yet. But they do have a good deal of grassroots support, and they are 
clear about what they oppose. 

The question is what those who would uphold the spirit of Porto Alegre 
should really do to advance this "other world" they assert is possible. And this is 
a double question. What should those few governments that share their vision, 
at least up to a point, do, and what should the multiple movements do? 
Governments deal with the short-run issues. Movements can deal with both 
short-run and middle-run issues. Both kinds of issues affect the longer-run transition 
process. And short-run issues affect our daily lives immediately. An intelligent 
political strategy must move on all fronts at once. 

The biggest short-run issue is the continuing drive of the neoliberal globalizers 
to achieve a one-sided expansion of open borders—open in the South, but not 
really open in the North. This is the heart of the persistent discussion within the 
framework of the World Trade Organization, and of all the bilateral discussions 
being conducted most notably by the United States but also secondarily by 
the European Union and its members—the creation of multiple "free trade 
agreements" like NAFTA, CAFTA, etc. Basically what the United States pushes for 
is guarantees for its monopolies (so-called intellectual property) and access for 
its financial institutions in return for limited tariff concessions on agricultural 
and low-value industrial goods produced in countries of the South. 

The offensive within the WTO was stalled at Cancún by a coalition of medium 
powers of the South (Brazil, India, South Africa, etc.) who put forward a simple 
demand: free trade that works both ways. If the North wants us to open our 
borders to them, they said in effect, it must open its borders to us. But the North 
is basically unable to accept this kind of deal for two reasons. It would result in 
considerably increased unemployment and downsized income in countries of the 
North, which is politically impossible for governments subject to electoral contests 
to accept. And it is not clear to the triad which of them would profit most, or lose 
least, from such arrangements, and therefore they hesitate. After all, the triad is 
engaged with tariff/subsidy controversies with each other, and arrangements with 
the South would weaken their political positions in this economically even more 
important conflict from the point of view of the countries of the North. 

One can draw two conclusions from this. This is a political quarrel doomed 
to a standstill. And it is politically very important for the countries of the 
South to maintain this stance, from their own point of view. This is the single 
most important action these governments can take to further the possibility of 
maintaining or raising the standard of living in their countries. To the sirens of 
the neoliberal dogmas, these countries are now responding skeptically, “show 
me,” and this skepticism is justified. 

Of course, these governments have to remain in power. And the biggest threat 
to that is external interference in their politics. What the larger countries of the 
South are now doing, and will speed up doing in the next decade, is seeking to 
enter the nuclear club. What this will accomplish is to largely neutralize external 
military threat and thereby minimize external political threat. And the third thing 
one can demand of these governments is social welfare distribution within their 
countries, which of course could include low-level development projects (such as 
digging wells, etc.). What one cannot expect of these countries is that some policy 
on their part is going to turn them into a Denmark in the next 10-20-30 years. 
It’s not going to happen and is basically a diversion from an intelligent policy. 
The role of progressive governments is primarily to make sure that conditions in 
their countries and the world do not get still worse in the decades to come. 

It is the movements that can do more than the governments, although the 
movements need to keep minimally progressive governments in power and not 
engage in leftist infantilist critiques about the lack of achievements that are in fact 
impossible to expect. And here we must point out an important element that is 
often lost from observation. The first two geopolitical cleavages are geographic: 
conflicts among the Triad and North-South conflicts. But the conflict between 
the spirit of Davos and the spirit of Porto Alegre has no geography. It cuts across 
the entire world, as do the movements. It is a class struggle, a moral struggle, 
not a geographic struggle. 

In the medium run, what the movements can best do is to push 
decommodification wherever they can, and to the extent that they can. No one 
can be quite sure how this would work. It will take a lot of experimentation 
to find viable formulas. And such experimentation is going on. It is going on, 
we must remember, within a basically hostile environment, in which there are 
systemic pressures to undermine any such attempts, and which can corrupt the 
participants with not too much difficulty. But decommodification not only stems 
the drive for neoliberal extensions but builds the basis for an alternate political 
culture. 

Of course, the theorists of capitalism have long derided decommodification, 
arguing that it is illusory, that it goes against some presumed innate social 
psychology of humankind, that it is inefficient, and that it guarantees lack of 
economic growth and therefore poverty. All of this is false. We have only 
to look at two major institutions of the modern world—universities and 
hospitals—to realize that, at least up to twenty years ago, no one questioned 
that they should be run as nonprofit institutions, without shareholders or 
profit-takers. And it would be hard to argue seriously that, for that reason, they have 
been inefficient, unreceptive to technological advances, incapable of attracting 
competent personnel to run them, or unable to perform the basic services for 
which they were created. 

We don’t know how these principles would work, if applied to large-scale 
production like steel production or small-scale, more artisanal production. But to 
dismiss this out of hand is simply blind and, in an era when productive enterprises 
are becoming far less profitable than previously, precisely because of the economic 
growth which the capitalist world-economy has bred, foolish. Pushing alternate 
forms of development along these lines has a potential for answering problems 
not only of the South but of the declining industrial regions of the North. 

In any case, as I have insisted, the issue is not what will magically solve the 
immediate dilemmas of our world-system but the basis on which we shall create 
the successor world-system. And to address that seriously, we must first of all 
comprehend with some clarity the historical development of our present system, 
appreciate its structural dilemmas today, and open our mind to radical alternatives 
for the future. And we must do all this, not merely academically but practically, 
that is, living in the present and concerned with the immediate needs of people 
as well as longer-run transformations. We must therefore fight both defensively 
and offensively. And if we do it well, we may, but only may, come out ahead in 
the lifetimes of some of the younger members of this audience. 


Notes 


1. The classic article is that by Giovanni Arrighi and Jessica Drangel (1986). Arrighi is currently 
updating this argument in a forthcoming article. 


2. Although this is prima facie logical, it seldom enters into analyses of mainstream 
economists. 


Response to Wallerstein: The Struggle for Global 
Society in a World System* 


JACKIE SMITH, State University of New York at Stony Brook 





As Wallerstein has shown, the contemporary global economy is in crisis, and this 
creates opportunities for change. Moreover, the rise of a broad-based and globally 
networked popular movement against neoliberal globalization both reflects and 
contributes to the system’s crisis and to the possibilities for transformation. I agree 
that we now see a unique opportunity for shaping a very different global order, 
and in this essay I explore the specific ways that sociologists might contribute 
to this effort. If, as Michael Burawoy argues, sociologists have a responsibility to 
help “represent humanity’s interest in containing the unbridled tyranny of market 
and state” (2004:257), we need a theory that integrates our understandings of 
global structures with concrete ideas to guide our actions. 

Rather than focusing our attention on the structures and vulnerabilities of 
global capitalism, we might instead ask how, given the material 
conditions society faces, we might help empower global civil society as an agent 
for change in the world system (see Waterman and Timms 2004, who encourage 
just such a shift in the international labor movement). The key objective, I think, 
is to politicize the global economy. This requires a strengthening of international 
institutions such as the UN to enhance their deliberative and enforcement 
capacities relative to global financial institutions and the capitalist forces that 
shape those. While a stronger global polity would help empower and nurture civil 
society, it will not come about without concerted efforts by a solidly committed 
and globally networked civil society. This requires efforts to enhance global 
solidarity based on notions of a common humanity while fostering tolerance 
and pluralism. The spirit of Porto Alegre provides a good foundation here, and 
sociologists might think about how we can and should relate to social movements 
in order to help bring about a more just and democratic world system. 


* I am grateful to Graham Knight, John D. McCarthy, Neil McLaughlin, and Lesley Wood for their 
comments on an earlier draft. 


Sociology for Civil Society 


Washing one's hands of the struggle between the powerful and the powerless 
is to side with the powerful, not to be neutral (Freire 1985:102). 


A vibrant civil society in today's information-rich world requires creative and 
forceful efforts to enhance access to knowledge and to communicate effectively 
our research results. Economists have received Nobel prizes for “proving” things 
that sociology has long established, such as the fact that no actors have perfect 
access to information (Joseph Stiglitz) and that hunger is a function of politics 
and not production capacity (Amartya Sen).¹ We can complain that those in 
power don't listen to critical social scientists, but what are we doing to try to find 
new ways to communicate sociological research to broader audiences? We should 
work harder to provide information that is relevant to public deliberation about 
policy, and we must ask continually how our work might be shaped by, as well 
as how it contributes to, existing conflicts over power and resources. 

As educators, we play important roles in strengthening the public sphere by 
cultivating interest in the social world and by helping strengthen the knowledge 
and skills people have for being active and critical citizens. We must be more 
self-conscious about this role and more active in helping to promote life-long 
learning in our communities. From our campuses, we can work to both expand 
the audience we reach and find ways to reinforce ideas about global solidarity and 
pluralism through the material we teach. By engaging our students in discussions 
about the benefits and challenges of pluralism and the ways different societies 
have managed differences among diverse groups, we can help foster the kind of 
culture required for a more democratic world order.² 

As a scholar of social movements, I am sometimes challenged (usually when I 
venture outside of sociological and movement circles) for being “biased” because 
my analyses explicitly aim to strengthen movements I study. The unspoken 
assumption of those challenging my scholarly integrity is that their own work 
is unbiased. But what is often the case is that their work does not challenge 
predominant assumptions about power, and therefore its potential bias is not 
questioned. Given the unprecedented inequities of the global system and the 
complex ways that power relations are reproduced, it is vital that sociologists 
take sides in an effort to alter the balance of power in favor of civil society. But 
that does not mean that we align ourselves uncritically with particular agents 
of change. 

In case it needs restating, we must debunk the myth of social science objectivity. 
Analysts must recognize their own role within existing power relations and make 
explicit the values and preferences that guide their own research agendas. When 
these values are acknowledged, both the researcher and her readers can better 
evaluate how these preferences may have influenced the analysis or conclusions 
of the study. Social scientists cannot remove themselves from the world, and we 
should not pretend that our research methods provide us with untainted and 
somehow comprehensive understandings of complex social reality. We are shaped 
by our cultures, by our own national political systems, and by the discourses 
and information available in the social contexts in which we live and work. Our 
disciplines can operate much like religious belief systems, and we accept certain 
ideas about what evidence and research methods are acceptable mainly on faith 
rather than on a serious effort to analyze critically the merits and limitations of 
different approaches. In short, the social contexts in which we operate strongly 
influence both the choices we make about what to investigate and how we do so. 
If we fail to appreciate how these contexts shape our analyses, our research will 
have limited potential for helping improve the human condition. 

While always striving for intellectual rigor in our work, social scientists should 
have a personal concern for the issues they research. How, for instance, can we 
continue to study rising inequality without wanting to know more about what 
causes it or how it can be remedied? Similarly, how can we study the causes of 
wars without wanting to find ways to use that knowledge to prevent future wars? 
Social science cannot prevent values or preferences from shaping our choices of 
research questions, but it does provide guidance that can limit the potential for 
bias. 

A growing number of activists have extensive training in social science, and 
more scholars are seeking ways to form relationships with movements that 
allow them to maintain their professional identities while bringing particular 
resources into the movement. For instance, the 2005 meeting of the World 
Social Forum generated more than 40 proposals for workshops focusing on 
relationships between universities and social change advocates, and Sociologists 
Without Borders is participating in the process of dialogue and coordination 
among these groups. While I don’t have an answer to the question of what is 
the proper relationship between scholars and movements, I do know that broad, 
international dialogue is needed in order to articulate such a relationship in the 
complex and global system. 

Sociologists and other analysts concerned with justice and equity should be 
up front about the need to understand how those with power keep it and how 
those without power might be given a stake in the system. To do this we also 
need to scrutinize another disciplinary norm. Our methodological preference 
for analyzing “what is” or “what was” steers us away from questions about “what 
could be.” We lack a strong tradition of using our social scientific skills to explore 
the “sociology of the nonexistent” (e.g., see de Sousa Santos 2004; Korzeniewicz 
and Smith 2000:27). If, as the spirit of Porto Alegre suggests, “another world is 
possible,” what can sociology tell us about what such a world might look like 
and how we might get there? 

During last August’s American Sociological Association (ASA) meeting in San 
Francisco, many observed that our profession’s incentive structures discourage 
critical and engaged sociological work. We must work to identify how both our 
discipline and higher education systems generally constrain possibilities for 
doing socially and politically relevant work. And we must find ways to transform 
these arrangements so that we can nurture possibilities for public sociology. This 
struggle will take us well beyond the classroom and the ASA, but it is crucial to 
the survival of both our profession and democracy. 


Acting for Change 


If our analysis of the contemporary global situation leads us to conclude that 
there is an urgent need for major social transformation, then how can we as 
individuals and as sociologists be part of a broader process of change? How 
can we promote “decommodification,” which Wallerstein rightly regards as an 
important step to a better world? Along with Peter Waterman and Jill Timms, 
I think the more effective approach for critical theorists and for international 
labor is to emphasize the empowerment of civil society over the abolition or 
transformation of capitalism, or global society over world system. While the two 
complement each other, too often the struggle against capital has alienated key 
allies and disempowered those it has sought to liberate. We need a new approach, 
and the contemporary movement for global justice provides opportunities and 
ideas in this regard. 

Walden Bello, a sociologist at the University of the Philippines and a major 
figure in the movement for global justice, offers some of the most insightful 
and clear discussion of how this might be done (2003). The key failures of 
the contemporary global economy, according to Bello, are its subordination of 
society to markets and its failure to tolerate and profit from diversity. Following 
Polanyi, he calls for efforts to re-embed economy in society so that the values of 
security, equity, and social solidarity take precedence over profit maximization. 
Drawing from vast experience in social movements and from his own careful 
and expansive analyses, he has a two-pronged proposal for helping advance this 
agenda, which he labels “deglobalization.” 

Deglobalization reflects much of what the spirit of Porto Alegre is about. It is 
not about withdrawing from the international economy, but rather it celebrates 
global ties while seeking to expand the freedom and choice local communities 
have about how they will organize their economic lives. In other words, it fosters 
global solidarity and pluralism. But such a re-orientation cannot happen within 
the current system of economic totalitarianism, which denies alternative economic 
models the resources and protection they need to survive. Thus, Bello argues 
that we need simultaneous efforts to “deconstruct” the existing institutions that 
support corporate globalization and to “reconstruct” new ways of organizing 
economic life around the core organizing principle of tolerance of diversity 
(2003:113-14). 

The global economy is a force that centralizes and homogenizes. It must 
constantly expand in order to survive. Deconstructing this system will obviously 
be an immense struggle, but as the work of Wallerstein and others shows, it stands 
on shaky foundations. Bello argues that by continuing to disrupt meetings of the 
global financial institutions and by increasing direct action aimed at obstructing 
the operations of major corporations and governments that advance the corporate 
globalization agenda (particularly those companies tied to the Iraq occupation 
and to the broader Middle East conflict), social movements can affect two of 
the major cleavages Wallerstein identifies (among competitors in the core and 
between the North and South) and help roll back the global capitalist order and 
open up spaces for alternatives. 

To strengthen civil societies, Bello calls for what is essentially the democratization 
of the global economy. Deglobalization involves: 


* Reducing dependence on foreign investment and foreign financial 
markets by increasing reliance on locally available resources wherever 
possible; 


* Redistributing income and land to create the financial resources for 
investment; 


* De-emphasizing growth and maximizing equity in economic policy; 


* Abandoning market governance in favor of more democratic forms of 
economic decision making; 


* Subjecting the private sector and the state to constant monitoring by 
civil society; 

* Reorienting production to favor a diverse mix of local and national 
producers over remote ones; 


* Encouraging subsidiarity in our economies so that the production of 
goods takes place at the community and national level wherever possible 
(2003:113-14). 

Deglobalization would redistribute economic power and thereby make economic 
decision making more inclusive, diverse, and responsive to local needs. 

Deglobalization encourages activities that many activists around the world 
are already doing and that are celebrated in spaces like the World Social Forum 
and its regional and local counterparts. Economic direct action schemes can help 
create more participatory economies, where people become more conscious of 
their roles as economic actors. Instead of allowing global markets and actors to 
determine what kinds of economic development will take place in a region or 
locale, citizens combine their efforts to define their own economic options. The 
aim is to encourage people to think about the economy in ways that empower 
them rather than perpetuate dependence on corporations and remote sources of 
finance. Examples include community-supported agriculture 
(CSAs), local currencies, fair trade, “do it yourself” (DIY) and voluntary simplicity, 
and “culture jamming” to challenge the ideology of consumerism. By working to 
advance these and other alternatives, social movements (and the sociologists who 
work with them) contribute to local empowerment and pluralism in the world 
economy. Sociologists can play a particular role in helping foster global solidarity 
by helping people understand how such local initiatives are related to a broader, 
global movement. Without such solidarity, local initiatives can become insular 
and defensive rather than inclusive and mutually supportive. 

Deglobalization is not “antiglobalization.” Instead, it requires a global order 
(solidarity) that encourages and protects diversity and pluralism. Is there hope 
for such a system? Not without prior efforts to deconstruct the existing, oligarchic 
order. But there are signs of hope in two recent United Nations reports. The 
High Level Panel on Threats, Challenges and Change stressed the need to make 
the UN more representative by making the Security Council more inclusive 
and by subjecting its decisions to more rigorous scrutiny by the international 
community. Although we might have wished for more far-reaching proposals, 
the report provides an opportunity to press demands for greater pluralism and 
accountability in the global polity. The second report, by the Eminent Persons 
Panel on UN-Civil Society relations (headed by sociologist and former Brazilian 
President Fernando Henrique Cardoso), provides even more support for those 
advocating for more representative and inclusive international institutions. That 
report adopts some fundamentally different ideas about governance, calling on 
the UN to help facilitate more equitable participation in international affairs by 
both civil society actors and national legislators.³ 

Taking up proposals by the latter panel (which will happen only if there is 
widespread popular mobilization to encourage it) would make for a far more 
inclusive, representative, and responsive UN than we now have. It would also 
decrease the likelihood of future unilateralist actions such as the U.S. invasion 
of Iraq. But we need more intellectual leadership to enhance public discourse 
about multilateralism and to debunk campaigns by right-wing media and U.S. 
leadership aimed at demonizing global institutions. Efforts by social scientists are 
crucial here to help encourage more global awareness among the U.S. public and 
to demonstrate the connections between the values and ideals of democracy and 
the multilateral practice of democracy in the broader world community. 

Sociologists can play important roles in supporting civil society at local, 
national, and global levels. We can help nurture critical perspectives on power 
while helping articulate new understandings of human identities in an increasingly 
interconnected world. We need to be self-conscious about our role in helping 
people view themselves as part of a wider, international community. We also need 
to be part of an effort to foster appreciation for and tolerance of diversity. By 
supporting initiatives to expand participation in local and national economies, 
and by encouraging more inclusive and accountable multilateral institutions, 
sociologists can help advance a world where the economy is a means and not 
an end of social life. 


Notes 


1. I am grateful to my colleague Timothy P. Moran for sharing this observation. 


2. The Human Development Report 2004: Cultural Liberty in Today's Diverse World can be a 
useful resource for this kind of discussion. 


3. The report, along with civil society groups' responses and critiques of it, can be found at 
http://www.un-ngls.org/UNreform.htm. 


Book Reviews 


Interaction Ritual Chains. By Randall Collins. Princeton University Press, 2004. 
439 pp. 


GARY ALAN FINE, Northwestern University 


No sociologist has been as generous as Erving Goffman. Tossing concepts like a 
tipsy Krewe chief at Mardi Gras, laden with baubles, Goffman gifted us a theoretical 
treasure trove. For most of our giant theorists, the number of important and lasting 
constructs can be counted on the fingers of one hand, but with Goffman even 
neophyte graduate students must use their toes. 

Goffman was not, of course, a systematizer, but a wondrous voyeur. He 
specialized in analysis, not synthesis, spawning centrifugal ideas for others to 
grasp. And so we have frame analysis, impression management, tie-signs, stigma, 
the interaction order, backstage, and total institutions. 

For nearly a quarter century, amidst a set of magisterial projects, Randall 
Collins has been attempting to develop and systematize Goffman's concept of 
interaction ritual, the title of a 1967 book of essays, although the genesis of the idea 
dates back a decade further. Collins, along with his nonidentical twin, Jonathan 
Turner, has striven to develop a microsociology that owes little to the social 
psychological tradition in the discipline. This project underlines a division that 
reminds us that forces impinging on selves (and their associated bodies) can be 
separated from a focus on forces generated from situations. 

Interaction Ritual Chains is both a serious and essential attempt to connect the 
Goffmanian and Durkheimian projects so as to make clear their relevance to the 
understanding of the continuity and stability of social structure. If the manuscript 
doesn't quite succeed in its stated goal of creating a "full-scale social psychology," 
it does lay out a determined groundwork for an ambitious microsociology that 
strives to link cognitions and emotions to social structure through situated action. 
Simply put, Collins argues that interaction rituals produce emotional energy, the 
gathering of which is a central motivating force for individuals. Affect is the engine 
of social order. Those interaction rituals that are most effective in generating 
emotional energy are the ones that bolster institutional stability. We seek emotional 
energy the way that felines seek catnip—it gives us a buzz. Collins stands athwart 
the cognitive turn in social psychology, finding affect where others find thought, 
and indeed valuably devotes a chapter to demonstrating the various ways in which 
thinking must be linked to emotional entrainment. 

This theory depends on a clear definition of interaction ritual, and it is here 
that the theory departs from Goffman's model—while using his terminology— 
muddling our mental waters. (Although Goffman's collection of essays is entitled 
Interaction Ritual, he doesn't define the term.) Surely one might naively suppose 
that a theory of interaction ritual would depend upon routines by which individual 
actions are channeled by historical expectations, that ritual should be a distinctly 
recognizable, bounded subclass of the vast bramble of potential action. Collins 
begins by presenting a definition of (interaction) ritual grounded in Durkheim 
and Goffman: “a mechanism of mutually focused emotion and attention producing 
a momentarily shared reality, which thereby generates solidarity and symbols of 
group membership.” One might have wished that Collins had suggested “emotion or 
attention,” avoiding the trap of finding emotion everywhere, watering down affect 
into a rather thin gruel. However, what is most challenging in Collins’ analysis is his 
neglect in practice of the final phrase of his definition: an interaction that generates 
solidarity and symbols of group membership. Taken seriously this would permit us 
to set aside much routine interaction. Only in the sense that interaction by creating 
an awareness of others must necessarily generate solidarity and microculture does 
the vast domain of interaction do this. Collins admits to being “one of the worst 
of sinners,” finding ritual everywhere. In doing this, he creates a highly general 
theory, but at the cost of specifying ritual as a meaningful concept. We could excise 
“ritual chains,” leaving a theory of “interaction,” and get it about right. 

One of the elements that is lost here, and indeed in so much contemporary 
microsociology, is the reality of the group. It is not fair to suggest that Collins 
does not mention the group—he does—but he does not theorize the group. Yet, if 
ritual depends upon group symbols, the group as a unit, and not just as a human 
manifestation of the situation, must be accounted for. It is the group that embraces 
the history from which rituals derive and which is regenerated by the creation of 
symbols of social solidarity. The meso-level concept of the group is the glue that 
connects action and organization. Interaction ritual chains are constitutive less of 
situations than of those groups that provide the arenas where the action is. 

Readers who are not directly concerned with the challenges of theory in creating 
a microsociology will most appreciate the two most empirical chapters—on sexual 
intimacy as interaction ritual and on tobacco ritual (Collins also adds less empirical 
chapters on the relation of IR theory to stratification and individualism). Of the 
two, the analysis of tobacco ritual chains is the more compelling. (It would be hard 
to imagine a better Goffman-inspired chapter on "love" than the opening chapters 
of Murray Davis's Smut.) In examining the development of acts of smoking, 
he depicts behaviors that are often precisely public and group rituals. Collins 
demonstrates that tobacco rituals (and our now common antirituals) are grounded 
in the establishment of group relations and the pumping of emotional fulfillment. 
A good cigar is a smoke—and much more. 

A book of the brawny power of Interaction Ritual Chains deserves a review 
of more than a scant 800 words, barely a word for each of Collins's arguments. 
By treating microsociology as a domain for theoretical systematization, Randall 
Collins takes life seriously, building our search for solidarity. What better gift can 
we idea-starved ritual actors have than this? 


Social Movements and Networks: Relational Approaches to Collective Action. 
Edited by Mario Diani and Doug McAdam. Oxford University Press, 2003. 348 
pp. Cloth, $70.00. Paper, $29.95. 


JOHN D. MCCARTHY, Pennsylvania State University 


“The structural research program on social movements has produced several 
consistent empirical findings. However, the meaning of these findings, and more 
importantly, the actual social processes that account for them remain opaque.” So 
says Doug McAdam in his concluding essay to this rich and rewarding volume that 
brings together 13 original papers in an effort to integrate more systematically the 
parallel work of social movement scholars with that of the broader community 
of structuralist social network researchers. The empirical regularities to which 
McAdam refers include the centrality of network connections between persons, 
positions, and organizations to individual recruitment to movements and collective 
mobilization as well as to the diffusion of movement processes across social and 
geographical space. 

The opaqueness of the social processes underlying movement network 
regularities, however, is a more general problem characteristic of much of the 
work done by researchers in the broader social network research community. 
Information, influence, resources, and affect are thought to flow through dyadic 
network ties and then outward through the more extensive aggregate grids of 
relationships within which they are embedded. Yet many network structuralists—no 
matter their substantive focus, be it, for instance, corporate interlocks, ties between 
nations, or common attendance at feminist conferences—also ignore the actual 
content of network connections. Without systematic knowledge of that content, 
underlying social processes remain partially obscure. As a consequence, network 
scholars must depend heavily upon a sophisticated theoretical superstructure in 
order to make sense of the regularities that link patterns of network connections 
to individual and collective behavior. And what is the case in general of structural 
network analyses is also true of many of the papers in this volume. 

In spite of the theoretical robustness of the network paradigm, efforts to apply 
it to social movement processes, mirroring the broader social network literature, 
have tended to either narrow into deep analyses of the network connections of 
individual participants or to broaden toward consideration of the structural 
complexity of expanded network structures. The former approach provides rich 
detail about network connections while remaining mostly blind to the structural 
complexities of the larger network structures within which they are embedded. 
The strong chapters by Florence Passy, who analyzes individual participation 
by members of the Bern Declaration and the World Wildlife Fund, and by Ann 
Mische, who explores conversational mechanisms employed by Brazilian activists 
as they negotiate their complex social networks, illustrate both the strengths and 
weaknesses of a deep analysis of network connections. 

The latter approach, mapping the elaborate structure of interconnections 
between multiple network nodes, whether individuals, institutional actors, or 
collective actors, is the one taken by the authors of the other substantive chapters 
in this volume. These include Maryjane Osa's mapping of the structure of Polish 
dissident organizations' interconnections through the overlapping memberships 
of individual activists during protest waves in the late 1960s and late 1970s, 
Helmut Anheier's demonstration of the important role of local activists’ network 
embeddedness for early organizational development of the German National 
Socialist Party, and Christopher Ansell’s analysis of the collaborative structure 
of relations between contemporary environmental organizations in the San 
Francisco Bay Area. And while each of these papers makes a strong contribution to 
understanding how the structure of networks of activists and their organizations 
is patterned, the nature of the network ties they study remains, for the most part, 
opaque. 

The typical analytic tradeoff between a deep focus upon network connections 
and a comprehensive look at network structure is, of course, importantly the result 
of research design choices. Gathering data on far-flung networks of activists and 
social movement organizations as well as the ties that bind them together is severely 
constrained by prohibitive costs, so it is not surprising that such studies are rare. 
Mario Diani's widely ignored gem, Green Networks (Edinburgh University Press), 
is a singular exception. 

Another, less costly, analytic strategy for maximizing both scope and depth in 
accounting for network regularities in social movement analysis is to model network 
recruitment and diffusion, and the two papers in the volume that take this approach 
are, together, worth the price of admission. Pam Oliver and Dan Myers’ “Network 
Diffusion and Cycles of Collective Action” places a series of assumptions about the 
role of mass media at the center of models of waves of collective action, while Roger 
Gould's "Why Do Networks Matter?" aims to unpack the simultaneous effects of 
network structures and the content of individual connectedness in recruitment to 
activism. Each of their complicated arguments rests upon clearly stated assumptions 
about network structures and social influence, and they converge in drawing 
attention to both the importance and theoretical difficulty of accounting for what 
Gould terms “first movers”—organizers who begin recruitment chains—and the 
phenomenon of “hidden organizing,” which Oliver and Myers suggest can account 
for spikes in protest waves. 

A concluding essay by Mario Diani sketches the outlines of a more elaborated 
network research agenda for social movement scholars from the vantage point of 
the broader structural network paradigm, and Doug McAdam’s concluding essay 
is a largely successful effort to provide theoretical accounts of the mechanisms 
underlying the centrality of network connections between persons, positions, and 
organizations to both individual recruitment to movements and the diffusion 
of movement processes across social and geographical space. This is a far more 
coherent volume than typically results from a scholarly conference, and the papers, 
together, demonstrate the continued fruitfulness of a social network perspective. 


Collective Preferences in Democratic Politics: Opinion Surveys and the Will of 
the People. By Scott L. Althaus. Cambridge University Press, 2003. 370 pp. 


HOWARD SCHUMAN, University of Michigan 


What national surveys tell us about the nature of public opinion depends on 
whether we approach the data from a microscopic or a telescopic perspective. Those 
who look closely at individual responses, whether as interviewers posing questions 
or as investigators focusing on individual responses, are often struck by the 
ill-informed basis and unreliability of responses (Converse 1964). Those who examine 
broad trends produced when answers are aggregated and charted are frequently 
impressed by change over time and across socioeconomic categories that appears 
to have considerable regularity and meaning (Page and Shapiro 1992). 

The two perspectives, microscopic and telescopic, can be reconciled by assuming 
that the noise inherent in responses at the individual level tends to cancel out, leaving 
a kind of collective rationality. However, Scott Althaus disputes the assumption that 
most individual response error is offsetting and that aggregation simply eliminates 
random error. He argues instead that survey results are seriously influenced by what 
he calls “information effects”: “bias in the shape of collective opinion caused by the 
low levels and uneven social distribution of political knowledge in a population.” 
Specifically, the poor, the young, women, and African Americans disproportionately 
opt out of questions on political issues (i.e., say “Don’t Know”), and when they do 
offer opinions they do not, Althaus believes, express their own predispositions as 
they would if better informed. Thus survey results fail to represent adequately the 
attitudes of the total public and its important parts. 

Althaus’s book is largely an attempt to estimate how aggregate opinion would 
appear if all relevant social groups were optimally informed about politics. 
Extending approaches developed by Larry Bartels and by Delli Carpini and Keeter, 
he employs logistic regression to estimate what “fully informed opinion” would 
look like “by assigning the distribution of preferences held by the more highly 
informed members of a given demographic group to all members of that group, 
simultaneously taking into account the influence of a wide range of demographic 
variables.” The measures of political information that Althaus relies on consist of 
straightforward factual items, such as asking respondents what office is held by 
William Rehnquist and which party controls the Senate. Of course, estimates derived 
on this basis do not guarantee that ill-informed respondents would actually offer 
the simulated responses if they were themselves well-informed, but the method 
does provide a plausible set of hypothetical answers. 
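
The imputation idea described above can be illustrated with a deliberately simplified sketch. It is not Althaus's procedure: it swaps his logistic regression for a plain within-group average, and the field names (`group`, `informed`, `support`) and the sample data are hypothetical.

```python
from collections import defaultdict


def surveyed_and_informed_support(respondents):
    """Return (surveyed share, simulated "fully informed" share) supporting a policy.

    Each respondent is a dict with hypothetical keys:
      'group'    - demographic group label
      'informed' - True if the respondent scores high on factual knowledge items
      'support'  - True/False opinion given in the survey
    """
    by_group = defaultdict(list)
    for r in respondents:
        by_group[r["group"]].append(r)

    n = len(respondents)
    surveyed = sum(r["support"] for r in respondents) / n

    simulated = 0.0
    for members in by_group.values():
        informed = [r for r in members if r["informed"]]
        # Assumption: if a group has no well-informed member, keep its surveyed share.
        reference = informed if informed else members
        share = sum(r["support"] for r in reference) / len(reference)
        # Assign the informed members' preference distribution to the whole group.
        simulated += share * len(members) / n
    return surveyed, simulated


# Hypothetical data: two demographic groups of four respondents each.
sample = (
    [{"group": "A", "informed": True, "support": True}] * 2
    + [{"group": "A", "informed": False, "support": False}] * 2
    + [{"group": "B", "informed": True, "support": True}]
    + [{"group": "B", "informed": True, "support": False}]
    + [{"group": "B", "informed": False, "support": True}] * 2
)
surveyed, simulated = surveyed_and_informed_support(sample)
```

The gap between the two returned shares plays the role of an "information effect" in this toy setting; a regression-based version would additionally control for other demographic variables, as the book does.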

The core of the book is a comparison of distributions of “fully informed” 
opinion with the actual distributions produced by the same surveys. Althaus first 
shows that, of 235 questions drawn from recent National Election Studies, nearly 
7% show an average change in marginals when “fully informed” and surveyed 
opinion are compared. He also finds that this change in marginals is not due much 
to the effects of item nonresponse (DK answers), but rather to different patterns 
of answering by respondents with different levels of information. However, small 
marginal differences are not so telling in themselves, since we already know that 
changes in wording and in other aspects of surveys can shift marginals substantially, 
and particular survey items should not be treated as having quasi-official standing 
as referenda. 

More interesting than changes in item marginals are specific analyses. 
For example, in one case Althaus shows that estimating the effects of greater 
information makes little difference in the attitudes of men toward abortion rights, 
but it increases support by women quite substantially and changes conclusions 
about gender differences. There are a number of other useful analyses reported 
throughout the book, and some topic areas are shown to be more susceptible 
to information effects than others. At the same time, the author's careful search 
for theoretical generalizations about where and why important differences in 
information levels occur does not bear much fruit. In this sense, identifying 
information effects becomes another analytic tool we can use in understanding 
data, not unlike important variables such as education. Indeed, it turns out that 
differences in responses between the most knowledgeable and least knowledgeable 
quartiles on the indices of political information approximate the information effects 
discovered with more complex regression estimates, and the author suggests that 
such comparisons might be used in ordinary surveys where more technical and 
time-consuming methods are impractical. 

Collective Preferences in Democratic Politics is a densely argued book and likely 
to be of interest primarily to those making serious use of survey data to understand 
public attitudes and beliefs. The author deals thoughtfully with many issues now 
being debated by political scientists and sociologists, and he draws on an unusually 
wide range of literature across the social sciences. He also ends his book with a 
stimulating discussion of the relation of public opinion in the form of poll numbers 
to classic conceptions of public opinion as popular sovereignty. 


What is Sexual Harassment? From Capitol Hill to the Sorbonne. By Abigail 
Saguy. University of California Press, 2003. 235 pp. 


ANNA-MARIA MARSHALL, University of Illinois—Urbana-Champaign 


What is the cultural life of a legal category? This important question in sociology 
of law moves researchers beyond the boundaries of formal legal institutions and 
actors and recognizes that law shapes the way we organize our families, workplaces, 
neighborhoods, and even our thoughts. Abigail Saguy’s analysis of the cultural 
significance of sexual harassment in France and the United States makes an 
important contribution to this research tradition. 

Sexual harassment is a legal concept that has spilled out of the courtroom and 
into workplaces, the mass media, politics, and the consciousness of ordinary men 
and women all over the world. While the concept may be familiar, it has distinct 
cultural meanings in the political and legal traditions of different countries. In the 
United States, sexual harassment is a form of sex discrimination that interferes with 
women's equal employment opportunities. In the 1970s, when the claim for sexual 
harassment first emerged, lawyers drew on Title VII of the Civil Rights Act of 1964, 
one of the great achievements of the civil rights movement. The legal prohibition 
against sexual harassment followed a much different trajectory in France. While 
France had no civil rights movement, French workers have enjoyed expansive legal 
protection in the workplace. Given these political commitments, French feminists 
and activists chose to emphasize the abuse of power, framing sexual harassment 
as a form of sexual violence, akin to rape. As a result, sexual harassment has been 
criminalized in France. Harassers can be prosecuted, but employers are not legally 
responsible for such conduct. 

Illustrating how the law in action diverges from the law on the books, Saguy's 
comparative method provides unique insights into the mechanisms responsible for 
this divergence. In the United States, employers’ fear of liability for sexual harassment 
is the engine that drives corporate policy-making and law enforcement on this issue. 
Structural features of the American legal system—such as damage awards and 
contingency fee arrangements—make these lawsuits possible. Saguy also offers a 
cultural explanation for the lawsuits. She argues that mass-media reports about large 
damage awards in sexual harassment cases encourage victims to file lawsuits. In her 
account, personnel managers promptly handle the few grievances they receive, and 
plaintiffs’ lawyers often have to turn away women because their complaints are not 
serious enough to constitute sexual harassment. By contrast, French employers do 
not take sexual harassment seriously. They view sexual harassment as a puritanical 
import from the United States. French employers have not adopted policies, nor 
are they particularly interested in handling employee complaints. Saguy argues that 
this corporate indifference also has structural and cultural explanations. First, the 
criminal penalties for harassment make women reluctant to complain. In addition, 
French law makes damage awards unlikely. Finally, Saguy claims that French victims 
of sexual harassment are not likely to file lawsuits because of “the shame and guilt 
most victims feel about the harassment.” 

Saguy’s evidence of victims’ motivations, however, is limited to the observations 
of human resource managers and attorneys, who are probably biased informants. 
Moreover, French and American women may not be as different on this dimension 
as Saguy suggests. An ample literature on sexual harassment in the United States— 
not cited in this study—has shown that American women also feel shame and guilt 
about their experiences. As a result, women in the United States rarely complain 
to anyone about sexual harassment—let alone pursue a formal legal complaint 
(Fitzgerald, Swan, and Fischer 1995; MSPB 1995; Welsh 1995). Indeed, interviews 
with French and American victims of sexual harassment were outside the scope 
of this already extensive study, but their omission should have made the author 
slightly more cautious about the conclusions she drew from her data. 


1294 / Social Forces 83:3, March 2005 


The law in action mirrors widespread cultural beliefs about sexual harassment 
in the United States and France. Saguy's cultural analysis is based on a content 
analysis of press reports about sexual harassment and interviews with “cultural 
entrepreneurs”—individuals “who, through their jobs or volunteer activity, are 
likely to have a particular impact on the conceptualization of this social problem.” 
Saguy shows that while sexual harassment may be a serious social problem in the 
American workplace, the antidiscrimination frame limits the applicability of the 
concept to other arenas, such as public spaces or housing. Moreover, press coverage 
of the issue has changed as political scandals in the United States have unfolded. At 
first portraying sexual harassment as a serious problem for women, the press has 
begun to question seeming excesses of the legal rules. Also dominated by American 
sex scandals, French media coverage of sexual harassment emphasizes American 
prudishness rather than social problems faced by women. But the French press is 
more sympathetic to French women, depicting victims as having suffered from 
sexual violence and affronts to their dignity. 

Saguy's approach has several notable strengths. Her comparative methodology 
vividly illustrates that law is not an exogenous force imported (or exported) by a 
nation. Rather, it emerges from the cultural meanings prevalent in the society at 
the time. In turn, those cultural meanings reflect legal rules. Saguy's comparative 
analysis thus demonstrates the constitutive relationship between law and culture. 
In addition, she shows that legal rules do not establish sterile, reified categories. 
Instead, laws—and the meaning of those laws—are enacted and constructed by 
actors such as political activists, personnel managers, journalists, and the victims 
themselves. 


Poor Kids in a Rich Country: America's Children in Comparative Perspective. 
By Lee Rainwater and Timothy M. Smeeding. Russell Sage Foundation, 2003. 263 
pp. Cloth, $39.95. 


DANIEL T. LICHTER, Ohio State University 


In Poor Kids in a Rich Country, Lee Rainwater and Timothy Smeeding have solidified 
their reputations as America’s foremost international poverty researchers. This short 
book has a straightforward and important goal: to place the poverty rates of U.S. 
children in comparative perspective. This is accomplished nicely using comparable 
cross-sectional poverty data from the Luxembourg Income Study (LIS) for the 
United States, Canada, Australia, and 12 Western European countries. 

This is mostly a nontechnical book aimed at the broadest possible general 
audience. There are no weighty or arcane theoretical debates regarding the various 
causes or consequences of child poverty, either here or abroad. The empirical 
approach is straightforward and compelling, but without pretension regarding 
measurement and the kinds of statistical methods used. The first part of the book 
includes eight chapters that provide comparative information about relative 
poverty rates, the role of demographics, income packaging (e.g., earnings and 
government assistance), and poverty policy. The second part of the book includes 
three additional chapters aimed at more technically sophisticated readers interested 
in learning how poverty lines are established and how countries with different 
currencies, different consumption patterns, and different purchasing power can be 
compared. In some ways, these chapters might be considered technical appendices. 
They are very nicely done. 

The best thing about this book is its accessibility and its mostly dispassionate 
discussion of results. Rainwater and Smeeding ask the most fundamental questions 
and then answer them in clever and compelling ways. How does child poverty in 
the United States—the world's richest country—compare with poverty rates in 14 
other rich Western industrialized countries? Using one-half the median equivalent 
income as a relative measure of poverty (which they defend in the opening chapter), 
Rainwater and Smeeding report a child poverty rate of 20.3% in the United 
States in 1997. This is higher than in any other country. Sweden's poverty rate, by 
comparison, is only 2.4%. The U.S. also has the highest percentage of high-income 
children (defined as one and one-half times the median equivalent income) and 
a comparatively small middle-income population. Income inequality among U.S. 
children is extreme. 
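
As a rough illustration of the relative measure described above, the sketch below computes the share of children whose equivalent income falls under one-half of the median. The function name, the strict less-than threshold, the choice to take the median over all persons, and the sample incomes are assumptions made for illustration, not details taken from the book or from the LIS.

```python
from statistics import median


def child_poverty_rate(all_incomes, child_incomes, fraction=0.5):
    """Share of children below `fraction` of the median equivalent income.

    all_incomes   - equivalent incomes used to locate the median (assumed: all persons)
    child_incomes - equivalent incomes of the households in which each child lives
    """
    poverty_line = fraction * median(all_incomes)
    # Assumption: a child exactly at the line is not counted as poor.
    poor = sum(1 for income in child_incomes if income < poverty_line)
    return poor / len(child_incomes)


# Hypothetical incomes in arbitrary units; the median is 40, so the line is 20.
all_incomes = [10, 20, 30, 40, 50, 60, 70]
child_incomes = [10, 15, 20, 30, 45]
rate = child_poverty_rate(all_incomes, child_incomes)
```

Because the line moves with the median, this kind of measure captures a child's position relative to the typical standard of living in a country rather than an absolute level of deprivation.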

Compared with children in other countries, low-income children in the 
United States have the largest percentage below poverty (50%). U.S. low-income 
children are more deeply in poverty than children elsewhere, and they are relatively 
disadvantaged in comparison to America’s adult population. By benchmarking 
poverty rates to the gross domestic product, the authors also show that the “average 
child” in the U.S. is poorer than the average child in 12 of the 14 comparison 
countries. They dispute the usual claim that America has more inequality but that 
the average well-being is nevertheless higher than in other countries. This turns out 
not to be the case. 

Critics often point to the fact that the United States population of children is 
more heterogeneous and at greater risk of poverty (e.g., more female heads and 
fewer adult workers in the household). But Rainwater and Smeeding anticipate and 
then dispel this argument; demographic differences between countries are small 
and do not explain the higher poverty in the United States. Perhaps surprisingly, 
differences in the number of earners are not the explanation for higher child poverty 
rates in the United States. The U.S. and Sweden have among the lowest percentages 
of poor children’s families with no earners (about 25%). But the poverty rate for 
America's children in these families is 87%, compared with only 16% in Sweden. By 
comparison, 67% of Belgium's poor children live with no earners, yet only 32% of 
all children in nonearning families are poor. Clearly, the issue in America is not the 
lack of working parents but remuneration from work and the effectiveness of the 
welfare safety net. Rainwater and Smeeding demonstrate this in separate chapters 
on income packaging among children in two-parent and one-parent families. 

These are rather depressing conclusions that speak volumes about the innocent 
casualties of our competitive free enterprise system, America's welfare safety net, 
and the political will of average Americans and the politicians who represent them 
to change things for the better. 

What can be done? Rainwater and Smeeding outline the usual strategies. For 
example, every household should have at least one earner (hopefully with a good 
job). They argue for more work supports (parental leave, childcare subsidies, and 
child support) and for better tax and education policies. But in the end Rainwater 
and Smeeding conclude that none of these policies will work unless the policies are 
consistent with American cultural values (e.g., hard work). Even more important, 
a rich nation like the United States must find the political will to change things. 
America’s apparent indifference toward its children, especially its poorest children, 
contrasts sharply with the United Kingdom, where Prime Minister Tony Blair 
has committed $90 billion toward programs aimed at eliminating child poverty 
altogether. 

There is much to like about this book. It is well written, interesting, and it 
largely achieves its stated objectives. It is also accessible to policy makers, and it 
would be useful in undergraduate and graduate courses in poverty and public 
policy, social stratification, or inequality. Poor Kids in a Rich Country also provides 
good advertising for a rich comparative data set on income—the LIS. These data 
are available online at http://www.lisproject.org/. I recommend reading the book 
and then building on its main conclusions by downloading and analyzing the data 
yourself. 


We Are Not Babysitters: Family Child Care Providers Redefine Work and Care. 
By Mary C. Tuominen. Rutgers University Press, 2003. 206 pp. 


LYNN MAY RIVAS, University of California, Berkeley 


In We Are Not Babysitters: Family Child Care Providers Redefine Work and Care, 
Tuominen demonstrates how race and gender operate on a structural level to 
produce low wages for family child care providers. This book is an important 
contribution to the literature on women’s low-wage work and the definitive book 
on family child care. However, my advice to prospective readers is to skip the first 
chapter, which distracts from an otherwise excellent book. 

Tuominen interviewed only 20 individuals. Yet she takes full advantage of the 
previous research on family child care and masterfully uses the extensive literature 
on women’s low-wage and caring labor to back up her argument. She starts with 
the daily lives of her interviewees, but rather than lingering over daily interactions 
she connects their everyday experiences with the broader social relations and social 
forces (political, cultural, and economic) that impact their lives and which they 
in turn impact. 

Through her attention to class and race, Tuominen once and for all lays to rest 
the notion dominant in the academic literature that working mothers struggle 
with an internalized ideal of motherhood that is incompatible with paid work. 
This is not only untrue for the majority of family child care providers, who are 
disproportionately women of color, but as Tuominen points out, this dilemma 
only ever existed for a specific segment of the population: middle-class white 
women. Immigrant women, women of color, and poor women have long had to 
work outside the home and have integrated employment into their conceptions of 
motherhood. This is important because the idea that family child care providers 
are motivated to do this kind of work in order to enact some idealized conception 
of the full-time mother is used against them to justify low wages. However, though 
the ideology of intensive mothering does not explain their entry into this work, 
Tuominen clearly shows that gender is a central factor organizing the provision of 
family child care. As she explains: “Because women bear primary responsibility for 
child care within families and structural supports for employed mothers are, at best, 
in short supply, women return to the home to provide paid child care when labor 
markets (organized in accordance with gendered and racial ethnic ideologies) fail 
to pay wages sufficient for families to afford reliable child care.” 

All the women in her study entered family child care in order to meet the social 
and economic needs of their families. However, Tuominen found variation in 
how they understood their choice based on differences in race/ethnicity, country 
of origin, and social class. Her findings, though interesting, should be considered 
tentative. Here Tuominen’s small sample is a liability. Tuominen stands on firm 
ground when she is able to use the literature to back up her findings, but in areas 
like these, where her findings are genuinely new, future research will be necessary 
to validate her claims. 

Social distinctions also determined her respondents’ clientele and level of 
compensation. Bounded by geography and use of primarily “word of mouth” 
recruitment strategies, social networks and neighborhoods ensure that family child 
care providers serve families similar to them in terms of racial ethnic and cultural 
identity as well as social class. Low-income providers serve low-income families who 
pay the lowest rates, trapping these family care providers in an economic situation 
that keeps them poor. The state, which subsidizes the care of many low-income 
families, could but does not ameliorate this situation by paying family child care 
providers a reasonable rate. 

Tuominen writes compellingly about the social forces and social relationships 
that impact the lives of family care providers. However, she is also interested in the 
agency of family care providers and the ways in which they push back. In particular, 
she writes a very interesting chapter about African American women who provide 
what Tuominen calls “Community Care Work.” For these women, providing care 
to other low-income women in their community is not simply the reproduction 
of economic disadvantage, but also an empowering decision to contribute to the 
economic advancement and survival of their community. 

This book seeks to bring family child care providers into the policy debates
about child care that are currently limited to the needs of children and parents. 
It makes a significant contribution to the literature on care work and shows how 
racialized and gendered processes result in the work being devalued. Tuominen 
argues persuasively about the value of this work to families and communities. 
Family child care providers do an important and difficult job and their pay should 
reflect this. 


A Diagnosis for Our Times: Alternative Health from Lifeworld to Politics. 
By Matthew Schneirov and Jonathan David Geczik. SUNY Press, 2003. 230 pp. 


PHIL BROWN, Brown University 


Schneirov and Geczik view alternative health as a significant force for overall social 
and cultural change, a counter-institutional challenge to the commodification and 
bureaucratization of life. The authors argue that there are commonalities of the 
alternative health paradigm that cross two networks they studied: (1) a holistic 
network that includes food co-ops and the Holistic Living Quest's combination 
of new age, Eastern, and social movement elements; and (2) a Christian network 
composed of the Committee for Freedom of Choice in Cancer Therapy, the Natural 
Living Group, and related organizations. Additional data came from a hospital- 
based alternative health clinic. The authors emphasize the social movement nature 
of alternative health, rather than simply lifestyle choices. They see this movement as 
transformative, though they admit it does not thoroughly bridge the gap between 
the "lifeworld" (as characterized by Habermas) and political challenge. 

There is some very good material on the key elements of alternative health. The 
interesting discussion of classic doctor-patient interaction helps pose alternative 
health as a challenger. The core concepts holding together so many disparate 
approaches and groups include an ecological view of the body where biology, 
emotion, belief, spirituality, and lifestyle are integrated. Furthermore, low-tech 
care, self-healing, a faith in past wisdom of ordinary people, and greater patient 
agency are integral factors. 

My main concern is that the book is driven more by theory than by data. The 
opening vignettes hint at a more ethnographic approach than we get. Too often 
interview and observational data are used to support theoretical arguments, and 
even then we don't get enough flavor of the people and groups. The theoretical 
material is sometimes quite interesting—for example, the authors view alternative 
health as a method of reenchanting the world, using a Weberian approach, but the 
book doesn't return to this later. Nor does the book return to much of the other 
theoretical material and thus doesn't tie things together in the end. Too many 
different theoretical perspectives are introduced, and it is not clear how they fit 
together. So the book feels top-heavy on theory compared to data and yet fails 
to make that theory increase our understanding of the real-life experiences of 
alternative health users and providers. 

The authors use Walter Benjamin's concept of “aura,” a reflective identification 
that people have with objects in the natural world that enables people to recover 
meaning. Schneirov and Geczik give a nod to the dangers of uncritical celebration 
of the past, and they emphasize the tension between aura and commodity but are 
ultimately too uncritical of the mystical elements. Overall, however, this concept 
doesn't get developed enough to make the reader feel comfortable with it. The 
authors criticize the marketing of alternative health products and services, noting 
the tension between aura and commodity, but they don't take this marketing on 
enough. This means, for example, that "past life regression" therapy is part of the 
same package of alternative health as organic foods and chiropractic. 

Schneirov and Geczik “conceptualize alternative health regimes as aesthetic 
practices whose goals is the creation of the perfect body in complete harmony with 
nature.” This notion stems from Foucault’s notion of recapturing knowledge to 
achieve liberation. I find this “artful” form of “self-perfection” rather at odds with 
the political-economic critique of medicine, even though the authors claim to be 
integrating that critique. 

To explain the social movement nature of alternative health, Schneirov and 
Geczik adopt a new social movement theory, emphasizing Melucci’s idea of 
“submerged” movements that lie beneath the surface of overt political action 
where people develop a shared collective identity. The authors then return to the 
two alternative health networks that opened the book and claim to discover a site 
where the two meet: a set of beliefs about personal responsibility, a critique of 
biomedicine, rejection of government authority, and anti-consumerism. But the 
meeting point is fragmented, and it seems unlikely that messianic conservative 
religion will find much in common with radical critics like women’s health and 
AIDS activists, who are not even discussed. 

I was bothered by the authors’ lack of a critical perspective on the religious 
right. If alternative health aims to provide people with personal liberation from 
authoritarian medical relationships and to give them agency in making choices, 
what are we to think of political ideologies that limit freedom? In particular, the 
religious right’s attack on reproductive rights would certainly be a major concern, 
yet there is no hint of this problem. 

Near the end, Schneirov and Geczik briefly note the significant research on 
social inequalities and health and realize that holistic health advocates fail to grasp 
that underlying concern. They also realize that a broader movement is necessary, 
including social medicine and environmental justice. But they don’t recognize that 
many alternative health elements, especially the fundamentalist religious ones, 
won't be along for this ride. 


A Population History of the United States. By Herbert S. Klein. Cambridge 
University Press, 2004. 316 pp. 


DENNIS HODGSON, Fairfield University 


In just 238 pages of text, Herbert S. Klein, the Gouverneur Morris Professor 
of History at Columbia University, offers an overview of U.S. demographic 
history from prehistory until today. In the Introduction, Klein defines basic 
demographic measures for the nonspecialist reader and describes the various 
demographic transitions that have transformed populations over the last several 
hundred years. The two chapters that follow are a high point of the volume as 
Klein presents a masterful survey of what is known about the demography of the 
Western Hemisphere, from the arrival of the first humans through colonization 
by Europeans. Klein clarifies differences in how the Portuguese, Spanish, and the 
British settled their New World colonies by examining their differing situations 
with respect to the availability of indigenous labor and the cost of imported 
European labor. He also uses this labor supply framework to explain where and 
when colonizing powers resorted to the use of African slaves. 

Klein's treatment of the actual population history of the United States, Chapters 
Three through Seven, is a more perfunctory examination of mortality, fertility, 
migration, population growth, and population distribution trends during five 
historic periods: 1776–1860, 1860–1914, 1914–1945, 1945–1980, and 1980–2003. 
In these chapters, Klein’s focus is narrowly quantitative, although he occasionally 
reviews competing explanations of particular trends, for example, why fertility 
started to decline so early in the U.S. He also includes in each chapter a comparison 
of U.S. trends with those occurring in European societies. He makes no attempt 
to treat the contemporary debates and policy initiatives that are associated with 
the trends he describes. For instance, Klein documents the dramatic decline in 
immigration associated with the passage of the National Quota Laws in the 1920s, 
but he does not discuss the immigration restriction movement or the racist and 
eugenic fears that produced laws that overtly discriminated against Southern and 
Eastern Europeans. Similarly, when describing fertility decline from 1860 to 1914 
he discusses the possible increased use of contraception and abortion but makes 
no mention of the contemporary legal initiatives (Comstock laws and state abortion 
laws) that limited access to these birth control methods. Of course, many readers 
will have the requisite knowledge to “fill-in” the missing history, but even Klein's 
limited goal of charting U.S. population trends would seem to call for some 
treatment of contemporary reactions to population trends, especially those that 
produced policy initiatives aimed at altering them. 

Admittedly there is a voluminous literature that narrowly focuses on accurately 
portraying U.S. population trends, and it has been over half a century since 
someone has attempted to summarize this body of work in a single volume. A 
one-volume overview of U.S. population trends would be an ideal main text for 
the courses in U.S. population history that are beginning to be offered and would 
be a very useful ancillary text for many courses in which a general knowledge of 
U.S. population trends is useful background information. Additionally, research on 
U.S. demographic trends is a highly specialized affair, and researchers tend to focus 
on a single demographic variable or a single time period. Reading a brief overview 
of U.S. trends would be an easy way for such specialists to become familiar with 
research on other demographic variables or other time periods, allowing them to 
better situate their work within a larger frame. That being said, I have to advise the 
reader that this particular one-volume overview of U.S. population trends contains 
more than a few errors of fact and interpretation. I hesitate to recommend it for 
the purposes mentioned above. 

Consider Klein's description of nineteenth-century African American fertility 
trends. He begins by misreading one source's rates of natural increase as crude birth 
rates, arriving at a profoundly incorrect assessment: “The relatively steady level of 
crude death rates for this population—which remained at around 30 per thousand 
resident population for the century—was matched by a very significant decline in 
the rates of reproduction, with the crude birth rate dropping from 22.7 to 14.0 in 
this same period.” In fact, the African American crude birth rate began the century 
at 53.1, not 22.7, and ended it at 43.8, not 14. Klein then repeats the same mistake 
when reading a table from a second source to arrive at an incorrect depiction of 
nineteenth-century racial fertility differentials: “Black fertility also declined faster 
than white fertility and remained below white fertility throughout this period.” 
African American fertility actually didn’t begin to decline until after 1880, much 
later than white fertility, and was considerably higher than white fertility. These 
assessments are clearly stated in the text of the source cited by Klein. Accuracy 
problems also plague Klein’s analysis of recent crude birth rate trends: “From 1980 
to 2000, the crude birth rate dropped from 24 per thousand resident population to 
just 15 per thousand.” The actual crude birth rate in 1980 was 15.9, not 24. Klein’s 
38% drop in crude birth rates simply didn’t happen, making his analysis of which 
groups were most responsible for it of doubtful relevance. Klein also incorrectly 
reports that while in the 1970s teenagers had the highest rates of births outside of 
marriage “by the end of the century, it was older women whose rate of illegitimacy 
was highest and rising.” In 1999, 78.6% of births to women aged 15-19 were to the 
unmarried, a considerably higher rate of illegitimacy than that of any other age 
group. There are also problems of interpretation. Klein claims that there has been 
“little change in the number of women going childless,” with a rate that “has stayed 
relatively steady since 1960 at roughly 15% to 16% for women who have reached 44 
years of age.” His source actually shows a rapid decline in childlessness from 15.1% 
in 1960 to 8.8% in 1975 and then an increase back up to 16.5% by the year 1995, 
a trend line that is not “steady.” Technical demographic measures are occasionally 
mishandled by Klein. He incorrectly defines life expectancy at birth as a median 
age of death: “when demographers say that life expectancy of a given population 
is 45 years of age, it means that half the population born in, say, 1850 will survive 
to the age of 45 in 1895.” Life expectancy at birth simply does not mean this. In 
a number of places (see the titles and descriptions of Graphs 7.3 and 7.4) Klein 
conflates the total fertility rate (a period rate) and children ever born (a cohort 
rate) in ways that make passages difficult to decipher: “the total fertility rates barely 
reached replacement and fluctuated between 2.0 and 2.1 children per women who 
had completed their fertility by the end of the century.” 

There are whole sections of this short book that are excellent and insightful. In 
addition to the first two chapters, I found Klein’s treatment of twentieth-century 
mortality trends to be especially good. He makes clear, for instance, that major 
twentieth-century advances in the control over infectious disease largely occurred 
over the brief period between 1938 and 1952. Many, though, will be using this book 
as a reference work, a place to turn to when desiring a definitive description of U.S. 
population trends. The presence of factual inaccuracies and the imprecise use of 
demographic measures and terms greatly limit its value as a reference source. 


The Travels and Adventures of Serendipity: A Study in Sociological Semantics 
and the Sociology of Science. By Robert K. Merton and Elinor Barber. Princeton 
University Press, 2004. 313 pp. Cloth, $29.95. 


MATHIEU DEFLEM, University of South Carolina 


The word serendipity was coined by the eighteenth-century English author Horace 
Walpole, who had discovered a fairy tale about the three princes of Serendip and 
the various instances of accidental sagacity that befell the tale's protagonists. With 
the publication of this book by Robert Merton and Elinor Barber, sociologists need 
wait no more for this once ‘still unpublished’ work about serendipity that Merton 
had sporadically hinted at in some of his writings. The monograph was written 
in 1958 and appears here in unaltered form, expanded with an afterword. An 
Italian edition appeared in 2002. Professor Merton received word of the decision 
by Princeton University Press to publish the book shortly before his death in 2003. 
Elinor Barber, a former student of Merton, passed away in 1999. The informed 
Introduction to the book by James L. Shulman provides some elucidation as to why 
it took almost half a century for the work finally to see the light of day. Perhaps the 
delay of the publication is only befitting to the theme of the work and its place in 
Merton's oeuvre and the contributions to the sociology of science by Barber and 
other Mertonians. 

When Merton and Barber wrote this book, serendipity was a rather obscure 
and rarely used term. Merton had first accidentally stumbled on the word when 
looking up another word in his Oxford English Dictionary. Coinciding with Merton's 
interest in the sociology of unanticipated consequences and his development of 
the sociology of science began a quest to unravel the trials and tribulations of 
serendipity. 

Merton and Barber trace the travels of serendipity in various literary traditions 
and publications. They describe the changing meanings of serendipity and examine 
its history with reference to those 135 people in the English-speaking world who 
used the word in the period up to the 1950s. The writings of Walpole were not 
widely discussed until they were republished some 80 years after his introduction 
of serendipity in 1754. Yet the Victorians who discussed Walpole's work were not 
keen on giving a place to chance in art and science. In nineteenth-century science, 
the role of chance in the making of discoveries was well known, but it was in literary 
circles that serendipity was initially discussed and diffused. The social milieus that 
favored serendipity were bibliophiles and antiquarians, authors, literary scholars 
and lexicographers, medical humanists, social scientists, applied researchers, and 
science writers. 

In science, serendipity's role is by definition problematic, as science cannot 
readily accept luck. But the role of accident in discovery can be approached from 
a scholarly viewpoint as well. Merton and Barber's study is framed within the 
Mertonian sociology of science paradigm and its attention to the unanticipated 
consequences of purposive conduct that found expression in such now familiar 
notions as the Matthew effect and the Thomas theorem. Work on Merton's related 
book On the Shoulders of Giants contributed to preventing earlier publication of 
the 1958 serendipity manuscript. 

Only a fool would review a posthumous contribution by one of the discipline's 
leading representatives and one of his collaborators on the basis of the usual criteria 
of scholarly merit. As a contribution to the sociology of semantics and science, 
the book makes partly good on its promises. Much of the book provides thick 
description of the paths and turning points of serendipity, but it does not develop a 
full-fledged sociological semantics on the basis of a theory of the social structure, a 
tradition to which nobody has contributed more than Merton. A sociological theory 
of serendipity is occasionally hinted at—best of all in Merton’s 2002 afterword—but 
could have been fleshed out in more detail. As it stands, the book offers a rather 
rudimentary sociology of professions as the basis of its exegesis. 

The humanist will find much beauty in this text. However, as Merton clarified 
better than anyone else, sociology as a social science stands between the physical 
sciences and the humanities. This ambiguity is decidedly not about the dichotomy 
that exists between scientific and unscientific work. Merton’s work and that of his 
Columbia compatriots is always resolutely sociological. The literary appeal of this 
book is one of its distinctive features, but it ought not take away from its sociological 
ambitions, even if they remain largely latent. In the afterword to the book, Merton 
bemoans the condition that today serendipity has become something of a Disneylike 
expression, a vague and vogue word that is often rephrased in psychological terms. 
Likewise, a misreading of Serendipity might mistakenly lead to the conclusion that 
sociology can be a literary exercise. But the fact that the literary fanatic Merton 
wrote in the Simmelian tradition of the extended essay is a matter of form only. 

Hurrah for Serendipity? Of course. Merton and Barber have written a very fine 
tale, one that makes good sense in the framework of a sociology of science and 
sociological semantics. The entertainment that The Travels and Adventures 
of Serendipity provides is matched by the brilliance of Merton’s Social Theory and 
Social Structure. 


Social Inequalities in Comparative Perspective. Edited by Fiona Devine and 
Mary C. Waters. Blackwell Publishing, 2004. 318 pp. 


JOHN MOLAND JR., Alabama State University 


In Social Inequalities in Comparative Perspective, Fiona Devine and Mary C. Waters 
have assembled a collection of essays by scholars from the U.S., Canada, France, 
the UK, Portugal, Australia, Japan, and Finland. They have produced a textbook aimed 
at helping students learn about inequalities in a comparative global perspective. 
Contributors call attention to globalization as involving not only extensive economic 
shifts and changes, but also extensive immigration flows and influences on identities 
and public policies. The essays in this comparative work demonstrate how meanings 
and identities are socially influenced and constructed in contextual settings relative 
to issues of social inequalities in areas of race, ethnicity, class, gender, and sexuality. 
To achieve leverage on social inequalities and the contextual nature of experiences, 
identity paths, meanings, and social processes, contributors used two or more 
qualitative research tools, including observation, participant observation, in-depth 
interviews, field studies, and documentary analysis. An integrating theme running 
throughout the essays is the significant value of qualitative research methods for 
providing insight into the meanings and understandings of contextual experiences 
in the making and perpetuation of inequalities in each country. 

The essays are grouped into three sections. The first section, "Inequalities of 
Race and Ethnicity," begins with a chapter by Mary C. Waters that examines race, 
ethnicity, and immigration in the United States with emphasis on second generation 
West Indian immigrants in New York City. Waters identifies three paths of identity 
development: (1) identifying as Americans, (2) identifying as ethnic American with 
some distancing from black Americans, or (3) identifying as an immigrant in a way 
that does not equate with American racial and ethnic categories. Micheline Labelle 
considers the “language of race,” identity development, and inequality as related 
to second generation Haitian and Jamaican immigrants and their integration into 
Canadian and/or Quebec society. Using qualitative research methods, including field 
studies and in-depth interviews, she reveals four types of national identities. Riva 
Kastoryano provides an examination of race and ethnic inequalities in France with 
emphasis on ethnicity as a developmental process. The French state has pursued a 
national secular identity emphasizing separation of church and state while avoiding 
giving legitimacy to diversities. However, immigrant associations have emerged with 
identities developed around cultural factors, particularly religious identities, such 
as Islam and the head scarf, which has become highly politicized. Liviu Popoviciu 
and Mairtin Mac An Ghaill point to the making of racisms and ethnicities in the 
development of British nationalism. Based on their in-depth research on young 
Irish men and women and on immigrants, they call for a reexamination of ethnic 
and racial theory and research in light of the study of whiteness and the use of 
qualitative methods to uncover contextual meanings and understandings. 

The second section of the book focuses attention on “Inequalities of Income 
and Class” in four countries: U.S., Britain, Portugal, and Australia. The contributors 
show how the use of qualitative methods allows for an exploration of the subjective 
understandings of class relations and in the process demonstrate the influence of 
the particular social history of each country. 

Katherine Newman and Chauncy Lennon focus attention on the working poor 
in Harlem, New York, and report findings from intensive interviews conducted 
in a panel study of the career trajectories of those at the bottom of the labor 
market. Although the participants had experienced only limited advancement, 
their subjective understandings of values regarding individualism and education 
sustained their optimism. Newman and Lennon make the case for more qualitative 
longitudinal research on the working poor in the U.S. in light of recent policy 
changes. Elisio Estanque presents information from his ethnographic study 
involving participant observation of working-class practices on the shop floor 
and discusses contextual historical factors contributing to class, gender, and ethnic 
exploitation and inequalities in Portugal. Bill Martin and Judy Wajcman discuss 
class and inequality in Australia by drawing on both quantitative and qualitative 
data. Using in-depth qualitative studies, they found three ways in which education 
is understood as contributing to mobility based on the meaning parents attach to it. 
In Chapter Eight, Fiona Devine explores the saliency of middle-class and working- 
class identities in Britain given the growing similarities between the working class 
and middle class in terms of life style and social political proclivities. Although 
class identities remain strong, ambivalence exists about the use of the middle-class 
label. Ethnographic work is needed to capture feelings and emotions as associated 
with stratification in the form of class or other concepts. 

Section three of the book directs attention to inequalities of gender and 
sexualities in the U.S., Japan, Britain, and Finland. Christine L. Williams, Patti A. 
Giuffre, and Kirsten A. Dellinger examine the growth and development of theory 
and research on gender, including qualitative studies of sexual harassment in the 
U.S. Their studies suggest that definitions of sexual harassment vary depending 
on the social and cultural context, as may other areas of gender stratification and 
identity. Yuko Ogasawara focuses attention on qualitative research on gender 
inequalities in Japan and reveals that Japanese women are not powerless and 
should not be pitied as Western stereotypes suggest. Findings from her research 
on interaction between men and women and the contextual perceptions of both 
suggest otherwise. In Chapter 11, Harriet Bradley addresses the issue of whether 
women are catching up with men in the UK. She discusses the wide range of theory 
and research leading up to and including post-structuralist and postmodernist 
feminist perspectives and presents research findings questioning the progress 
of women in general. The final chapter, by Paivi Korvajarvi, calls attention to 
gender and work related inequalities in Finland. In spite of the long participation 
of women in the labor market, supported by state provided child care, Finland's 
occupational structure is characterized by gender segregation, and women are paid 
less than men. Her case study shows the process by which gender segregation is 
remade in new industries and occupations and shows the diversity of forms that 
inequality of race, ethnicity, class, and gender can take in different places. 

One change that would make the book more appealing as a textbook would 
be to group the essays in the table of contents under the section headings 
presented and discussed in the Introduction. 

Recognizing the limitations of qualitative research with respect to its subjective 
nature and generalizability, each contributor calls for more qualitative research and 
analysis of social inequalities not only to address the limitations but to promote a 
deeper understanding of the meanings and social processes underlying the making 
of racial, ethnic, class, and gender inequalities. Consequently, this book is a major 
contribution to the subject matter area and is a fine choice for course adoption for 
research on gender, race, and ethnic inequalities in a global perspective. 


Shrinking Violets and Caspar Milquetoasts: Shyness, Power, and Intimacy in 
the United States, 1950–1995. By Patricia A. McDaniel. New York University 
Press, 2003. 214 pp. 


JACKIE L. ELLER, Middle Tennessee State University 


Periodically shyness has been treated as a set of behaviors, a personality trait, and 
an emotion. Is it an emotion to be “seen but not felt”; is it to be viewed as proof 
of femininity, a natural protection of feminine virtue, a social "disaster," or social 
phobia? How does shyness, however defined, relate to other feelings and social 
behaviors called for in a given situation? How and why has the conceptualization 
of shyness changed from the 1950s to the 1990s? As part of emotion culture, how 
has the perception of shyness contributed to as well as challenged gender, race, 
and class relations of power? These are the questions McDaniel explores in her 
well-researched study. 

Knowing that McDaniel based her study of shyness on an examination of 
popular self-help books and magazines, I admit I was skeptical. Would she be able 
to convince me or other readers of the interplay between the advice given and the 
meanings and expressions of shyness constructed in everyday life? The thoroughness 
of her data collection and analysis has left little doubt. Through an exhaustive 
procedure to identify truly popular self-help books and magazines, McDaniel’s 
final sample consisted of 101 self-help books written for an audience of mostly 
white females and males, 36 magazines for an audience of black males and females, 
and 78 child-rearing manuals. The strength of her work, though, is not found in 
her sample alone, but in the way in which she is able to weave together the advice 
given with social, political, and economic realities, especially as they impact white, 
middle-class women and men. 

McDaniel begins with an overview of prevailing attitudes toward shyness in 
the West from the Middle Ages into the early twentieth century. In one whirlwind 
chapter that sets the stage for a later analysis of shyness, she takes the reader from the 
medieval codes of conduct, to the emergence of the middle class, to the construction 
of Anglo-Saxon manliness and the “natural” timidity of the nineteenth-century 
lady. Here she traces how the conceptualization of shyness contributes to these 
constructions, but also to inherent contradictions that eventually lead to a crisis 
of masculinity that produces images of the “sissy,” black “boys,” and savages. For 
a privileged white woman in the South, the cult of True Womanhood embraces 
her natural shyness while juxtaposing it with the “natural” brazenness of the 
black “jezebel” or “mammy.” For white women in the North, the cult of “Real 
Womanhood” posits dignity, not false modesty (shyness), as an interactional 
strategy. As the nineteenth century comes to a close, the culture shifts from a 
“culture of character” to a “culture of personality,” and norms again change, but 
as McDaniel emphasizes throughout her study, “. . . the link between shyness and 
gender remains a powerful ideological tool in support of gender as well as race 
and class hierarchies.” 

Throughout the next four chapters, McDaniel examines shyness from the 
1950s to the 1990s within intimate heterosexual relationships, the workplace, and 
friendships. Whether for intimate relationships or friendships, ideas about the
place of shyness in the roles and rules of intimacy change. Over those changes 
and in great detail McDaniel is able to demonstrate the consistent expectation of 
white, middle-class women's emotion work (self-disclosure and the drawing out 
of men from their "natural" reserve) in the achievement and maintenance of that 
intimacy. For blacks, the issues of intimacy are intricately entwined with civil rights, 
yet here again McDaniel is able to demonstrate how the advice givers convey the 
importance of women's emotion work (managing assertiveness) in shoring up 
masculine pride while empowering self. 

Within the workplace, McDaniel traces the shift from shyness as required of 
both men and women to shyness as antithetical to success. In the 1950s, shyness 
for men, defined as deferential emotion labor, was the key to success; for women, 
shyness was the key to workplace survival. But by the 1990s, the emphasis had 
shifted to empathetic listening as the key to creating an appearance of a power- 
free workplace. While self-help authors argue that empathetic listening would 
equally benefit men and women, the reality is that their advice helped to conceal 
the relationship between emotions and gendered power. 

McDaniel’s book is an interesting and worthwhile read, but I offer three points 
for consideration. (1) Personally I find Hochschild's distinction between emotion 
work and emotion labor a useful and meaningful distinction; McDaniel uses them 
interchangeably. (2) Due to limitations of available evidence, McDaniel's analysis 
is mostly about norms for white, middle-class women and men. Therefore, the 
analysis for African Americans and the working class is shallow. (3) At times, 
McDaniel’s analysis became more about gender than about shyness and emotions 
per se. Those most interested in the social construction of emotions might find this 
troubling; I did not. Throughout this highly readable book, McDaniel "highlights 
how shyness norms frequently function as ideology" that supports gender, race, 
and class hierarchies. Based on a convincing argument, the thoroughness of her 
analysis, and the uniqueness of her subject matter, I recommend her book. 

Minding Our Own Business: Local Retail
Establishments and the Future of Southern 
Civic Community* 


CHARLES M. TOLBERT II, Baylor University 





Abstract 


The civic community perspective focuses on local social and economic institutions 
that buffer communities from external, often global forces. Important community 
organizations such as locally oriented business establishments, civic organizations, 
associations, and churches are emphasized. These critical entities are posited to 
benefit a community through an enhanced quality of life, more civic engagement 
by the citizenry, and a strong capacity for local problem-solving. This address 
argues specifically for the beneficial consequences of locally oriented retail businesses 
(single-establishment enterprises and local chains). The relative presence of these 
local firms is shown to be associated in a positive way with small manufacturing 
establishments, associations, public gathering places ("third places"), social capital, 
and voter turnout. Negative correlations are presented for the relationships between 
locally oriented retail businesses and rates of poverty, infant mortality, and crime. 
Throughout the presentation, concern is expressed that the South lags behind other 
U.S. regions in employment and wages derived from these locally oriented firms. 
0049033), and by the U.S. Department of Agriculture National Research Initiative (cooperative

My focus here is the relationship between the organization of the local business
sector and a variety of community- and individual-level outcomes. I will make 
the claim that this relationship can be quite positive and beneficial to both 
communities and the people who reside in them. In doing so, I will provide
an overview of a line of work that my collaborators and I have
been pursuing for several years. Our research has been national in scope. In light
of our meeting’s theme — Southern Sociology — I would like for us to consider 
today national trends as they bear on the South. 

Many of us have fond recollections of small-town life in the South and 
elsewhere. My spouse’s grandfather ran a jewelry store in Prescott, Arkansas. 
Visits there took on more meaning to me as I studied sociology and came to 
appreciate the role of this proprietor in the social and economic organization 
of a 1970s small town. For years, his establishment had been next door to his 
brother-in-law’s general store. They often watched each other’s store while the 
other ran an errand. The stores were across the street from city hall where the 
jewelry store owner’s wife worked part-time. The couple lived in an apartment
above the jewelry store. Neighbors drove downtown, parked in front of the 
store, and strolled by the storefronts along the main street. Folks on jury duty 
would stop by to say hello and pass on news about friends and acquaintances. 
In many ways, the built environment of this small town created spaces for civic 
engagement. The store owner was at the nucleus of a local social network of 
proprietors, professionals, and other citizens. He was keenly aware that, as the 
town thrived, so did his business and the livelihoods of not only his family, but 
also just about everyone he knew. To move from Prescott would mean losing all 
of this. Indeed, throughout the post-war period, business was steady, and the 
farmers and timber owners in Nevada County did well. At that time, it seemed, 
small-town quality of life was unsurpassed. If you went back to Prescott today, 
you would find the main street less busy and a number of storefronts empty. 
The jewelry store building was lost in a fire not long after it was shuttered. A 
retail supercenter has opened in nearby Hope, diverting traffic and drawing large 
numbers of shoppers away. 

As a rural sociologist, I am tempted to conclude that progress in the
form of industrial agriculture and globalization has left this small-town way of 
life behind. Are we rural sociologists left with only professional and personal 
nostalgia? Our expectations for many small towns these days are indeed rather 
grim. Boarded-up storefronts and empty parking spaces suggest little evidence 
of human activity. Any remaining retail activity has relocated to a big-box 
store or two on the outskirts of town. Physical spaces that once enabled civic 
engagement are decaying. An independent or merchant middle class is in decline. 
Key community stakeholders are losing attachment to communities that once 
defined their social status and sustained their livelihoods. They see few options 
other than migrating to areas where more opportunities present themselves. 

More recently, Abbeville, Louisiana, has been the subject of two of my research
projects. Unlike Prescott, Abbeville has a thriving downtown with stiff competition 
for any available storefronts. This is despite the presence for several years of big- 
box retail stores on a bypass on the edge of town. Our studies have concluded that 
Abbeville is a remarkably resilient place that somehow buffers itself from global 
social and economic forces that have reshaped other small communities. This is a 
central paradox for us: not every rural town is in decline; in fact, some are thriving. 
As this became clearer to us in our work, we sought to understand why. We are
finding that some locales have a thriving civic community that enables them to 
weather internal and external ups and downs. Led by local stakeholders — public 
officials, locally oriented businesspersons, and civic leaders — an impressive and 
flexible problem-solving capacity develops. In Abbeville's case, this capacity has 
buttressed the community through boom and bust episodes of the offshore oil 
and gas industry, the closing of a textile mill that employed the equivalent of 10 
percent of the town's population, and a collapse in export prices for rice. 

Black's Oyster House in Abbeville exemplifies this resiliency. The building was
originally a general store dating to the early 1900s. Later, it was a locally owned
department store, fashionably complete with a mezzanine. In a lot of small towns, 
the department store would have closed by now, and the building would have been 
shuttered. This was not the case in Abbeville. With a large local support network, 
confident entrepreneurs stepped up and invested in renovations necessary to 
convert the building to a restaurant. Now diners on the mezzanine have a great 
vantage point for local people watching. Our experiences in these two southern 
communities have helped us shape a perspective we call civic community. 


Civic Community 


In a number of papers and books, we have elaborated on what we mean by civic 
community. For the present purposes, let me just say that the perspective focuses
on social and economic structures and institutions that buffer communities from 
external, usually global, forces. This leads us to identify and study important 
community organizations such as locally oriented business establishments, civic 
organizations, associations, and churches. These critical entities, in turn, are 
posited to benefit communities through an enhanced quality of life, more civic 
engagement by the citizenry, and a strong capacity for local problem solving. 
Locally oriented businesses have stakes in the local labor market, the local 
economy, the local infrastructure, and — usually — the local product market. 
As the community goes, so go these entities. Persons owning, managing, and 
working in these establishments rarely employ a multinational or corporate 
perspective. The worldview is local. To lay off workers in slack times, for example, 
would mean taking jobs from relatives, friends, and neighbors. In Abbeville, one 
service station owner was extremely proud that, at the height of the oil bust,
he had employed 10 relatives to pump gas. To this day, folks drive into town
and pay more at his station than they would at the self-service quick stops along 
the bypass. Let me describe further how we have defined the notion of locally 
oriented businesses. 

Our earliest efforts (Irwin, Tolbert, and Lyson 1999; Lyson and Tolbert 1996; 
Tolbert, Lyson, and Irwin 1998) focused on small manufacturing establishments. 
What we observed in thriving rural areas were clusters of small production 
facilities almost always nested in cooperative networks. Economists refer to 
agglomerations of these as industrial districts. In our work, we operationalize 
small manufacturing as 19 or fewer workers. 

In addition to small manufacturing establishments, my colleagues and I
have more recently studied other locally oriented enterprises primarily in retail 
and services (Irwin et al. 2004; Tolbert et al. 2002). We classify single business 
establishments as locally oriented. The entire enterprise exists in one location. To
these single establishments we add local chains, that is, enterprises with all
establishments confined to the local area. We have experimented with various spatial definitions,
but have primarily defined local chains as those with all establishments located 
in the same county. These local distinctions cannot be made with public-domain 
data. Instead, we have developed measures using confidential Economic Census 
microdata. These establishment-level data are housed at Census (in its Center 
for Economic Studies) and at its affiliated Research Data Centers around the 
country. Measured in this way, locally oriented businesses are the subject of
the analysis I present below.
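
The classification rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the record layout (`enterprise_id`, `county`), the function name, and the sample records are hypothetical, and the confidential Census microdata are surely organized differently.

```python
from collections import defaultdict

def classify_locally_oriented(establishments):
    """Flag each establishment as locally oriented.

    Hypothetical rule, following the text: an establishment is locally
    oriented if its enterprise has a single establishment, or if every
    establishment of the enterprise lies in the same county.
    """
    # Collect the set of counties in which each enterprise operates.
    counties_by_enterprise = defaultdict(set)
    for est in establishments:
        counties_by_enterprise[est["enterprise_id"]].add(est["county"])

    # An enterprise confined to one county covers both cases in the text.
    return [
        {**est, "local": len(counties_by_enterprise[est["enterprise_id"]]) == 1}
        for est in establishments
    ]

# Made-up records for illustration only (not Census data).
sample = [
    {"enterprise_id": "A", "county": "county_1"},  # single establishment
    {"enterprise_id": "B", "county": "county_1"},  # local chain, one county
    {"enterprise_id": "B", "county": "county_1"},
    {"enterprise_id": "C", "county": "county_1"},  # chain spanning two counties
    {"enterprise_id": "C", "county": "county_2"},
]
flags = [e["local"] for e in classify_locally_oriented(sample)]
```

Treating "one county" as the test keeps single establishments and within-county chains under one condition; a different spatial definition would only change how the grouping key is built.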

Third places are public gathering places such as barber shops, beauty shops, 
cafes, pubs, libraries, museums, and the like. Oldenburg (1991) coined the term 
third place. If the first place is home and the second place is work, then third 
places are the places we go for commercial or leisure reasons. They are built 
environments for interaction, engagement, and problem solving. Using our 
definition of locally oriented enterprises, we can identify local third places that 
are single establishments or parts of local chains. That is, they are not connected 
to regional, national, or international corporate entities. 

Associations constitute another important category of social institutions, and 
we have found that areas do indeed vary in the number of associations. As with 
third places, associations can also be defined as exclusively local. 

We have also treated churches as important community institutions. The 
church censuses provide approximate counts of congregations and adherents 
by counties. These religious organizations turn out to be significant venues 
for interaction, engagement, and problem solving. In some of our work, we 
have studied denominations whose members exhibit high rates of organization 
memberships outside the church. Calling these civic denominations, we have 
found such entities to be primarily — but not exclusively — traditional mainline 
Protestant denominations. One of the most civic religious groups, for example,
is the Latter Day Saints. Areas where its numbers are large rate highly on our
civic indices. Lee and Bartkowski (2004a, 2004b) have since used our notion of 
civic denominations in investigations of community homicide levels. 


Theoretical Issues 


How does our civic community perspective relate to the civic engagement 
approach championed by Robert Putnam (1993, 2000) and others? By including 
the civic attributes of the local business sector, we see our perspective as 
broader and somewhat distinct from a focus on civic engagement that counts 
memberships and occasions of interaction. More importantly, our use of the 
term "community" implies that our focus is bounded spatially and/or socially 
delineated by a collective sense of place. 

How does our perspective relate to social capital? If social capital is defined as 
the sum total of one's weak and strong network ties and other network resources, 
then social capital is clearly an important part of our study. What we observe 
in the aggregate (e.g., community quality of life, voter turnout, crime rates) 
surely reflects, in part, the consequences of individual stocks of social capital. 
But, we are not carrying out the sort of micro-level social network and social 
resources analyses that adequately measure and model social capital (see, e.g., 
Hurlbert, Haines, and Beggs 2000). Thus, we make no claims about measuring 
social capital per se. 

How does our perspective treat the concepts of place and space? We have 
focused for the most part on meaningful geographies (townships, villages, cities, 
counties) — places with which residents identify. We also refer to some territory 
as simply “space”; by that we mean an area with which residents or those nearby
do not identify. Where there is no sense of place, there will be very little civic 
community. For example, the present restructuring of the urban fringe suggests 
that rural and once-rural places are being transformed into less meaningful space. 
Consolidation and concentration of the retail and service sectors are reshaping and 
transforming these areas spatially, socially, and economically. This restructuring 
poses a challenge for civic community. In extreme cases, these changes undermine 
entirely the social fabric of communities. We have observed that a decline of rural 
civic community leads to a loss of economically productive residents, a decline 
in local businesses, fewer civic spaces for interaction, and an erosion of local 
physical and social infrastructure. Communities so changed exhibit a reduced 
capacity for local problem solving and are increasingly vulnerable to the social 
disruptions of globalization. Once-meaningful rural places become commodified, 
yet hollow space. 

How does our perspective’s emphasis on an independent merchant class 
mesh with the conventional sociological notion of the middle class? It seems we 
sociologists may have written off the middle class too quickly. Even Mills — whose 
early career work on small businesses is seminal for our civic community approach
— dispatched the middle class to corporate America in White Collar. But, in the 
postwar era, it was indeed intellectually stylish to muse about the organization 
men who were populating the new suburbs and commuting to work in city- 
center skyscrapers (Whyte 1956). Another effort to write the obituary of the 
middle class arose out of institutional and labor economics as well as stratification 
sociology. As it became clear that the prosperity of the 1950s and 1960s had not 
been duplicated in the 1970s, studies in the 1980s — including some of mine 
— aimed to show that the middle class was shrinking (Harrison and Bluestone 
1988; Tolbert and Lyson 1992). Yet another thread of these arguments came from 
left-leaning writers who bemoaned the prison-like lives led by what was once a 
large and relatively prosperous class (Davis 1986; Ehrenreich 1989). 

Historian Robert Johnston's Radical Middle Class chastises sociologists for 
clinging to a one-dimensional view of the middle class that leaves no room for a
petty bourgeoisie, a merchant class, or an economic standing based on anything 
other than corporate niches (Johnston 2003). In his analysis of the progressive 
era in Portland, Oregon, it is in fact small business owners, proprietors, and 
shopkeepers who led radical reform movements — often expressly on behalf of 
their workers. Johnston certainly articulates his argument better than we have 
so far. Still, we have for some time considered anchoring our civic community 
perspective in basic, structural, social stratification theory. The term we are using 
is independent middle class. This group consists of business owners or those who 
are closely tied to local small businesses (i.e., family members, unpaid family 
workers, other employees, or customers). They have a substantial vested interest in 
the place where their businesses are located. In rural development parlance, they 
are “stakeholders.” They are also deeply embedded in local professional, business, 
religious, social, and civic organizations and institutions. More importantly, we 
doubt this class is limited to rural America, though it may be much easier to 
identify and study there. 

The radicalism we are beginning to identify on the part of this contemporary 
independent middle class is its less-than-rational economic behavior. At the firm 
level, strategic business plans do not necessarily follow a profit maximization 
path or take the least expensive route. Similarly, at the individual level, behaviors 
are not necessarily conditioned on purchasing a product at the cheapest price or 
even taking the best paying job. 


Evidence for the Independent Middle Class: The Case of Locally 
Oriented Retail 


For this presentation today, I limit our consideration to a nontrivial part of the 
economy: the retail sector. Our ongoing and planned work is taking a broader view 
that includes most other economic sectors. But, given the frequency with which 
we read of community efforts to resist supercenters and/or big-box operations,
the retail sector offers an interesting and at times controversial focus. From our 
developing civic community perspective, there is clear reason for our emphasis 
on the transforming retail sector: we could be observing the mechanisms that 
bring about the demise of this hypothesized independent middle class and the
local business establishments that sustain it. 

In terms of our annual meeting's theme, "Southern Sociology," it is no 
small coincidence that the largest of the big-box retailers — indeed the largest 
employer in the world — is a southern corporation that built an empire, first, in 
the rural South. This suggests that locally oriented businesses would fare worse 
in recent times in the South. To provide some data on this, I will introduce one 
way that my colleagues and I are measuring locally oriented business. Then, in 
a simple state-level analysis, I will show how local businesses are related to other 
community attributes and to the behavior of individuals. 

For this presentation, I developed a number of state-level measures of the 
presence of these locally oriented establishments from confidential internal 
Census data files. As noted above, local retail establishments are those that either 
exist entirely in one location or are components of local (within-county) chain 
enterprises. The data are for the Economic Census years 1977-1997. There are 
two measures: 


1. Percent of a state's total retail employment that is accounted for by locally 
oriented establishments. 


2. Percent of a state's total retail wages paid by locally oriented establishments. 
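
The two measures listed above are simple shares, and a minimal sketch of the arithmetic may help. The field names (`state`, `local`, `employment`, `wages`) and the figures are invented for illustration and are not drawn from the Economic Census files.

```python
def local_share(records, value_key):
    """Percent of each state's retail total (employment or wages)
    accounted for by locally oriented establishments."""
    totals, local = {}, {}
    for r in records:
        state = r["state"]
        totals[state] = totals.get(state, 0.0) + r[value_key]
        if r["local"]:
            local[state] = local.get(state, 0.0) + r[value_key]
    # Skip states with a zero total to avoid dividing by zero.
    return {s: 100.0 * local.get(s, 0.0) / t for s, t in totals.items() if t > 0}

# Made-up establishment records for two hypothetical states.
records = [
    {"state": "X", "local": True, "employment": 40, "wages": 300.0},
    {"state": "X", "local": False, "employment": 60, "wages": 700.0},
    {"state": "Y", "local": True, "employment": 30, "wages": 250.0},
    {"state": "Y", "local": False, "employment": 20, "wages": 250.0},
]
employment_share = local_share(records, "employment")
wage_share = local_share(records, "wages")
```

In this toy data the employment and wage shares differ within a state, which is why the text tracks the two measures separately.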


First, we will examine regional trends in these two measures over time, 
comparing the South to other U.S. regions. Second, we will present some 
correlations between these measures and various outcomes of interest. The 
coefficients are multiple correlations that control for state median income and 
population in most cases. Lastly, we will look at some illustrative multilevel 
modeling results, which relate these state-level measures of local retail business 
orientation to civic behaviors reported by survey respondents. 
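
One standard way to obtain a correlation that controls for other variables is the partial correlation. The sketch below assumes that this is roughly what the coefficients described above represent; the text does not spell out the computation, so the function and the toy data are illustrative only.

```python
import math

def pearson(x, y):
    """Ordinary Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def partial_corr(x, y, controls):
    """Correlation of x and y with the control variables partialled out,
    using the standard recursion on lower-order partial correlations."""
    if not controls:
        return pearson(x, y)
    z, rest = controls[0], controls[1:]
    r_xy = partial_corr(x, y, rest)
    r_xz = partial_corr(x, z, rest)
    r_yz = partial_corr(y, z, rest)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

# Toy data: x and y are related only through the shared control z.
z = [-1, -1, 1, 1]
x = [-2, 0, 0, 2]   # z plus noise unrelated to y
y = [-2, 0, 2, 0]   # z plus noise unrelated to x
raw = pearson(x, y)                  # positive, driven by z
controlled = partial_corr(x, y, [z])  # close to 0 by construction
```

For the state-level analysis, `controls` would hold two sequences (median income and population), and the same recursion applies.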

Figure 1 displays trends in retail employment in local establishments by 
regions of the country. The bottom dashed line is the South. The South begins the 
data series in 1977 with proportionately less locally oriented retail employment. 
The regional difference widens somewhat over time. While there is clearly
a secular national decline, the South leads it, with only 40
percent of retail employment in locally oriented establishments by 1997.

Figure 2 shows the share of total retail wages paid by locally oriented 
establishments. The regional patterns are much the same as those we observe 
for employment. The South starts out with the lowest percentage of retail wages 
paid by local firms and the southern share declines further over time. 

Figure 1. Percent Retail Employment in Local Establishments (series: Northeast, Midwest, South, West; x-axis: Year, 1977 to 1997)

Figure 2. Percent Retail Wages Paid by Local Establishments

Figure 3. Local Orientation and Small Manufacturing

How do we account for the distinctiveness of the employment and wage
patterns for the South? Retail in the South was surely late in developing. By the
time it did, corporate entities were dominating the national scene. Anecdotally, I
recall many small southern towns in the 1950s and 1960s that had Sears catalog 
stores and Western Auto stores located in prime downtown spaces. Clearly, 
research is needed on the seemingly distinctive development of the retail sector 
in the South. Another explanation is that the nonunion South was particularly
fertile ground for the development and expansion of big-box retail. In the 
1960s, there were several state-wide and regional chains building warehousing 
and distribution networks (Walton 1992). In the 1970s, of course, most of these 
enterprises gave way to a single large retailer, changing the structure of retail 
employment and wages in the South first and, later, in other regions. If this thesis 
holds, we would expect the other regions to exhibit trends more like that of the 
South over time. Readers can draw their own conclusions, but my observation 
is that the American retail sector is heading south. 

To be sure, sociologists are unlikely to halt the steady transition of the U.S. 
retail sector. Yet, I submit that there is reason for an informed concern about these 
trends. Even if we cannot personally resist the savings and variety large retailers 
offer, we ought to worry about the decline of locally oriented retail because it may 
be undermining civic community. The analysis that follows makes a reasonably 
compelling case for concern. 

Figure 4. Local Orientation and Per-Person Associations

Figure 5. Local Orientation and Per-Person Third Places

The next few figures show the relationship between locally oriented retail
and civic community measures we have commonly employed in our work.
The relationships are depicted as multiple correlations between local retail and
other civic measures (controlling where appropriate for state population and
median income). Figure 3 shows the relatively strong relationship between 
locally oriented retail and the presence of small manufacturing establishments. 
Whether we measure local orientation in terms of retail employment or wages, 
the correlations with state-level percentages of manufacturing establishments that 
have 19 or fewer workers are positive and moderately strong. If anything, the 
relationships become a bit stronger over time. In work going back to the mid- 
1990s, my colleagues and I as well as others have shown small manufacturing to 
be associated with a variety of beneficial local outcomes (Lee and Ousey 2001; 
Lyson and Tolbert 1996). These results suggest that we may see the same for 
locally oriented retail enterprises. 

Another civic community variable we have used extensively is per-person 
associations. The correlations of associations with locally oriented retail are 
modest at best, but still positive (see Figure 4). When we look at per-person 
third places — those public spaces where people gather — the relationships are 
quite strong and also appear to be going up over time (Figure 5). Where there 
are relatively larger percentages of employees and wages in locally oriented retail, 
there are also more third places. 

Figures 3 through 5 show us that the locally oriented retail measures vary in
reasonably similar ways to some of our earlier civic community measures. The
table and figures that follow show correlations between local retail orientation
and key quality of life outcomes. We begin with Robert Putnam’s state-level social
capital index (see Putnam 2000). In Table 1, the correlations between his index
and our measures are positive and at least of moderate strength. There is a quite
strong relationship between share of employment in locally oriented retail and
social capital as measured by Putnam. States with higher levels of locally oriented
businesses also have higher levels of social capital (as measured by this index).
Unfortunately, this part of our analysis is necessarily cross-sectional as the social
capital index is not available over time.

Table 1. Local Orientation and Social Capital

Measures for 49 States Based                  Correlation with Putnam
on 1997 Retail Census                         Social Capital Index

Employment in Local Establishments            .74
Retail Wages Paid by Local Establishments     21

Figure 6 shows the relationship between local retail and national election 
turnouts for various years. These correlations are positive and moderately strong. 
It is safe to say that where there are relatively larger proportions of locally oriented 
retail establishments, voter turnout is higher. 

In our work, most of our civic community measures have been found to 
be negatively associated with poverty. In Figure 7, we see that the relationship 
between local retail and poverty is indeed negative, but only of modest strength 
around 1990. This is a likely reflection of cyclical economic performance, but a 
resolution of that thesis is beyond the scope of this article. 

Infant mortality is a readily available proxy measure of overall public health 
and health infrastructure. In Figure 8, we observe that infant mortality is 
negatively associated with locally oriented retail employment and wages. Restated 
less technically: lower levels of infant mortality are associated with higher levels of 
employment and wages in locally oriented retail establishments. This is another 
relationship that appears to become stronger over time. 

The Uniform Crime Rate composite index is another quality of life indicator. 
Here, we would also expect a negative correlation, and that is what we observe 
in Figure 9. As locally oriented retail goes down, the crime index goes up. Like a 
number of the other relationships, this one appears to strengthen over time. 

Figure 7. Local Orientation and Poverty Rate

Figure 8. Local Orientation and Infant Mortality

Figure 9. Local Orientation and Crime Index

Thus far, the analysis gives us a sense of the implications of local retail at the
state level. But, friendly critics have posed a fair question: "Is civic community
associated with individual behaviors?" Put another way, does a high level of civic
community mean that residents exhibit behavior that differs from residents of
areas with low levels of civic community? In a multilevel modeling framework,
the question would be addressed by testing for intercept differences for higher and
lower civic places. We might further ask: “Does the relative presence or absence
of a civic climate mediate individual-level processes such that persons behave 
differently than we would expect based on their personal attributes alone?” In 
a multilevel approach, this question would be addressed by modeling effects of 
civic structures on relationships between individual-level characteristics and 
behavioral outcomes (i.e., testing for slope effects). We do just that in the last 
analysis section. 
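
The two questions can be written compactly in the notation commonly used for two-level models. The symbols below are illustrative labels, not taken from the original analysis: $i$ indexes persons, $j$ indexes states, $y_{ij}$ is a binary civic behavior, $x_{ij}$ is an individual characteristic, and $w_j$ is a state-level civic measure.

```latex
\begin{aligned}
\operatorname{logit}\Pr(y_{ij}=1) &= \beta_{0j} + \beta_{1j}\,x_{ij} \\
\beta_{0j} &= \gamma_{00} + \gamma_{01}\,w_j + u_{0j} \\
\beta_{1j} &= \gamma_{10} + \gamma_{11}\,w_j + u_{1j}
\end{aligned}
```

In this sketch, a nonzero $\gamma_{01}$ corresponds to an intercept difference between more and less civic places, and a nonzero $\gamma_{11}$ corresponds to a slope effect, that is, the civic context changing the relationship between $x_{ij}$ and the outcome.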


Civic Behavior in Civic Context: A Multilevel Analysis of State Locally 
Oriented Retail and Individual Propensity to Engage in Civic Behavior 


This analysis employs individual-level data that identify state of residence from 
the 1996-1998 DDB marketing surveys. These surveys have been made available 
on the Internet by Robert Putnam. Among the many items in the DDB survey
data are three that reflect civic participation. They are how many times a year a 
person (1) goes to club meetings, (2) works on community projects, and (3) does 
volunteer work. Preliminary assessment of the distributions of these variables 
indicated that the responses were essentially dichotomous. Respondents reported 
doing these things once or twice at most, or not at all. About 60 percent of the 
respondents had taken at least one of these civic actions. Thus, for this illustrative 
analysis, I formed a binary index coded 1 if the respondent reported at least one 
civic behavior. 
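
The coding of the binary index can be sketched as follows. The argument names and the sample responses are hypothetical, since the DDB item names are not given here; the only rule taken from the text is that the index equals 1 when at least one of the three behaviors is reported.

```python
def civic_index(club_meetings, community_projects, volunteer_work):
    """Return 1 if the respondent reports at least one civic behavior, else 0."""
    counts = (club_meetings, community_projects, volunteer_work)
    return int(any(n > 0 for n in counts))

# Made-up yearly counts for five hypothetical respondents.
respondents = [(0, 0, 0), (2, 0, 0), (0, 1, 1), (0, 0, 0), (1, 1, 2)]
index = [civic_index(*r) for r in respondents]
share_active = sum(index) / len(index)
```

Collapsing the counts to a 0/1 indicator discards frequency information, which is acceptable only because the responses are described as essentially dichotomous.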

At the individual level, we estimate this index with personal attributes
including age, education, business ownership, gender, church attendance, and 
home ownership — all of which have been shown to be associated with civic 
participation. At the state level, we include the percent employed in local retail 
establishments since this measure performed best in the state-level analysis 
above. 

For brevity, I focus on two particular multilevel relationships that plausibly 
stem from this model. First, it is well established that older persons exhibit 
more civic participation than younger persons. The prevailing rationale for this 
positive relationship is that it reflects the life course. In particular, persons with 
older children or none at home are more likely to participate in civic activities. 
But, what about the civic community context in which such behavior occurs? Do 
state differences in local retail employment condition the relationship between 
age and civic participation? For example, do young people participate more in 
a civic climate? Second, it is also known that educated persons are more likely 
to exhibit civic behaviors. To what extent might a civic context condition this 
relationship? Do less educated persons engage in more civic behaviors in a locality 
where locally oriented businesses thrive? Or, does a civic context enhance the 
likelihood that educated persons will act in one of the civic ways measured by 
the DDB data? 

To address this, I estimated a hierarchical linear model with HLM, employing 
a logistic response function (Raudenbush and Bryk 2002). The results at the 
individual level indicated the expected positive relationships between civic 
behavior and the explanatory factors age and education (measured as years of 
schooling).⁸ The state-level measure of percent employment in local retail was 
also positive and statistically significant. This implies a difference in intercepts 
such that persons living in states with high levels of locally oriented retail are 
more likely to exhibit civic behaviors. The coefficient for the state-retail effect on 
age was negative, indicating that the state-level measure of local-retail attenuates 
the positive individual-level relationship between civic behavior and age. A civic 
context as represented by locally oriented business establishments promotes civic 
behavior on the part of younger persons. In contrast, the local-retail variable 
had a positive effect on the relationship between civic behavior and education. 
Persons with more education were even more likely to exhibit civic behaviors in 
contexts with high levels of locally oriented retail. Figures 10 and 11 illustrate 
these multilevel relationships. 
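
As a rough illustration of these cross-level interactions, the sketch below evaluates the level-two slope equations reported in note 8 at two values of state local retail employment. It assumes, as the text describes, that the state-level predictor in both equations is the percent employed in local retail. The retail percentages used (5 and 10) are hypothetical placeholders rather than the 25th and 75th percentiles plotted in Figures 10 and 11, and the function names are illustrative only.

```python
# Level-two (state) slope equations, using the coefficients reported in note 8.
# ASSUMPTION: the state-level predictor in both equations is the percent
# employed in local retail, as the text describes.

def age_slope(local_retail_pct: float) -> float:
    """Effect of one year of age on the log-odds of civic behavior."""
    return 0.0124 - 0.0004 * local_retail_pct


def education_slope(local_retail_pct: float) -> float:
    """Effect of one year of schooling on the log-odds of civic behavior."""
    return 0.1757 + 0.0020 * local_retail_pct


# HYPOTHETICAL retail-employment levels (percent), chosen only for illustration.
LOW_RETAIL, HIGH_RETAIL = 5.0, 10.0

for label, pct in (("low", LOW_RETAIL), ("high", HIGH_RETAIL)):
    print(f"{label:>4} retail ({pct:.0f}%): "
          f"age slope = {age_slope(pct):.4f}, "
          f"education slope = {education_slope(pct):.4f}")
```

Under these assumptions the age slope shrinks as local retail employment rises while the education slope grows, which matches the attenuation and enhancement described above.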

In Figure 10, the X-axis depicts age and the Y-axis shows predicted values of 
the civic index. The thinner black line shows the estimated relationship between 
age and civic activity for someone living in a low local-retail-employment state 
(measured at the 25th percentile). The thicker line shows the same for someone in 
a high local-retail-employment state (75th percentile). The different slopes show 
that the relationships at the individual level do vary between these types of states. 
The model shows young persons in high local-retail-employment states with 
higher civic participation than their counterparts in low local-retail-employment 
states. The model predicts higher civic participation for persons from high 
local-retail-employment states all the way out to age 65. Thus, there does appear to 
be a multilevel relationship in which an area's local retail employment influences 
the association between age and civic participation. 

Figure 10. Age and Civic Behavior by Local Retail 

Figure 11 depicts the relationship between education and civic behavior. These 
lines do not intersect, and the slope of schooling is a bit steeper for states with 
high levels of local retail (the thicker of the two lines). The model results suggest 
that the presence of locally oriented retail increases the likelihood that educated 
persons will exhibit civic behaviors. In sum, this simple analysis suggests that a 
civic context can indeed condition individual-level processes. 


Concluding Remarks 


My colleagues and I have for some time been advising rural development 
practitioners to attend to the small business sector (Tolbert et al. 2001, 2002). 
Smokestack chasing, as a development tool, is hard to resist because of its potential 
payoff. But, it seems lately that the chase stops at the shoreline as production 
operations uproot and head offshore. That leaves large-scale rural development 
specialists with only a handful of other prospects: tribal casinos, prisons, dams, 
and power plants. None of these will occur in numbers sufficient to reverse the 
decline of rural America. The one constant communities have is an independent 
middle class of locally oriented businesspersons. Our advice: nurture the small 
business sector. 

Figure 11. Education and Civic Behavior by Local Retail 

In our ongoing work, my colleagues and I are now turning our attention 
to the urban fringe as a natural laboratory for studying the clash between civic 
community and the sprawl of meaningless space. Studies of the fringe hold 
much promise for understanding the dynamics of community decline and the 
mechanisms of community resilience. There is a sociological literature on ethnic 
enclaves suggesting that urban (and more recently suburban) ethnic enclaves are 
important places to study the varieties of civic communities and the roles they 
play in assimilation. That vast literature should be revisited to shed more light 
on local business persons. I think we will find that enclave researchers have been 
telling us all along that small businesses are at the core of local social structures.⁹ 
Our planning and architectural colleagues who follow the new urbanism also 
have much to contribute. When they design combined residential, commercial, 
and civic spaces, they are in effect building venues for civic community. Yet, they 
often face structural barriers in terms of outdated zoning ordinances and other 
restrictions enacted to keep large developments out of our backyards. 

What are we willing to tolerate personally in order that civic community 
prospers? Higher prices? Higher buildings looming over our neighborhoods? 
And, what about the South? Here, we need to think regionally and act locally. We 
need to ask ourselves: “What are the local costs of low prices?” Could it be that 
the most rational choice is the economically irrational choice of paying more for 
something? I believe one thing is certain. If we don't attend to the restructuring 
retail sector and the associated consequences, we will pay more. 


Notes 


1. My associates in this research program are Troy C. Blanchard, Mississippi State University; 
Michael D. Irwin, Duquesne University; Thomas A. Lyson, Cornell University; and Alfred R. 
Nucci, U.S. Census Bureau. 


2. For an overview of this line of work, see Irwin et al. (2004); Lyson (2004); Lyson and 
Tolbert (2003); Tolbert, Lyson and Irwin (1998); and Tolbert et al. (2001). 


3. For details on our definition of civic denominations, see Tolbert, Lyson and Irwin (1998) 
in this journal. 


4. Mills and Ulmer's (1946) work is important to us because it was one of the early sociological 
statements on the beneficial consequences of small businesses for communities. Subsequently, 
Mills (2002) saw small business yielding to larger corporate interests. In both studies, there 
are clear social class implications. 


5. For a more detailed, county-level analysis along the same lines, see Blanchard et al. 
(2003). 


6. It is plausible that the West was also late in developing. Yet, we do not see much 
correspondence between the southern and western data series. My collaborator from the 
Census Bureau, Alfred Nucci, has noted in private communication that data on western 
business establishments are often dominated by patterns in the state of California. Were 
we to remove the California data, we might indeed observe a convergence of southern and 
western trends. 


7. The data were downloaded from the bowlingalone.com Web site. See Putnam (2000) for 
other applications of these surveys. 


8. The estimated level-one (individual) equation is the following: 


= 2.72 + .012 (age) + .18 (education) + .38 (own business) + .19 (own home) + 
0.25 (female) + .00 (household income) 


The estimated level-two (state) prediction equation for the effect of age on civic behavior 
is the following: 


Y = .0124 - .0004 (local retail employment) 


The estimated level-two (state) prediction equation for the effect of education on civic 
behavior is the following: 


Y = .1757 + .0020 (local retail employment) 


9. Portes and Zhou (1996) make this case in a most cogent manner. 


The Welfare State and Relative Poverty in Rich 
Western Democracies, 1967-1997* 


DAVID BRADY, Duke University 





Abstract 


This study investigates the relationship between the welfare state and poverty 
with multiple measures of the welfare state and poverty in an unbalanced panel 
of 18 Western nations from 1967 to 1997. While addressing the limitations of past 
research, the analysis shows that social security transfers and public health spending 
significantly reduce poverty. Less robust evidence exists that social wages reduce 
poverty, while public employment and military spending do not significantly affect 
poverty. The welfare state's effects are far larger than economic and demographic 
sources of poverty. The significant features of the welfare state entirely account for 
any differences in poverty between welfare state regimes, and these features have 
similar effects across welfare state regimes. The welfare state's effects on poverty did 
not change in the 1990s. Sensitivity analyses show the results hold regardless of the 
U.S. cases. The welfare state emerges as the primary causal influence on national 
levels of poverty. 


Perennially a contentious issue, the welfare state has recently come under a storm 
of criticism. For the first time since their inception, the generous West European 
welfare states appear unsustainable and possibly even counterproductive. Regarded for 
decades as egalitarian havens, many social democracies appeared to struggle with 
rigid, unproductive labor markets in the 1990s. Social democratic Sweden 
suffered a crisis of three straight years of negative economic growth, massive 
budget deficits, and high unemployment (Freeman et al. 1997). By contrast, the 
liberal, flexible and efficient U.S. seemed to experience an unprecedented decade 
of robust economic expansion. 

More specifically, the relationship between the welfare state and poverty has 
received tremendous scrutiny. Critics variously claim that even the most generous 
welfare states fail to eradicate poverty (Cantillion 1997) and that the welfare 
state provides disincentives (Murray 1984), carries a middle class bias (Burtless 
1994; Reingold et al. 2001), is inadequate (Freeman 1999), and is inefficient (Mayer 
1997). In his history of poverty, Katz (1989:113) lamented that poverty, hunger, 
and a lack of health insurance and housing remain unacceptably prominent and 
further noted that “Neither public policy nor private enterprise has moderated 
the great forces that generate poverty in America.” In his widely read text on 
the declining standing of working Americans, Krugman (1994:28) analyzed 
historical trends in poverty and concluded, “Neither generosity nor niggardliness 
seems to make much difference to the spread of the underclass.” Ultimately, the 
mounting criticism culminated with the 1996 U.S. Personal Responsibility and 
Work Opportunity Reconciliation Act, which both scientific studies and political 
rhetoric claim overwhelmingly vindicated social policy retrenchment (Bernstein 1999; 
Hernandez 1999; Pear 1999). 

At the same time, recent scholarly advances promise new opportunities to 
study the welfare state and poverty. While the welfare state appears increasingly 
unsustainable, the potential for research on the welfare state and poverty has never 
been more secure. The building of comparative historical data sets on welfare 
states has reached a zenith. Further, measurement innovations and pioneering 
high-quality income data have greatly advanced the study of poverty. Finally, 
theoretical developments in the sociology of the welfare state give an increased 
sophistication to empirical analyses of the relationship between the welfare state 
and poverty. 

In such a context, this study provides an unbalanced panel analysis of 18 
rich Western democracies from 1967 to 1997, multiple measures of the welfare 
state and poverty, and controls for economic performance and demographics. 
Building on the contributions of past research, the paper explicitly addresses five 
of its limitations. First, I incorporate the liberal economic critique by controlling for 
economic performance and demographics. Second, this paper compares various 
specific features of the welfare state to explore their precise effects on poverty. 
Third, I examine the impact of welfare state regimes and the effects of the key 
welfare state features across regimes. Fourth, I expand the comparative historical 
scope of inquiry. Fifth, I utilize more valid and reliable measures of poverty. 


Contributions and Limitations of Past Research 


An accumulating body of research has begun to investigate the relationship 
between the welfare state and poverty (Cantillion 1997; McFate et al. 1995; Nolan 
et al. 2000; Smeeding et al. 2001). In general, the literature falls into two modes. 
First, a community of analysts using the Luxembourg Income Study (LIS) has 
found a difference between poverty before and after taxes and transfers. Typically, 
these researchers utilize individual-level microdata and consider poverty among 
particular demographic groups. Based on significantly different means between 
poverty before and after taxes and transfers, scholars infer that the welfare state 
effectively reduces poverty among women, children, the elderly, and the rest of 
the population (Casper et al. 1994; Christopher et al. 2002; Jantti and Danziger 
2000). 

Second, a smaller but growing cluster of macrolevel cross-national or historical 
studies demonstrates a negative association between welfare state generosity and 
poverty. Most studies focus on the U.S. history of social policy and poverty trends 
(Blank 1997; Burtless 1994; Burtless and Smeeding 2001; Danziger and Weinberg 
1994; DeFina and Thanawala 2001; Page and Simmons 2000; Tomaskovic-Devey 
1991). Some show that welfare state generosity is associated with less poverty in 
a cross-section of rich Western democracies (Cantillion 1997; Korpi and Palme 
1998; Smeeding et al. 2001). For example, Kenworthy (1999) found a negative 
association between the welfare state and poverty across 15 affluent democracies. 
Very few studies examine both comparative and historical variation (Hanratty 
and Blank 1992; Hicks and Kenworthy 2003). Recently, Moller and her colleagues 
(2003) found that welfare generosity reduces adult poverty in 14 affluent countries 
from 1970 to 1997. 

On balance, this research substantially contributes to understanding 
how welfare states affect poverty. Yet, we are only beginning to untangle this 
complicated relationship. As mentioned above, this study addresses five limitations 
of past research: the liberal economic critique; diverse features of the welfare 
state; varieties of types of welfare states; historical and cross-national scope; and 
the measurement of poverty. To my knowledge, no study has addressed all five 
limitations. 


THE LIBERAL ECONOMIC CRITIQUE 


Arguably, the most powerful critique of the welfare state emerges from liberal 
economics (Gordon 1972; O'Connor 2001). I recently explained that liberal 
economics distills to four claims about poverty: (i) a nation's poverty depends on 
levels of economic growth; (ii) free market capitalism is, at least in the long term, 
a more successful route to poverty reduction; (iii) raising worker productivity 
through the acquisition of human capital reduces poverty; and (iv) the supply 
and demand of jobs, as represented by unemployment, predicts levels of poverty 
(Brady 2003b). Liberal economics does not represent all economic or economists' 
explanations of poverty, but is a set of implicit core assumptions anchored in 
classical economic theory and underlying policy, conventional wisdom, and social 
science. Two issues are relevant to this analysis. 

Liberal economics contends primarily that economic performance is the most 
important determinant of poverty (Cain 1998; Freeman 2001). Ellwood and 
Summers (1986:79) argue, “Economic performance is the dominant determinant 
of the poverty rate.” Blank (2000:6, 10) asserts, “A strong macroeconomy matters 
more than anything else,” and “the first and most important lesson for 
anti-poverty warriors from the 1990s is that sustained economic growth is a wonderful 
thing.”¹ After the 1996 U.S. welfare reform, several analysts concluded that the 
strong economy was mostly responsible for the decline in welfare recipiency 
(Ziliak et al. 2000). Alongside economic growth, the greatest priority is to reduce 
unemployment and to foster employment among the poor (O’Connor 2001; 
Tobin 1994). Haveman and Schwabish (1999:18) conclude, “Strong economic 
growth and high employment may again be the nation’s most effective antipoverty 
policy instrument.” Unemployment is routinely identified as the most important 
poverty issue in Western Europe (Gallie and Paugam 2000; Hauser et al. 2000). If 
the government has any role, the state should focus on raising the productivity 
of workers through increased education for skills and human capital (Karoly 
2001; Murnane 1994; O’Connor 2001). In liberal economics, increasing economic 
growth and productivity, and reducing unemployment are much more important 
to reducing poverty than the welfare state. 

A second claim of liberal economics is that the welfare state is counterproductive 
(Gilbert 2002). The welfare state purportedly has direct and indirect effects that 
actually increase poverty (Banfield 1970; Gilder 1981; Glazer 1988; Lindbeck 1995; 
Mead 1986; Murray 1984). Directly, welfare generosity encourages dependency 
and longer poverty spells (Bane and Ellwood 1994; Darity and Myers 1987; 
Leisering and Leibfried 1999). Thus, some features of the welfare state may 
actually increase poverty. Indirectly, welfare generosity provides an incentive 
for single parenthood, unemployment, and labor force nonparticipation (e.g., 
early retirement or single-earner couples) (Danziger, Haveman, and Plotnik 1981; 
Lichter et al. 1997; Moffitt 2000). Relatedly, many scholars contend that generous 
West European welfare states contribute to labor market rigidity and inefficiency 
and hence limit economic performance (Alesina and Perotti 1997; Freeman et 
al. 1997; Lindbeck 1998). If welfare generosity encourages unemployment, labor 
force nonparticipation, and single parenthood, and reduces economic growth 
and productivity, any poverty-reducing effects of the welfare state would be 
counterbalanced. If these economic and demographic variables have large effects 
on poverty, the welfare state could indirectly increase poverty.² 

Recent research on the welfare state and poverty has not fully considered the 
liberal economic critique. For example, Kenworthy (1999) and Moller and her 
colleagues (2003) control for the level of economic development with GDP per 
capita. However, those authors did not include economic growth in their models. 
Also, Kenworthy did not control for the main mechanisms of the welfare state’s 
potential counterproductivity: single parenthood, unemployment, and labor 
force nonparticipation. 


DIVERSE FEATURES OF THE WELFARE STATE 


The sociology of the welfare state contrasts sharply with liberal economics. A wealth 
of research presumes the welfare state is beneficial and concentrates instead on 
the sources of the welfare state. If the welfare state definitively reduces poverty, 
the only questions that remain are why some welfare states are more generous or 
developed more quickly than others. One of the contributions of this literature 
is the recognition that the welfare state has a diversity of specific features. To 
better understand its effects on poverty, researchers need to examine these specific 
features. 

Traditionally, analysts relied on measures of social spending or social security 
transfers (Burtless 1994; Huber and Stephens 2001). Such measures gauge the 
general extent of redistribution or welfare effort. By contrast, Esping-Andersen 
(1990) argued that the quality of welfare programs was actually a more crucial and 
valid measure of the welfare state than the quantity of welfare effort.³ Generalist, 
encompassing welfare states that sustain equal social citizenship — as opposed to 
those that guarantee basic economic security — are considered more successful at 
reducing poverty (Korpi and Palme 1998). In turn, scholars have emphasized the 
qualitative nature of the welfare state and followed Esping-Andersen's concept 
of decommodification with measures like social wages. Social wages are the 
amount of income workers would receive if they were to stop working and rely 
solely on the state. Kenworthy (1999) finds that social wages reduce poverty. 
Freeman (1999:24) endorses social wages over other social policies arguing that 
“Recentering the welfare state in this way is a way to institutionalize commitments 
to greater equality.” 

Beyond general measures, scholars suggest the value of at least three specific 
features (Huber and Stephens 2001). While prior studies have not analyzed 
whether these features alleviate poverty, it is equally valuable to assess their relative 
importance. First, public health spending is particularly salient (Bergman 1996; 
Conley and Springer 2001; Currie and Yelowitz 2000; Huber and Stephens 2001; 
Mullahy and Wolfe 2001; Wolfe 1994). The absence of health insurance is often 
the main obstacle to labor market entry for low-income families (Blank 1997). 
Further, the public provision of healthcare distinguishes the more encompassing 
social democratic welfare states from the most liberal variants. Second, public 
employment is essential in terms of the civil servants who provide social services 
and as a program to alleviate unemployment (Blank 1997; Huber and Stephens 
2001). Referring to public employment, Huber and Stephens (2000:323) argue that 
the “public delivery of a wide range of welfare state services is the most distinctive 
feature of the social democratic welfare state” (also Huber and Stephens 2001). 
Public employment provides work experience and income for the poor (Blank 
1994; Ellwood and Welty 2000; Gans 1995; Newman 1999); enhances the social 
mobility of the disadvantaged (Hout 1984); and reduces gender inequality and 
women’s poverty (Gornick and Jacobs 1998). Third, military spending has recently 
been neglected in the study of the welfare state. While it seems unorthodox to view 
the military as part of the welfare state, we should recall that the military functions 
as a Keynesian economic development institution (Hooks 1991). The state often 
boosts military spending to counteract economic stagnation and unemployment 
(Griffin et al. 1982; Mintz and Hicks 1984). Further, the military can serve as a 
turning point in the life course of disadvantaged young men by enabling upward 
social mobility (Mare and Winship 1984; Sampson and Laub 1996). 

Recent sociological research has begun to explore diverse features of the 
welfare state (Kenworthy 1999; Moller et al. 2003). However, no study specifically 
examines the effects of public health spending, public employment, or military 
spending. Moreover, past research has not compared these five features in one 
model. Doing so provides evidence about their relative effectiveness, and their 
specific influence after controlling for welfare generosity generally.⁴ 


VARIETIES OF TYPES OF WELFARE STATES 


The second main contribution from the sociology of the welfare state emphasizes 
the variety of welfare state types. The present generation of scholarship broadly 
contends that welfare states cluster into different types that are not necessarily 
directly comparable. These institutional clusters, regimes, or systems reflect the 
genetic historical legacies of social policy development and the state’s particular 
institutionalized tradition of intervention into the market (Freeman 1995; Hall 
and Soskice 2001; Hicks and Kenworthy 2003).⁵ Most influentially, 
Esping-Andersen’s (1990, 1999) typology of socialist, liberal, and conservative regimes 
reoriented scholarship away from universal explanations of social policy expansion 
and effects. In turn, Esping-Andersen’s work has generated two claims that are 
relevant to this analysis. 

First, social policies are expected to have different effects across the three 
regimes (Esping-Andersen 1990, 1999).⁶ Because of different historically 
institutionalized traditions of social policy development and execution, the 
consequences for stratification should vary across regimes (Hall and Soskice 
2001). By extension, social policies should have different effects on poverty 
across regimes. For example, Korpi and Palme (1998) argue that universal social 
policies reduce poverty more effectively than targeted social policies. Second, 
these regimes reflect deeper differences in the institutionalization of equality than 
can be captured by simply analyzing the levels of different welfare programs.⁷ 
Accordingly, examining different welfare programs will neglect the more crucial 
institutional sources of variation in societal inequality. Thus, analyzing welfare 
state regimes will ultimately yield a greater understanding of poverty. 

Despite the tremendous influence of this typology in the social science of the 
welfare state, no study has examined whether welfare state regimes influence levels 
of poverty or if welfare state programs have differing effects across regimes. 
Esping-Andersen's work has certainly guided the conceptualization and selection of 
welfare state measures (Kenworthy 1999; Moller et al. 2003). Nevertheless, only 
descriptive evidence exists on the patterns in poverty across welfare state regimes 
(Esping-Andersen 1999; Korpi and Palme 1998). 


HISTORICAL AND CROSS-NATIONAL SCOPE 


It is imperative that scholars broaden the cross-national and historical scope of 
research on the welfare state and poverty. When studies examine many countries, 
the historical scope is typically limited. Overwhelmingly, past cross-national 
analyses of the welfare state are limited to a cross-section of OECD nations at 
one point in time (Kenworthy 1999; Korpi and Palme 1998; Smeeding et al. 
2001). Microanalyses commonly examine individuals within a smaller number of 
welfare states at one or two points in time (Casper et al. 1994). When a longer time 
period is examined, the cross-national scope is severely constrained. For example, 
much research has focused on only the U.S., despite its unique and potentially 
anomalous position in the global economy (Burtless 1994; Danziger and Weinberg 
1994).⁸ Hence, the generalizability of past findings remains uncertain. Moller and 
her colleagues (2003) provide a rare exception by analyzing 14 countries from 
1970 to 1997. Nevertheless, the present study has a sample that is more than 30% 
larger, including 18 countries from 1967 to 1997. 

In addition, it is crucial that studies scrutinize the 1990s. The 1990s were a 
particularly challenging period of welfare state contraction and retrenchment 
(Clayton and Pontusson 1998; Esping-Andersen 1999; Gilbert 2002; Huber and 
Stephens 2001; Kenworthy 2002; Western and Healy 1999). Surveying welfare 
states, Stephens et al. remark, “Overall, then, by the late 1980s and early 1990s 
a picture of widespread cuts emerges, in some cases at least of considerable 
magnitude” (1999:191). While the social democracies faced a series of economic 
crises that threatened their sustainability, the minimalist liberal welfare state of the 
U.S. has seemingly produced both dynamic economic expansion and less poverty 
and welfare recipiency. As a result, the classic trade-off between social protection 
and economic efficiency (Okun 1975) has seemed more immediate and acute 
(Esping-Andersen 1996, 1999). For example, after its economic crisis in the 1990s, 
Freeman and his colleagues characterized the encompassing Swedish welfare 
state as “nearly impossible for the country to afford” (1997:11), “unsustainable” 
(1997:25), and “dysfunctional” (1997:27). While past studies suggested the welfare 
state influenced poverty prior to the 1990s, it remains highly debatable whether 
that relationship still held in the 1990s. To my knowledge, no study has tested 
the temporal stability of the welfare state’s effects. 


THE MEASUREMENT OF POVERTY 


Research on the U.S. overwhelmingly relies on the official measure of poverty. 
This measure has many well-documented reliability and validity weaknesses 
that raise questions about this research (Betson and Warlick 1998; Brady 2003a; 
Citro and Michael 1995; DeFina and Thanawala 2001; Foster 1998; O'Connor 
2001). International poverty researchers have made great advances in poverty 
measurement (Atkinson 1987; Hagenaars 1991; Sen 1992; Smeeding et al. 2001). 
Recently, I synthesized these advances into five criteria for more valid and reliable 
measures of poverty: (a) measure comparative historical variation effectively; (b) 
be relative rather than absolute; (c) conceptualize poverty as social exclusion; (d) 
integrate the depth of poverty and the inequality among the poor; and (e) assess 
the impact of taxes, transfers, and noncash benefits (Brady 2003a). Building on 
my recent article leads to three departures from recent research. 

First, this study incorporates the depth of poverty. Much poverty research 
utilizes simple headcount measures, for example, 50% of the median income 
(Casper et al. 1994; Kenworthy 1999; Moller et al. 2003; Smeeding et al. 2001). 
These measures justifiably operationalize poverty relatively, usually incorporate 
comprehensive definitions of household income, and effectively describe the 
percentage of the population that is poor. However, these measures fail to consider 
the depth of poverty, treating all poor people as equal regardless of their distance 
from the threshold. This neglects the intensity of poverty and problematically 
homogenizes the poor. In turn, this study incorporates the depth of poverty of 
the average poor household.⁹ 
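
To make the distinction concrete, the sketch below computes a relative headcount at 50% of the median income and the average depth of poverty for two small sets of made-up household incomes. The income figures and function names are hypothetical, and measuring depth as the mean proportional shortfall from the threshold is one common convention assumed here, not necessarily the exact formula used in this analysis.

```python
from statistics import median


def poverty_line(incomes, share=0.5):
    """Relative poverty threshold: a share (default 50%) of the median income."""
    return share * median(incomes)


def headcount(incomes, share=0.5):
    """Proportion of households with income below the threshold."""
    line = poverty_line(incomes, share)
    return sum(y < line for y in incomes) / len(incomes)


def average_depth(incomes, share=0.5):
    """Mean proportional shortfall from the threshold among poor households.

    ASSUMPTION: depth is (line - income) / line, averaged over the poor only.
    """
    line = poverty_line(incomes, share)
    gaps = [(line - y) / line for y in incomes if y < line]
    return sum(gaps) / len(gaps) if gaps else 0.0


# HYPOTHETICAL incomes: both societies have the same headcount,
# but the poor in society_b fall much further below the threshold.
society_a = [9, 9, 20, 20, 20, 30, 30, 40, 40, 50]
society_b = [2, 2, 20, 20, 20, 30, 30, 40, 40, 50]

for name, incomes in (("A", society_a), ("B", society_b)):
    print(name, headcount(incomes), round(average_depth(incomes), 2))
```

In this toy comparison a headcount measure alone would treat the two societies as identical, while the depth measure separates them.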

Second, I focus on poverty after taxes and transfers — what I call state-mediated 
(SM) poverty. SM poverty is based on the most comprehensive definition 
of household income, so it is the central, most valid and reliable measure of 
economic resources. By contrast, some scholars examine poverty before taxes 
and transfers — what I call market-generated (MG) poverty (Brady 2003b, 
2003c; Christopher et al. 2002; Moller et al. 2003). MG poverty is a simulated 
counterfactual, which is useful when examining individual adults. Unfortunately, 
there are serious limitations to MG poverty. First, MG poverty makes little sense 
when including the elderly in your sample, since the elderly often have little 
income before taxes and transfers in many countries. Second, it may be unrealistic 
to reify MG poverty on a macrolevel since states and markets inherently constitute 
each other (Esping-Andersen 1990, 2003; Fligstein 2001). Taxes and transfers 
always shape the income distribution, and it may be impossible to truly simulate 
a pretax and pretransfer income distribution. Finally, I explain below why MG
poverty is less relevant than SM poverty. There is much more variation in SM
than MG poverty, and it is this crucial variation that needs explanation. Further, 
MG and SM poverty are simply not empirically associated in a way that suggests 
the relevance of MG poverty. 

Third, as a supplement to my focus on SM poverty, I follow recent research 
and examine poverty reduction (Hicks and Kenworthy 2003; Moller et al. 2003).
Poverty reduction is calculated as the rate of change between poverty before taxes 
and transfers and poverty after taxes and transfers. While there are limitations 
to examining MG poverty, poverty reduction provides a dependent variable that 
incorporates the information contained in measures of MG poverty. Thus, poverty 
reduction provides a supplementary perspective on the relationship between the 
welfare state and poverty (Hicks and Kenworthy 2003; Moller et al. 2003).
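
The poverty reduction calculation described above can be illustrated with a minimal sketch. It assumes the rate of change is expressed as the percent decline from MG poverty to SM poverty; the function name and the example values are hypothetical and are not taken from the LIS data.

```python
def poverty_reduction(mg_poverty: float, sm_poverty: float) -> float:
    """Percent decline from market-generated (MG) to state-mediated (SM) poverty.

    Assumption: the "rate of change" is (MG - SM) / MG, scaled to a percent.
    """
    if mg_poverty <= 0:
        raise ValueError("MG poverty must be positive to compute a rate of change")
    return 100.0 * (mg_poverty - sm_poverty) / mg_poverty


# Hypothetical country-year: 30 percent poor before taxes and transfers,
# 9 percent poor after taxes and transfers.
example = poverty_reduction(mg_poverty=30.0, sm_poverty=9.0)
print(example)
```

Under this reading, larger values indicate that taxes and transfers remove a larger share of market-generated poverty.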


Methods 


To investigate the welfare state and poverty, I follow recent studies of the macro- 
level variation in poverty and inequality (Alderson and Nielsen 1999, 2002; Brady 
2003b, 2003c; Gustafsson and Johansson 1999; Moller et al. 2003). I utilize an
unbalanced panel research design where the unit of analysis is a country-year. 
Because of limited observations for my dependent variable (see below), cases 
are unevenly distributed across 18 countries (Ns) and 31 years (Ts). Due to 
unmeasured time-invariant cross-national heterogeneity, ordinary least squares 
(OLS) regression is inappropriate (Hsiao 2003). Using Stata, I analyzed models 
with several techniques. 

For theoretical and methodological reasons, I present random-effects (RE) 
models. First, both cross-national and historical variations are crucial. The RE 
model better facilitates estimation of the effects of the independent variables 
on the dependent variables when both cross-national and historical variation 
are essential (Beck 2001; Beck and Katz 1996; Greene 1990). It is valuable 
to understand why some nations have more or less poverty and why poverty 
increases or decreases over time. In fact, the standard deviations between nations 
are much larger than within nations for most of the key variables. Further, the 
number of countries (18 Ns) far exceeds the average number of time points (4.5 
Ts). Hence, the cross-national (between) variation is arguably more important 
than the historical (within) variation. Second, statistical tests accept RE models.
Third, in small and unbalanced samples with more Ns than Ts, RE models may
perform better than the alternatives (Beck 2001; Bhargava and Sargan 1983;
Greene 1990; Hsiao 2003). By contrast, the alternatives are often problematic
in small and unbalanced samples, especially when the Ns far exceed the Ts.
Fourth, including dummy variables for welfare state regimes (see below) prevents 
the use of FE models (Beck 2001; Beck and Katz 2001). Fifth, the conclusions 
are not sensitive to these technical choices. In analyses available upon request, I 
estimated a host of alternative techniques. Though these alternatives may be less 
appropriate, the conclusions are consistent. 
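
The comparison of between-nation and within-nation standard deviations mentioned above can be sketched as follows. This is only an illustration under simple assumptions: the country labels and values are hypothetical, and statistical packages may use slightly different degrees-of-freedom conventions for these quantities.

```python
from collections import defaultdict
from statistics import mean, stdev


def between_within_sd(rows):
    """Split panel variation into between-country and within-country parts.

    rows: list of (country, value) pairs, one per country-year.
    Returns (between_sd, within_sd), where between_sd is the standard
    deviation of country means and within_sd is the standard deviation of
    each observation's deviation from its own country mean.
    """
    by_country = defaultdict(list)
    for country, value in rows:
        by_country[country].append(value)
    country_means = {c: mean(values) for c, values in by_country.items()}
    between = stdev(list(country_means.values()))
    deviations = [value - country_means[country] for country, value in rows]
    within = stdev(deviations)
    return between, within


# Hypothetical unbalanced panel: three countries with 3, 2, and 3 time points.
rows = [
    ("A", 4.0), ("A", 5.0), ("A", 6.0),
    ("B", 10.0), ("B", 11.0),
    ("C", 15.0), ("C", 16.0), ("C", 17.0),
]
between, within = between_within_sd(rows)
print(between, within)
```

When the between component is much larger than the within component, as in this toy panel, most of the variation to be explained lies across countries rather than over time.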

In addition to statistical significance and basic fit statistics, the Bayesian 
Information Criterion Prime (BIC’) assists model selection. Raftery (1995) shows 
that BIC’ selects the more parsimonious model unless model fit is significantly 
enhanced. Specifically, the more negative (i.e., lower) value of BIC' is preferred,
with positive values of BIC' indicating that a model is less preferred than a model
with no independent variables.
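
As an illustration of how BIC' trades fit against parsimony, the sketch below uses the linear-regression form given by Raftery (1995), BIC' = n ln(1 - R²) + p ln(n). How BIC' was computed for the random-effects models is not stated here, so the function and its inputs are illustrative assumptions rather than a reproduction of the reported values.

```python
import math


def bic_prime(r_squared: float, n_obs: int, n_predictors: int) -> float:
    """Raftery's BIC' for a linear model, relative to a model with no predictors.

    Negative values favor the model over the null model; each added predictor
    costs ln(n_obs) unless it improves R-squared enough to offset the penalty.
    """
    if not 0.0 <= r_squared < 1.0:
        raise ValueError("r_squared must lie in [0, 1)")
    return n_obs * math.log(1.0 - r_squared) + n_predictors * math.log(n_obs)


# Hypothetical comparison with 81 cases: a third predictor that adds nothing
# to R-squared makes BIC' less negative, so the smaller model is preferred.
small = bic_prime(r_squared=0.5, n_obs=81, n_predictors=2)
large = bic_prime(r_squared=0.5, n_obs=81, n_predictors=3)
print(small, large)
```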


DEPENDENT VARIABLES 


The source of poverty data is the Luxembourg Income Study (LIS). The LIS 
provides cross-nationally and historically comparable individual-level, nationally 
representative datasets. Cumulatively, LIS provides almost standardized data on 
household income — what LIS calls “Lissified,” with similar variables across data 
sets, similar samples, and equal weights, which allows for population estimates. 
I utilize an updated version of my estimates of relative income poverty (Brady 
2003a). The analysis includes observations from 18 nations with an average of 
4.5 time points each, generating an unbalanced sample of 79-81 cases. Unlike
Moller and her colleagues, who focus on 25 to 59 year olds, I examine poverty 
in the entire population. Doing so includes children and the elderly, two groups 
that are much more vulnerable to being in poverty (Brady 2004; Smeeding et 
al. 2001). 

This study focuses on what I call the Interval (HI) measure of poverty, which 
is the product of H and I (Brady 2003a). H is the headcount, the percent of the
sample that is below 50 percent of the median income. I is the average depth of
poverty, the difference between the median income in the sample and the mean
income of the poor subsample. Hence, HI is not a rate but synthesizes the poverty
rate and average depth of poverty into one index. For comparison purposes,
I also present the main models for H. As mentioned above, I analyze both state 
mediated (SM) poverty (after taxes and transfers) and poverty reduction (the 
rate of change between MG and SM poverty). 
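
The definitions of H, I, and HI above can be sketched in a few lines. This is a simplified illustration, not the procedure applied to the LIS data: it ignores sample weights and household equivalence adjustments, and it assumes that I is expressed as a proportion of the median income so that HI stays on the scale of H. The income values are hypothetical.

```python
from statistics import mean, median


def poverty_measures(incomes, threshold_share=0.5):
    """Return (H, I, HI) for a list of household incomes.

    H  : percent of cases below threshold_share * median income.
    I  : gap between the median income and the mean income of the poor,
         divided by the median (the normalization is an assumption).
    HI : product of H and I.
    """
    med = median(incomes)
    poverty_line = threshold_share * med
    poor = [y for y in incomes if y < poverty_line]
    if not poor:
        return 0.0, 0.0, 0.0
    h = 100.0 * len(poor) / len(incomes)
    i = (med - mean(poor)) / med
    return h, i, h * i


# Hypothetical sample of ten household incomes.
sample = [5, 10, 30, 40, 50, 60, 70, 80, 90, 100]
h, i, hi = poverty_measures(sample)
print(h, i, hi)
```

Two samples with the same headcount can yield different HI values when the poor in one sample sit further below the median, which is the information a headcount alone discards.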


WELFARE STATE VARIABLES 


This study analyzes five features of the welfare state and three welfare state 
regimes. All measures are in the present year, as lagged values and moving
averages produce identical results. Social Security Transfers as a % of GDP includes
state-sponsored cash transfers for sickness, old age pensions, family allowances, 
unemployment and workers’ compensation, and other assistance (OECD 2000a). 
Social Wage is Kenworthy’s (1999) measure of the percent of former income that 
an average worker would receive if work halted. This calculates the average income 
received over a period of several years through maternity leave, unemployment, 
disability assistance and other transfers for nonworking adults, as a proportion 
of the average production wage. Since it is calculated biennially in odd years,
I used the preceding year for even years. Neither of these two general measures
incorporates state spending for services or in-kind benefits. Public Health Spending
as a % of Total Health Spending summarizes all public spending on healthcare,
medicine, and public health and includes transfers and in-kind benefits and
services (OECD 1998; see Huber and Stephens 2001). Public Employment as a %
of Civilian Employment measures all civilian nonmilitary government employment
as a percent of the working-age population (Cusack 2004). Military Spending as
a % of GDP includes all spending on military personnel, technology, equipment,
weapons, and research (SIPRI 1999). 
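
The rule used for Social Wage, carrying the preceding odd-year value into each even year, can be sketched as below. The function name and the values are hypothetical; the sketch only illustrates the fill rule described above.

```python
def fill_even_years(odd_year_values, years):
    """Carry each odd-year value forward into the following even year.

    odd_year_values: dict mapping odd years to the measured value.
    years: iterable of years for which a value is wanted.
    Years with no measurement in that year or the preceding year are omitted.
    """
    filled = {}
    for year in years:
        if year in odd_year_values:
            filled[year] = odd_year_values[year]
        elif year - 1 in odd_year_values:
            filled[year] = odd_year_values[year - 1]
    return filled


# Hypothetical social wage values measured in 1991 and 1993 only.
measured = {1991: 0.30, 1993: 0.32}
series = fill_even_years(measured, range(1991, 1995))
print(series)
```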

Beyond these specific welfare state features, I analyze the interaction of 
the most important features with two relevant contexts. First, I examine 
Esping-Andersen's (1990) schema of socialist, liberal, and conservative welfare
state regimes. In addition, I code Luxembourg as socialist owing to its very 
encompassing programs, and code Spain as conservative due to its traditional 
family relations and authoritarian legacy. Hence, I include dichotomous variables 
for conservative and socialist regimes, while liberal is the reference. I also include 
the interaction of conservative and socialist regimes with the main welfare state 
features. Second, I investigate the welfare state's impact in the 1990s. I created 
a dummy variable for the 1990s period and include interactions of that period 
with the main welfare-state features. 
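
The coding of regime dummies, the 1990s period dummy, and their interactions with the main welfare state features can be sketched as follows. The record layout and field names are hypothetical, and the assumption that the 1990s period covers 1990 through 1999 is made only for illustration.

```python
def add_regime_terms(case):
    """Return a copy of one country-year record with dummies and interactions.

    Liberal regimes are the reference category, so they receive zeros on both
    regime dummies. Field names are illustrative, not taken from the data set.
    """
    out = dict(case)
    out["conservative"] = int(case["regime"] == "conservative")
    out["socialist"] = int(case["regime"] == "socialist")
    out["period_1990s"] = int(1990 <= case["year"] <= 1999)
    for feature in ("transfers", "public_health"):
        out[f"conservative_x_{feature}"] = out["conservative"] * case[feature]
        out[f"socialist_x_{feature}"] = out["socialist"] * case[feature]
        out[f"{feature}_x_1990s"] = out["period_1990s"] * case[feature]
    return out


# Hypothetical country-year in a socialist regime during the 1990s.
example = add_regime_terms(
    {"regime": "socialist", "year": 1995, "transfers": 20.0, "public_health": 85.0}
)
print(example)
```

Each interaction equals the feature's value inside the relevant regime or period and zero elsewhere, so its coefficient estimates how the feature's effect differs from the reference category.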


CONTROL VARIABLES 


Following my earlier research, I control for the liberal economic model: economic 
growth, productivity, and unemployment (Brady 2003b). First, I measure economic 
growth as the current annual rate of change in gross domestic product (GDP) 
of purchasing power parity (PPP) dollars (OECD 1998). Second, I measure 
productivity as GDP in PPP per civilian employee utilizing comparable data on 
labor force participation (OECD 2000b). This productivity measure, which is 
lagged one year, proxies the level of economic development and human capital.
Third, I measure the supply and demand of workers as unemployment in the 
current year with standardized unemployment rates that permit cross-national 
and historical comparison of the percent of the labor force unemployed (OECD 
2000c). 

In addition, I control for two demographic variables. According to liberal
economics, the welfare state may be indirectly counterproductive by increasing
these demographic factors, which are expected to increase poverty. First, each
model controls for the percentage of the population not in the labor force
(OECD 1998). This variable sums the percentage of the population that is retired
or too young to participate in the labor force as well as adults who are not
participating in the labor force. Second, I included the percentage of children
in single mother families.


DESCRIPTIVE STATISTICS


Table 1 presents the means and standard deviations for all variables. Unfortunately, 
data on both MG and SM poverty are not available for two cases (see fn. 17), so
the poverty reduction analyses contain two fewer cases. Since the sample changes 
slightly, I present descriptive statistics for the independent variables for each 
dependent variable. 

Additionally, one crucial finding deserves mention. The coefficient of variation 
is considerably larger for SM poverty than for MG poverty (Brady 2003a). Thus,
there is much more variation in poverty after taxes and transfers than before. 
While all countries generate poverty in the market, only some of those reduce 
the poverty. Thus, a crucial comparative-historical pattern emerges. With little 
variation, all nations produce poverty. Much greater variation exists in poverty 
after states use taxes and transfers. This pattern so fundamentally shapes the 
distribution of poverty that SM and MG poverty are not empirically associated. 
In fact, MG and SM poverty are not significantly correlated, and the correlation
is surprisingly negative (r = -.20 for HI, and r = -.18 for H). In RE models with MG poverty
as the only independent variable, it is only weakly associated with SM poverty. 
As a result, an independent variable is not likely to have an indirect effect on 
SM poverty through MG poverty. Since MG poverty is weakly associated with 
SM poverty, any influence on MG poverty is less relevant to the more valid and 
reliable SM poverty. Coupling these facts with the greater validity of SM poverty, 
the essential variation occurs after taxes and transfers. To better understand how 
the welfare state affects poverty, scholars should focus on SM poverty. 
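
The two descriptive comparisons in this paragraph, the coefficient of variation and the correlation between MG and SM poverty, can be sketched with the standard formulas. The two short series below are hypothetical and serve only to show the calculation.

```python
import math
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Standard deviation divided by the mean (unit-free spread)."""
    return stdev(values) / mean(values)


def pearson_r(x, y):
    """Pearson correlation between two equally long sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


# Hypothetical country-years: MG poverty varies little, SM poverty varies a lot.
mg = [30.0, 31.0, 29.0, 32.0, 30.0]
sm = [5.0, 12.0, 9.0, 4.0, 15.0]
print(coefficient_of_variation(mg), coefficient_of_variation(sm))
print(pearson_r(mg, sm))
```

Because the coefficient of variation divides by the mean, it allows the spread of SM and MG poverty to be compared even though their levels differ.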

Table 2 displays the models of state-mediated interval (HI) and headcount (H)
poverty on the five measures of the welfare state and the economic
and demographic controls. In the first, full model for each dependent variable,
I include all five features of the welfare state. In the second, preferred model, I
include only the two most important features of the welfare state.

In the first SM poverty models, social security transfers, social wage, and 
public health spending significantly reduce poverty, while public employment 
and military spending are insignificant. Social security transfers and public health 
spending have the most robust significant effects. Social wage is significant and 
appears to have a larger standardized coefficient than social security transfers 
in the first SM HI model. However, in models without public employment and 
military spending (not shown), social wage has a smaller coefficient and t-score 
than social security transfers. As I explain below, social wage does not significantly 
affect poverty reduction. Though Table 2 provides some evidence that social 
wage reduces poverty, social wage's effects are less robust than the effects of 
social security transfers and public health spending. BIC' very strongly prefers 
the second models for SM H and HI poverty. 
Table 1. Descriptive Statistics (Means and Standard Deviations in
Parentheses) Models of State-Mediated Poverty

                                            SM Poverty              Poverty Reduction
                                            (N=81)                  (N=79)

Interval (HI)                               6.511 (2.375)           76.153 (2.683)
Headcount (H)                               9.710 (3.426)           68.915 (12.854)
Social Security Transfers                   15.201 (4.519)          15.099 (4.529)
Social Wages                                .277 (.129)             .277 (.121)
Public Health Spending                      76.661 (11.869)         76.713 (12.014)
Public Employment                           11.850 (4.437)          11.832 (4.490)
Military Spending                           2.641 (1.250)           2.683 (1.237)
Economic Growth                             2.149 (3.612)           2.138 (3.644)
Productivity                                45236.270 (7429.575)    45338.950 (7476.835)
Unemployment                                7.067 (3.584)           7.147 (3.593)
% of Population Not in Labor Market         54.487 (5.236)          54.475 (5.391)
% of Children in Single Mother Families     10.384 (4.596)          10.346 (4.645)
Conservative Regime                         .309 (.465)             .316 (10.835)
Socialist Regime                            .346 (.479)             .329 (.473)
Conservative x Social Security Transfers    4.966 (7.579)           5.092 (7.633)
Socialist x Social Security Transfers       6.371 (9.193)           6.046 (9.074)
Conservative x Public Health Spending       23.595 (35.576)         24.190 (35.825)
Socialist x Public Health Spending          29.092 (40.485)         27.940 (40.330)
1990s Period                                .457 (.501)             .456 (.501)
Social Security Transfers x 1990s           7.440 (8.591)           7.381 (8.550)
Public Health Spending x 1990s              34.769 (38.963)         34.721 (39.020)


4, June 2005 
Hence, social security transfers and public health spending most effectively
reduce SM poverty. These two welfare state features produce large reductions in 
SM poverty. A standard deviation increase in social security transfers reduces 
SM HI poverty by between .2 and .26 standard deviations, holding all other 
variables constant at their means. A standard deviation increase in social security 
transfers reduces SM H poverty by between .3 and .38 standard deviations. 
For a standard deviation increase in public health spending, SM HI poverty is 
expected to decline by between .56 and .58 standard deviations. For a standard 
deviation increase in public health spending, SM H poverty is expected to decline 
by between .59 and .62 standard deviations. The standardized coefficients for 
the significant welfare state features dwarf the standardized coefficients for the 
economic and demographic control variables. Notably, public health spending 
has the largest effect. Table 2 suggests that the welfare state has similar effects 
for H and HI poverty. 
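
The standardized effects reported above express a coefficient in standard deviation units. A minimal sketch of that conversion follows. The standard deviations are the Table 1 values for social security transfers and SM interval poverty, but the unstandardized coefficient is a hypothetical placeholder, not an estimate from Table 2.

```python
def standardized_coefficient(b: float, sd_x: float, sd_y: float) -> float:
    """Change in y, in standard deviations, per one standard deviation of x."""
    return b * sd_x / sd_y


SD_TRANSFERS = 4.519   # Table 1, Social Security Transfers (SM poverty sample)
SD_INTERVAL = 2.375    # Table 1, Interval (HI) SM poverty
HYPOTHETICAL_B = -0.12 # placeholder unstandardized coefficient, for illustration only

beta = standardized_coefficient(HYPOTHETICAL_B, SD_TRANSFERS, SD_INTERVAL)
print(round(beta, 3))
```

Standardizing in this way is what allows the welfare state features to be compared with economic growth and the demographic controls, which are measured in different units.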

The nonsignificant effects of the other two features are noteworthy. Though 
insignificant, public employment has surprisingly positive t-scores near 1.0. 
Military spending never approaches statistical significance. Importantly, the 
nonfindings for public employment and military spending demonstrate that the 
effects of social security transfers and public health spending (and social wages) 
are not simply true by definition. If those significant effects occurred solely due 
to the relative definition of poverty, one would observe that all welfare programs 
reduce SM poverty. In fact, two measures do not affect poverty. Considering the 
control variables included, it is unlikely that the welfare state’s effects are simply 
due to the relative poverty measure. 

The SM poverty results do not support liberal economic claims. The welfare 
state, not economic performance, is the most important influence on poverty. 
While economic growth significantly reduces SM poverty, its standardized 
coefficient is less than a sixth as large as the combined effect of social security 
transfers and public health spending. Productivity and unemployment do not have 
significant effects, and their standardized coefficients would be small. Moreover, 
none of the welfare state measures directly increases poverty with significant 
positive effects. As well, the results suggest that the welfare state does not have 
indirect positive effects on poverty. The two demographic variables — the percent 
of the population not in the labor market and the percent of children in single 
mother families — are insignificant and their small standardized coefficients suggest 
they are much less important than the welfare state. Even if the welfare state 
increased unemployment, labor force nonparticipation, and single parenthood 
and undermined economic growth and productivity, the welfare state does not 
indirectly increase poverty. Since those economic and demographic variables 
do not significantly affect poverty, the welfare state cannot indirectly increase 
poverty.


MODELS OF POVERTY REDUCTION


The next four columns in Table 2 display models of poverty reduction. These 
analyses supplement the analyses of SM poverty and further scrutinize if the 
welfare state has nonspurious effects on poverty. Broadly, the results are consistent 
with the analyses of SM poverty. The welfare state is the largest determinant of 
poverty reduction, despite controlling for economic and demographic variables. 
Two welfare state features significantly increase poverty reduction: social security 
transfers and health spending. Again, health spending is most powerful. A 
standard deviation increase in social security transfers raises HI poverty reduction 
by between .29 and .33 standard deviations and H poverty reduction by between 
.35 and .4 standard deviations. A standard deviation increase in public health 
spending augments HI poverty reduction by about .59 standard deviations and 
H poverty reduction by about .58 standard deviations. These welfare state effects 
are similar across H and HI poverty reduction. 

Importantly, social wages do not significantly affect poverty reduction. Thus, 
social wages have a less robust relationship with poverty. As with SM poverty, 
public employment and military spending do not affect poverty reduction. Given 
the results for both dependent variables, it is unlikely that these two features 
effectively reduce poverty. Welfare states most successfully reduce poverty by 
emphasizing social security transfers and public health spending. As with SM 
poverty, BIC' very strongly prefers the second poverty reduction models. 

The economic and demographic variables are far less relevant to poverty 
reduction. Economic growth significantly increases poverty reduction, but its 
standardized coefficient is much smaller than the significant aspects of the welfare 
state. While economic growth contributes to poverty reduction, the welfare state 
is far more important. Productivity, unemployment, and the two demographic 
variables do not significantly affect poverty reduction.


INTERACTIONS WITH WELFARE STATE REGIMES AND THE 1990S 
PERIOD 


Esping-Andersen (1990) and others contend that welfare states are of different 
types, where social policies function differently across regimes. Also, many scholars 
mark the 1990s as a period when the welfare state began to lose its effectiveness 
and became counterproductive. Tables 3 and 4 examine these issues. As in Table 
2, all analyses control for the economic and demographic controls. From this 
point, I confine my presentation to HI poverty. Similar analyses of H poverty 
are available on request. Testing Esping-Andersen’s claims, I first test if the 
socialist-conservative-liberal typology explains poverty; second, I examine if any 
inter-regime differences can be explained by the two main features of the welfare 
state; and third, I examine if the effects of these two features differ across regimes. 
Evaluating the 1990s, I first analyze if the 1990s had different poverty levels after 
controlling for the main features of the welfare state; and second, I test whether
the effects of these features differed in the 1990s. At the bottom of each table, 
I display the BIC' statistics for the baseline models in Table 2. Comparing BIC’ 
provides more evidence of whether these analyses are preferred to the baseline 
models in Table 2. 

Table 3 displays these analyses for SM poverty. The first model demonstrates 
that both conservative and socialist regimes have significantly less SM poverty 
than liberal regimes. 

However, the next model shows that after including social security transfers 
and health spending, the effects of conservative and socialist regimes attenuate 
to non-significance. Thus, differences in these two welfare state features entirely 
account for any inter-regime differences in SM poverty. Model 3 shows that the 
effects of these welfare state features do not differ across regimes. None of the 
interaction terms is significant, and the main effects for social security transfers 
and health spending remain significantly negative. BIC’ strongly prefers Table 2’s 
baseline model, with only social security transfers and health spending, over these 
three models. Thus, the results in Table 3 contradict Esping-Andersen’s arguments. 
Levels of these welfare state features explain any differences between welfare state 
regimes in SM poverty, and these features do not have significantly different effects 
across regimes. SM poverty is best explained by simply examining social security 
transfers and health spending and neglecting welfare state regimes. 

Table 3 shows that the 1990s did not experience significantly different SM 
poverty. The coefficient for the 1990s period is not significant. Further, the effects 
of social security transfers and health spending did not change in the 1990s. 
The coefficients for the interaction of 1990s period and the two features of the 
welfare state are not significant and are in opposite directions. Moreover, BIC’ 
prefers the baseline model in Table 2 — with the two welfare-state features and 
without the 1990s period effect and interaction terms. Thus, arguments that the 
1990s ushered in a changed relationship between the welfare state and poverty 
are not supported. 
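
The BIC’ comparisons in this section can be made explicit with a short sketch. It uses the BIC’ values reported in Table 3 and the grades of evidence proposed by Raftery (1995) for absolute BIC differences (0 to 2 weak, 2 to 6 positive, 6 to 10 strong, above 10 very strong). The function names are hypothetical, and the grading is offered as an aid to reading the table rather than as the procedure used in the analysis.

```python
BASELINE_BIC = -70.405  # baseline model BIC' reported under Table 3
TABLE3_BIC = {
    "Model 1": -9.556,
    "Model 2": -62.583,
    "Model 3": -50.503,
    "Model 4": -65.645,
    "Model 5": -60.214,
}


def evidence_grade(bic_difference: float) -> str:
    """Raftery's (1995) verbal grade for an absolute BIC difference."""
    size = abs(bic_difference)
    if size < 2:
        return "weak"
    if size < 6:
        return "positive"
    if size < 10:
        return "strong"
    return "very strong"


def compare_to_baseline(model_bic: float, baseline_bic: float = BASELINE_BIC):
    """Return which model BIC' prefers and how strongly.

    The more negative BIC' is preferred, so a positive difference
    (model minus baseline) favors the baseline.
    """
    difference = model_bic - baseline_bic
    preferred = "baseline" if difference > 0 else "model"
    return preferred, evidence_grade(difference)


results = {name: compare_to_baseline(bic) for name, bic in TABLE3_BIC.items()}
for name, (preferred, grade) in results.items():
    print(name, preferred, grade)
```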

Table 4 shows the results for poverty reduction are consistent with the results 
for SM poverty. Both conservative and socialist regimes have significantly more 
poverty reduction than liberal regimes. However, variation in social security
transfers and health spending accounts for these inter-regime differences. The
main effects for social security transfers and public health spending are significant 
while the regime effects are not. In model 3, the main effects for the two welfare 
state features continue to be significantly positive, and the regime effects continue 
to be insignificant. But, the interaction of social security transfers and socialist 
regime is significantly negative at the .10 level. Thus, some evidence exists that 
after controlling for the main effect of social security transfers, social security 
transfers are slightly less effective in socialist regimes. This surprising finding is 
not consistent with Esping-Andersen and should be taken with caution. Also, 
BIC’ very strongly prefers the baseline model in Table 2 over models 1-3. 
Table 3. Random-Effects Models of State-Mediated Interval Poverty on
Interaction Effects of Welfare State Regimes and 1990s Period
with Social Security Transfers and Public Health Spending in Rich
Western Democracies, 1969-1997 (N=81)

                                            Model 1      Model 2      Model 3      Model 4      Model 5

Socialist                                   -3.103***    .048         -4.438
                                            (-3.02)      (.05)        (-.75)
Conservative                                -1.797*      -.282        [illegible]
                                            (-1.66)      (-.35)       (.51)
Social Security Transfers                                -.151**      -.230*       [illegible]  [illegible]
                                                         (-2.15)      (-1.78)      (-3.16)      (-3.17)
Public Health Spending                                   [illegible]  [illegible]  [illegible]  [illegible]
                                                         (-4.87)      (-3.42)      (-5.89)      (-4.56)
Social Security Transfers x Socialist                                 .133
                                                                      (1.00)
Social Security Transfers x Conservative                              -.084
                                                                      (-.50)
Public Health Spending x Socialist                                    .032
                                                                      (.49)
Public Health Spending x Conservative                                 -.032
                                                                      (-.34)
1990s Period                                                                       [illegible]  1.858
                                                                                   (.46)        (1.04)
Social Security Transfers x 1990s Period                                                        .052
                                                                                                (.74)
Public Health Spending x 1990s Period                                                           -.034
                                                                                                (-1.38)
BIC’                                        -9.556       -62.583      -50.503      -65.645      -60.214
R² Within                                   .046         .114         .128         [illegible]  .152
R² Between                                  .457         .806         .822         .794         .807
R² Overall                                  [illegible]  .717         .733         [illegible]  .724

Baseline Model BIC’ = -70.405

Notes: For each independent variable, the unstandardized coefficient and t-score in parentheses
are displayed. All models also include Economic Growth, Productivity, Unemployment, Percent of
Population Not in Labor Market, Percent of Children in Single Mother Families, and a Constant. The
Baseline Model includes the control variables, Social Security Transfers and Public Health Spending.

*** p<.01 ** p<.05 * p<.10
Table 4. Random-Effects Models of Interval Poverty Reduction on Interaction
Effects of Welfare State Regimes and 1990s Period with Social
Security Transfers and Public Health Spending in Rich Western
Democracies, 1969-1997 (N=79)

                                            Model 1      Model 2      Model 3      Model 4      Model 5

Socialist                                   16.546***    4.703        32.14
                                            (4.02)       (1.07)       (1.36)
Conservative                                7.125*       1.259        30.130
                                            (1.65)       (.34)        (.98)
Social Security Transfers                                .602**       1.239**      [illegible]  [illegible]
                                                         (2.08)       (2.45)       (3.58)       (3.46)
Public Health Spending                                   [illegible]  [illegible]  [illegible]  [illegible]
                                                         (3.99)       (3.42)       (5.67)       (4.59)
Social Security Transfers x Socialist                                 -.937*
                                                                      (-1.84)
Social Security Transfers x Conservative                              .200
                                                                      (.30)
Public Health Spending x Socialist                                    -.197
                                                                      (-.74)
Public Health Spending x Conservative                                 -.472
                                                                      (-1.26)
1990s Period                                                                       .765         -2.353
                                                                                   (.56)        (-.34)
Social Security Transfers x 1990s                                                               -.182
                                                                                                (-.64)
Public Health Spending x 1990s                                                                  .079
                                                                                                (.82)
BIC’                                        -16.071      -59.027      -75.579      -67.774      -65.948
R² Within                                   .228         .250         .310         .230         .236
R² Between                                  [illegible]  .817         .853         .837         .836
R² Overall                                  .446         .736         .769         .799         .756

Baseline Model BIC’ = -77.305

Notes: For each independent variable, the unstandardized coefficient and t-score in parentheses
are displayed. All models also include Economic Growth, Productivity, Unemployment, Percent of
Population Not in Labor Market, Percent of Children in Single Mother Families, and a Constant.
The Baseline Model includes the control variables and Social Security Transfers and Public Health
Spending.

*** p<.01 ** p<.05 * p<.10
The 1990s did not experience significantly different poverty reduction than
earlier years, as the coefficient for 1990s period is not significant in model 4. 
Further, the effects of the welfare state did not change in the 1990s as interaction 
terms are not significant in model 5. In both models 4 and 5, the main effects of 
social security transfers and public health spending are the only significant effects. 
Finally, BIC' prefers the baseline model in Table 2 over both models. 


Sensitivity Analyses 


Table 5 scrutinizes the unique dynamics created with the inclusion of the U.S. in 
the sample. The U.S. has the least generous welfare state and the most poverty, 
and it is possible that the finding of a negative relationship depended on these 
anomalous cases. Though the random-effects models control for cross-national 
differences, readers may be concerned that the U.S. cases bias the results. In this 
table, I first include a dummy variable for the U.S. cases and second replicate the 
analyses after dropping the U.S. cases. 

The results of Table 5 are broadly consistent with the earlier tables. The 
most important causes of SM poverty and poverty reduction remain the two 
features of the welfare state. The effects of social security transfers are similar, and
those of public health spending are slightly smaller, but both remain significant. While
economic growth has slightly larger and more significant effects after dropping 
the U.S. cases, it is still less important than the welfare state. Also, productivity 
has statistically significant effects after dropping the U.S. cases. But, its coefficient 
is similar to those in Table 2 — much smaller than the effects of the welfare state 
variables. Finally, the U.S. dummy variable is not significant, approaching significance
only for poverty reduction. Overall, these results suggest that the analyses in the prior tables are
not biased by the U.S. cases. The conclusions are robust even after controlling 
for the idiosyncrasies of the U.S. 


Discussion 


This study investigates the welfare state's relationship with poverty in 18 rich 
Western democracies with multiple measures of the welfare state and poverty, and 
economic and demographic controls. Consistent with others, I provide evidence 
that the welfare state effectively reduces poverty (Kenworthy 1999; Moller et 
al. 2003). However, this study advances our understanding by addressing five 
limitations of past research. First, this paper incorporates the liberal economic 
critique by controlling for economic performance and demographics. Second, this 
paper evaluates five specific features of the welfare state to compare their precise 
effects on poverty. Third, I examine the impact of welfare state regimes and effects 
of the key welfare state features across regimes. Fourth, this study expands the 
Table 5. Random-Effects Models of Interval Poverty on Social Security
Transfers, Public Health Spending and Control Variables in Rich
Western Democracies With and Without the U.S., 1969-1997

                                        State Mediated                  Poverty Reduction
                                        With U.S.       Dropping        With U.S.       Dropping
                                        Dummy           U.S.            Dummy           U.S.

Welfare State
Social Security Transfers               [illegible]     [illegible]     [illegible]     [illegible]
                                        (-2.93)         (-3.14)         (3.61)          (3.76)
Public Health Spending                  [illegible]     [illegible]     [illegible]     [illegible]
                                        (-3.12)         (-3.24)         (2.76)          (2.90)

Economic
Economic Growth                         -.090**         [illegible]     .567***         .662***
                                        (-2.25)         (-2.71)         (3.59)          (3.92)
Productivity                            -.00003         -.00004*        .0002           .0002*
                                        (-1.54)         (-1.67)         (1.65)          (1.74)
Unemployment                            .051            .059            .128            .094
                                        (.87)           (.99)           (.53)           (.38)

Demographic
Percent of Population Not in            -.013           .000001         [illegible]     .101
Labor Market                            (-.31)          (-.00)          (.82)           (.58)
Percent of Children in                  .039            .038            .114            [illegible]
Single Mother Families                  (.69)           (.65)           (.50)           (.55)

U.S. Dummy                              1.601                           -11.649
                                        (.91)                           (-1.59)
Constant                                [illegible]     18.289***       18.446          17.494
                                        (4.52)          (4.47)          (1.11)          (1.04)

BIC’                                    -65.588         -22.427         -70.828         -30.498
R² Within                               [illegible]     .164            .263            .292
R² Between                              .800            .638            .819            .710
R² Overall                              [illegible]     .504            .738            .564
N                                       81              [illegible]     79              73

Note: For each independent variable, the unstandardized coefficient and t-score in parentheses are
displayed.

*** p<.01 ** p<.05 * p<.10
comparative historical scope of inquiry. Fifth, the analysis uses more valid and
reliable measures of poverty. I now review the study's evidence on each. 

First, this study does not support liberal economics. Economic performance, 
measured by economic growth, unemployment, and productivity, does not have 
larger effects than the welfare state. In fact, only economic growth significantly 
influences poverty. However, its standardized coefficient is far smaller than the 
significant features of the welfare state. Productivity and unemployment do not 
have significant effects. Even if these variables were significant, their standardized 
coefficients would be far smaller than the key features of the welfare state. Simply 
put, this study challenges liberal economic claims that economic performance is 
more important than the welfare state to reducing poverty. 

This study provides no evidence that welfare state generosity counterproductively 
increases poverty. None of the welfare state measures significantly increases SM 
poverty or significantly undermines poverty reduction. So, the welfare state does 
not directly increase poverty through dependency, deeper poverty, or longer 
poverty spells (Sanders 1990). The results also challenge claims of any indirect 
poverty-increasing effects of the welfare state. Even if welfare generosity increases 
unemployment, single parenthood, and labor force nonparticipation, none of 
these three has a significant effect on SM poverty or poverty reduction. Again, if 
these three were significant, their standardized coefficients would be far smaller 
than the key features of the welfare state. Since research on welfare disincentives 
has found them to have very small effects, it would be necessary for these three to 
have large effects for the welfare state to counterproductively increase poverty. 

It is important to recognize that the evidence for welfare disincentives remains 
highly debatable. Most studies have been done solely on the U.S. Studies of other 
industrialized democracies conclude that welfare generosity does not impede 
economic performance (Blank 1995). In my data, there is little prima facie 
evidence that welfare generosity undermines economic performance or increases 
labor force nonparticipation and single parenthood.28 These findings, along 
with the growing body of literature that is skeptical of the liberal economics of 
poverty (Brady 2003b), raise questions about its prominent position in debates 
about poverty. 

Second, two features of the welfare state are most influential in explaining 
poverty. As a general measure, social security transfers are more salient than 
social wages. While the effects of social wages are significant for SM poverty and 
insignificant for poverty reduction, the traditional social security transfers have 
consistently significant effects and the second largest effects in the model. Social 
wages focus only on the working population, while transfers are essential to the 
elderly, children, and those not in the labor force. Since these groups are much 
more vulnerable to poverty, generous transfers are essential for integrating them 
into society (Brady 2004). Though social wages reduce inequality by stabilizing 
workers' incomes through maternity leave and unemployment compensation, 
transfers integrate those that otherwise would be socially excluded. This study 
suggests that social security transfers have greater predictive validity for poverty. 
This is surprising since so much has been written about the purported theoretical 
superiority of social wages and decommodification compared to traditional 
measures like social security transfers (Esping-Andersen 1990; Korpi and Palme 
1998). 

Public health spending has the largest effect of the welfare state features, and 
in fact, of any variable. Though more research is needed to examine why it has 
such large effects, I can offer three explanations. First, health spending may have 
the largest effects because its denominator (total health spending) is far more 
precise than the denominator for social security transfers and military spending 
(GDP). In turn, health spending is measured with much less measurement error 
than these two. Still, though, health spending is comparatively more effective 
than social wages and public employment, since their denominators are equally 
precise (total employment and average wages). 

Second, health spending captures the redistributiveness of the welfare state, a 
concept for which there are few good measures. Because state-sponsored health 
care is expensive, it requires larger government budgets and higher taxes on 
households above the median. The higher taxes and large government budgets 
needed to finance these services redistribute income and lower poverty after taxes 
and transfers (Blank 1997; Huber and Stephens 2001; Korpi and Palme 1998).29 
By providing services to low-income households, public health care effectively 
redistributes resources across the entire income distribution and in turn reduces 
poverty. While transfers are contingent on previous employment income, health 
services are granted universally as a citizenship right in most welfare states. As 
Huber and Stephens (2000:324) argue, “The redistributive effect of the free or 
subsidized provision of public services and goods should differ from, and be 
greater than, the redistributive effect of transfer payments.” 

Third, this measure quantitatively tracks fundamental differences between 
encompassing and minimalist welfare states. Almost all welfare states publicly 
cover over 75% of total health costs. But the minimalist, high-poverty U.S. 
provides less than 50%. This measure may capture the substantive difference 
between liberal, conservative and socialist regimes. Health spending might 
be a quantitative means to understand the qualitative differences between 
encompassing and minimalist welfare states (Esping-Andersen 1990; Korpi and 
Palme 1998). Importantly, however, the effect of health spending does not merely 
mask differences between welfare state regimes or between the U.S. and other 
nations. If that were the case, health spending would not have significant effects 
in Tables 3, 4, and 5. This variable measures unique variation in welfare state 
generosity and does not simply proxy qualitative differences between countries. 
While these three explanations are plausible, more research is obviously needed 
to explore the specific causal processes. 

In contrast to the two key features, public employment and military spending 
do not significantly affect poverty. After controlling for welfare state generosity 
generally (with the other features), the analysis provides no evidence that 
countries could expect less poverty to result from increased public employment 
or military spending. The non-significance of public employment challenges 
arguments that it is the most important feature of the welfare state (Huber 
and Stephens 2001). Though not significant, public employment has positive 
t-scores near 1.0 for SM poverty. Interestingly, Tomaskovic-Devey (1991) found 
that public employment was positively associated with official poverty in U.S. 
counties. Tomaskovic-Devey's and this study's findings are inconsistent with the 
view that public employment is an effective antipoverty strategy. In turn, more 
research is needed to scrutinize how public employment affects poverty. 

Third, there is no evidence that welfare state regimes have independent effects 
on poverty or that social policies have different effects across regimes. While 
regime differences in poverty initially appear, all of the regime differences can 
be explained by social security transfers and public health spending.30 Since the 
regime differences attenuate to non-significance after including the welfare state 
features, the welfare state features explain variation in poverty, not historically 
institutionalized differences across welfare state regimes (contrast with fn. 3). 
Also, since the interactions of the welfare state features and regimes are mostly 
insignificant, there is little evidence that social policies operate differently 
across regimes (contrast with fn. 6 and 7). Only one of the interaction effects 
is significant, and that was the unexpected finding that social security transfers 
may be less effective in socialist regimes. Moreover, the simpler models in Table 
2 are consistently preferred over the more complicated models in Tables 3 and 4. 
Ultimately, this study is skeptical of the value of Esping-Andersen's typology for 
understanding poverty. Given the influence of Esping-Andersen and the tradition 
of typologies, these non-findings are quite surprising. 

Fourth, this study expands the comparative and historical scope of inquiry. 
On one level, this study expands beyond the samples of other recent sociological 
analyses. Given the robustness of earlier findings to this larger sample, this study 
provides even stronger evidence that welfare state generosity has negative effects 
on poverty. On a second level, this study provides the first scrutiny of the 1990s 
period. For both SM poverty and poverty reduction, the 1990s period did not have 
significantly different patterns. Also, the two key features of the welfare state did 
not have different effects in the 1990s. Again, the models with the 1990s period 
and related interactions are less preferred than the simpler models in Table 2. 
These non-findings challenge the view that the 1990s ushered in a different period 
for welfare states, and particularly social democracies. In the 1990s, generous 
welfare states continued to reduce poverty effectively. 

Fifth, this study extends beyond recent research by using more valid and 
reliable measures of poverty. I use a relative measure of poverty, which is an 
advance beyond the problematic official U.S. measure. Also, this study contributes 
to recent research in three ways. First, I incorporate the depth of poverty. The 
significance and size of the effects of the welfare state and other independent 
variables were similar across HI and H poverty. Second, I concentrate on poverty 
after taxes and transfers (SM). Third, I supplement the analysis by also examining 
poverty reduction. Unlike similar recent research, this study includes the entire 
population. Since the elderly and children are much more vulnerable to being 
poor than working-age adults, it is essential to incorporate the entire population 
to fully observe the welfare state's effects on poverty (Brady 2004). By advancing 
poverty measurement in these ways, this study provides novel evidence on the 
relationship between the welfare state and poverty. 

In sum, the welfare state is a generalizable, stable, and essential poverty- 
reduction mechanism. The many vocal critics of the welfare state are premature, 
and probably mistaken, in claiming that the welfare state is no longer effective 
at reducing poverty. Regardless of the era or welfare state regime, social security 
transfers and health spending are robust as the main predictors of poverty. 
Sociological research on poverty should recognize the central role of the welfare 
state in explaining poverty in rich Western democracies. The welfare state should 
feature prominently in theories of poverty. 


Notes 


1. Because Blank is one of the most visible welfare state advocates (and occasionally one of the more 
persuasive critics of liberal economics), her ambiguities should be noted. Three years 
prior, she argued, “Economic growth is not likely to be effective in the near future in 
reducing poverty” (1997:221); “Poverty is harder to address through broad-based economic 
growth policies now than thirty years ago” (222); and, “Changes in the jobs available to less- 
skilled workers have made those jobs less effective in helping people escape poverty” (222). 
Moreover, she documents welfare’s clear success (Chapter 4): “All of this evidence supports 
the view that cash transfers generally improve the economic well-being of their recipients” 
(1997:138); and “Many of our antipoverty efforts have accomplished exactly what they 
set out to accomplish” (1997:291). At the same time, Blank is skeptical that the U.S. will 
ever have a more generous welfare state. As a result, she proposes a three-tiered system of 
narrow, means-tested, targeted programs that almost exclusively rely on the market. Blank 
(1997:291) “propose[s] a reconfigured system of public assistance that moves us away from 
large-scale cash support and toward a more work-focused system.” While in 1994, she (204) 
noted, “At best, employment programs can serve as one piece of a larger overall strategy to 
fight poverty,” her policy recommendations then (1994:199) and in what was labeled “A New 
Agenda For Fighting Poverty” in 1997, concentrate on a strong macroeconomy, improving the 
human capital of children, job training, work incentives for welfare beneficiaries, tax relief, 
employment services, and targeting youth in high-unemployment neighborhoods. In this 
research (except Hanratty and Blank 1992), Blank used the absolute official U.S. measure of 
poverty. However, Blank also was a member of the NRC Panel that proposed abandoning the 
official measure and replacing it with a relative measure (Citro and Michael 1995). 


2. Welfare generosity has only been linked to very small effects on single parenthood, 
unemployment, and labor force nonparticipation (Lichter et al. 1997). Hence, it will take large 
effects of these variables to result in large indirect positive effects on poverty. For example, 
since welfare generosity has very small effects on single parenthood, single parenthood needs 
to have large effects on poverty for welfare generosity to be counterproductive. 
3. Esping-Andersen writes specifically, “The existence of a social program and the amount 
of money spent on it may be less important than what it does” (1990:2), “Expenditures 
are epiphenomenal to the theoretical substance of welfare states” (1990:19), and, “Welfare 
states may be equally large or comprehensive, but with entirely different effects on social 
structure” (1990:58). 


4. For example, Kenworthy (1999) does not have enough information to compare different 
features in one model with only a cross-section of 15 countries at one time. Instead, he 
analyzes social spending, social wages, and decommodification in separate models. We do not 
know if each of these reduces poverty since they are correlated. When he finds a significant 
effect of social wages, this could be because it proxies general levels of welfare generosity. By 
including all five in the same model, one can assess the effect of social wages while controlling 
for the other features and, in turn, general levels of welfare generosity. 


5. The literature has experienced a proliferation of institutional typologies, including Korpi 
and Palme’s (1998) targeting versus universalism; Huber and Stephens’s (2001) social 
democratic, Christian democratic, and liberal; and classic distinctions between residual and 
institutionalized welfare states and civil, political, and social rights. Across typologies, there 
is actually a great deal in common with Esping-Andersen’s schema. 


6. Esping-Andersen (1990:76-77) argues, “In other words, if we are willing to accept that 
welfare states play an important role in the patterning of social stratification . . . we find 
that it is misleading to compare welfare states as merely ‘more’ or ‘less’ egalitarian. We 
discover, instead, entirely different logics of social stratification embedded in welfare-state 
construction.” Later, he (1999:16-17) argues, “When also the welfare state is residual and 
social protection highly privatized . . . market inequalities are unlikely to be affected much 
by social redistribution. . . . Clearly, where also the welfare state is strongly universalistic, 
the distribution of resources and life chances should be additionally egalitarian, creating 
homogeneity not only within the working class, but also between the social classes.” Finally, 
he (1999:32) argues, “The Scandinavian social democrats have now for decades defined 
their target as the equalization of social resources, a multidimensional across-the-board 
programmatic effort to level social capital; in contrast, the liberal Anglo-Saxon approach is 
selective, singling out disadvantaged groups for ‘sponsored mobility.’ . . . Egalitarianism is a 
derivative consequence of what is and always was the foremost objective behind social policy, 
namely, insuring the population against social risks. How, to what degree, and which kinds 
of risks are pooled collectively will certainly have immediate — but none the less derivative 
— consequences for poverty, income distribution, economic opportunities and, more generally, 
for social solidarities and stratification outcomes." 


7. Esping-Andersen (1990:2) explains, “the concept of the welfare state is too narrowly 
associated with the conventional social-amelioration policies . . . contemporary advanced 
nations cluster not only in terms of how their traditional social-welfare policies are 
constructed, but also in terms of how these influence employment and general social 
structure. To talk of ‘a regime’ is to denote the fact that in the relation between state and 
economy a complex of legal and organizational features are systematically interwoven.” See 
also the quotes in fn. 3. 


8. Smeeding et al. (2001:162) explain, “The United States has a long tradition of measuring 
income poverty and weighing the effectiveness of government policies aimed at poverty 
reduction. Although this analysis has been valuable to policymakers, it rests on an inherently 
parochial foundation, for it is based on the experiences of only one nation.” 


9. Blank (1997:139) explains, “Understanding that cash assistance programs may make 
families less poor without necessarily removing them from poverty highlights a problem with 
how we evaluate these programs. Expansions in cash assistance over time could make many 
poor families substantially better off, but as long as this does not actually move them above 
the poverty line, this improvement will not be reflected in reduced poverty rates. Similarly, 
reductions in cash assistance will have no effect on poverty if they make already-poor families 
even poorer. Thus, changes in poverty rates may seriously understate the impact of changes 
in cash transfers on the well-being of poor families." 


10. Smeeding et al. (2001:165) write, “The best current definition is disposable cash and 
noncash income (that is, money income minus direct income and payroll taxes, and including 
all cash and near-cash transfers, such as food stamps and cash housing allowances, and 
refundable tax credits, such as the Earned Income Tax Credit).” 


11. Wright (2003:3-4) explains, “The state plays a pivotal role in establishing the very 
possibility of markets through the coercive enforcement of property rights that directly 
impact on the nature of market-generated distributions . . . In all sorts of ways the state is 
involved in regulating aspects of market exchanges and production — from health and safety 
rules, to credentialing requirements in many labor markets, to labor laws — that impact on 
the income distribution process. It is therefore misleading to talk about a clear distinction 
between pure ‘distribution’ of income and a process of politically shaped ‘redistribution.’” 
Since the state is always involved in the market, it may be problematic to claim to measure 
income before state mediation (i.e., pretax and pretransfers). 


12. Potentially, poverty reduction might examine if the welfare state is efficient as well as 
effective. Imagine two nations A and B. Imagine that the headcount rate of SM poverty is 
20% in A and 15% in B, and that social security transfers are 5% of GDP in A and 15% of 
GDP in B. This would result in a negative association between social security transfers and 
SM poverty, and we could infer that transfers are effective. However, do transfers efficiently 
reduce SM poverty? B spends three times as much as A on transfers, yet has only 25% less 
poverty. Further, imagine the MG headcount is 30% in A and 17% in B, so poverty reduction 
would be 33.3% in A and only 11.8% in B. In this scenario, transfers would not be positively 
associated with poverty reduction. Thus, social security transfers would not be an efficient 
poverty reduction mechanism and the results for SM poverty alone would not reveal this 
valuable information. 


13. Fixed effects (FE) models allow the independent variables to explain the historical 
variation within nations while removing the variation between nations. FE models perform 
OLS after including nation-specific constant terms and subtracting all variables from their 
nation-specific means (Alderson and Nielsen 1999; Hsiao 2003). Between-effects (BE) 
models allow the independent variables to explain the between nation variation while 
removing the variation within nations. The RE model is the matrix weighted average of the 
within-nations (FE) and between-nations (BE) estimators (Greene 1990:488; Hsiao 2003). 
RE models include a country-specific error term in addition to the general error term and 
subtract a smaller portion of the nation-specific means (Alderson and Nielsen 1999). Last, 
cross-national differences in poverty and the welfare state are not constant over time, but 
relative stability exists in the cross-national ranking of nations for these variables — hence, 
FE models effectively mask this crucial variation (see Beck and Katz 2001:492). As Beck and 
Katz (2001:487) explain, “Fixed effects are problematic in the presence of [the] temporally 
stable regressors.” Further, understanding historical trends in poverty is essential as well, and 
unfortunately, BE models would mask this essential within-nation variation. 
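
The three transformations contrasted in note 13 can be illustrated with a minimal sketch. The function names, the two hypothetical nations, and their poverty rates are assumptions made for illustration and are not drawn from the study's data. In practice the RE weight (theta below) is estimated from the variance components; here it is simply fixed at an arbitrary value to show that RE subtracts only part of each nation-specific mean.

```python
from collections import defaultdict


def group_means(values, groups):
    """Return a dict mapping each group label to the mean of its values."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for value, group in zip(values, groups):
        sums[group] += value
        counts[group] += 1
    return {group: sums[group] / counts[group] for group in sums}


def quasi_demean(values, groups, theta):
    """Subtract theta times the nation-specific mean from each observation.

    theta = 1.0 gives the within (FE) transformation,
    theta = 0.0 leaves the data unchanged (pooled OLS),
    0 < theta < 1 is the partial demeaning associated with RE.
    """
    means = group_means(values, groups)
    return [value - theta * means[group] for value, group in zip(values, groups)]


# Hypothetical poverty rates: two nations, three observations each.
nations = ["A", "A", "A", "B", "B", "B"]
poverty = [10.0, 12.0, 14.0, 4.0, 5.0, 6.0]

between = group_means(poverty, nations)        # BE keeps only nation means
within = quasi_demean(poverty, nations, 1.0)   # FE keeps only deviations
partial = quasi_demean(poverty, nations, 0.5)  # RE-style; theta is arbitrary here

print(between)
print(within)
print(partial)
```

In the within-transformed series the large gap between nations A and B disappears entirely, which is the sense in which FE models remove the between-nation variation; the partially demeaned series retains some of it.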


14. Recently, methodologists have demonstrated that the Bayesian Information Criterion 
(BIC’) can be used to select between techniques (Beck and Katz 2001; Teachman et al. 2001). 
BIC’ very strongly prefers RE over FE models. Hausman's (1978) chi-square test accepts RE 
and does not require FE models. 


15. FE models consume a degree of freedom for every N. In this analysis, with 78 cases and 
18 N's (average of 4.33 T's), FE models produce inefficient estimates (Beck and Katz 2001; 
Greene 1990; Hsiao 2003). Nickell (1981) also shows that FE models may produce biased 
estimates when N far exceeds T. Population average (PA) models are problematic in small 
samples since they are a maximum likelihood estimator (MLE), which is designed for much 
larger samples. Another alternative is to use techniques with heteroskedasticity-consistent 
standard errors, for example, OLS or RE with robust clustered errors (Moller et al. 2003). 
Importantly, however, Long and Ervin (2000) show that the popular Huber-White-Sandwich 
(HC0) estimator produces incorrect inferences in samples with fewer than 250 cases. The 
alternative HC3, which works well even in samples as small as 25, does not allow for the 
clustering of errors within countries, the principal reason for using HC0. Finally, Beck 
(2001; and Beck and Katz 1995) emphasizes that OLS with panel-corrected standard errors 
should not be used when there are fewer than 10 or 15 Ts. Importantly, Beck (2001) draws a 
sharp distinction between time-series-cross-section data with more Ts than Ns and panel 
data with more Ns than Ts. Beck (274) explains, “Panel methods [e.g., RE] are designed for 
and work well with very small Ts (three, or perhaps even two).” 


16. Raftery (1995) shows that a BIC’ difference of 0-2 offers weak evidence for model 
selection, 2-6 offers positive evidence, 6-10 offers strong evidence, and greater than 10 offers 
very strong evidence. Of course, I do not solely rely on BIC’ but consider it along with the 
significance and size of coefficients (e.g., Manza and Brooks 1999). 
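
The grades of evidence listed in note 16 can be written as a small lookup. This is only a sketch: the function name, the treatment of differences that fall exactly on a boundary (assigned to the lower grade), and the two BIC’ values in the usage lines are assumptions for illustration, not results from the tables. It also assumes the usual convention that the more negative BIC’ identifies the preferred model.

```python
def raftery_grade(bic_difference):
    """Grade the evidence implied by the absolute difference between two BIC' values."""
    diff = abs(bic_difference)
    if diff > 10:
        return "very strong"
    if diff > 6:
        return "strong"
    if diff > 2:
        return "positive"
    return "weak"


# Hypothetical BIC' values for a simpler and a more complicated model.
bic_simple = -70.0
bic_complex = -61.5

preferred = "simple" if bic_simple < bic_complex else "complex"
grade = raftery_grade(bic_simple - bic_complex)
print(preferred, grade)
```

Such a comparison is meaningful only for models estimated on the same cases.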


17. My updated data include observations for Australia (4), Austria (2), Belgium (4), Canada 
(7), Denmark (4), Finland (3), France (5), Germany (7), Ireland (4), Italy (3), Luxembourg 
(3), Netherlands (4), Norway (4), Spain (2), Sweden (6), Switzerland (2), U.K. (7), U.S. (7). 
For the Austria cases, data on income before taxes and transfers (MG) was unavailable. For 
the U.S. in 1969, data on income after taxes and transfers (SM) was unavailable. Therefore, 
my sample is 81 for SM poverty and 79 for poverty reduction. This list represents all cases 
of rich Western democracies in the LIS (up to 1997) available as of August 2004. 
Unfortunately, some variables are not available after 1997. 


18. Though HI has theoretical and methodological advantages over the traditional H, the 
two are highly correlated (regardless of whether one uses my or the official LIS estimates 
of H) (Brady 2003a). 


19. SM poverty is calculated from the LIS variable DPI. Market generated (MG) poverty 
(before taxes and transfers) is calculated from all sources of income before government taxes 
and transfers (Brady 2003a). Poverty reduction is calculated as PR = ((MG - SM)/MG)*100. 
Poverty reduction as a dependent variable produces similar results to having SM poverty as 
the dependent variable with MG poverty included as a control. Models of poverty reduction, 
however, are more parsimonious. 
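
The poverty reduction measure in note 19, read as the percentage drop from market generated (MG) to state mediated (SM) poverty, can be sketched as follows. The function name and the headcount rates in the usage line are hypothetical and chosen only to make the arithmetic easy to follow.

```python
def poverty_reduction(mg, sm):
    """Percentage reduction from market generated (MG) to state mediated (SM) poverty."""
    if mg <= 0:
        raise ValueError("MG poverty must be positive")
    return (mg - sm) / mg * 100.0


# Hypothetical nation: 20% poor before taxes and transfers, 15% after.
example = poverty_reduction(20.0, 15.0)
print(example)
```

Because the measure is scaled by MG poverty, two nations with the same absolute drop in poverty can show different poverty reduction scores.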


20. Data for many of the independent variables were proximately from Huber et al. 
(2004). Also, I added Luxembourg and Spain to the dataset. Solely because sufficient 
data are unavailable, I do not analyze tax progressivity as a feature of the welfare 
state. In another piece, I examined current government receipts as a % of GDP as a 
measure of state revenue and found results consistent with this study (Brady 2003b). 


21. For Luxembourg, the social wage measures were unavailable, so I constructed estimates 
on my own. Dropping those cases does not change the results. 


22. For Spain and Luxembourg, Cusack's measure was unavailable. So I substituted data on 
general government services as percent of civilian employment from the OECD (2000a). The 
two measures are highly correlated. Dropping those cases does not change the results. 


23. In previous work, I justify the use of productivity (Brady 2003b). I acknowledge 
productivity is not a perfect proxy for human capital. Unfortunately, valid and reliable 
pooled time series data on educational attainment are simply not available. Still, however, 
productivity effectively controls for the level of economic development. This is typically done 
with GDP per capita, while my measure of productivity is GDP per employee. The two are 
highly correlated, and productivity is slightly more correlated with poverty. I have shown that 
the results are robust while only including productivity or economic growth in the model 
(Brady 2003b). Also, I substituted GDP per capita for productivity, and the conclusions are 
identical (with or without economic growth in the model, available on request). 


24. In analyses available upon request, I decomposed this variable and analyzed the percentage 
of the population over 65, the percentage of the population children, and total and female 
labor participation. Including these variables instead of this summary measure does not 
alter any of the conclusions. Solely to preserve degrees of freedom, I rely on this more 
parsimonious summary measure. 


25. This variable is based on the LIS data and is calculated by the LIS administrators (see 
www.lisproject.org). 


26. The coefficients of variation are MG H = .134, MG HI = .169, SM H = .353, and SM 
HI = .365. 


27. Tests for collinearity among the welfare state features reveal that it is not a serious 
problem. The correlations with social security transfers are .45 with social wages, .28 with 
public health spending, .06 with public employment, and -.24 with military spending. Social 
wages correlates with health spending .32, with public employment .21, and with military 
spending -.29. Health spending correlates with public employment .37 and with military 
spending -.34. Public employment and military spending correlate -.04. In analyses available 
upon request, I estimated these models with one feature of the welfare state at a time, and 
the conclusions are consistent. 


28. Economic growth correlates with social security transfers (.07), social wages (.24), 
health spending (.01), public employment (-.17), and military spending (-.26). Productivity 
correlates with social security transfers (-.28), social wages (-.29), health spending (-.26), 
public employment (-.10), and military spending (.10). Unemployment correlates with 
social security transfers (.11), social wages (.10), health spending (-.15), public employment 
(-.21) and military spending (-.14). The percent of the population not in the labor market 
correlates with social security transfers (.08), social wages (-.09), health spending (.001), 
public employment (-.59), and military spending (-.02). The percent of children in single 
mother families correlates with social security transfers (-.09), social wages (-.08), health 
spending (-.42), public employment (.31), and military spending (.27). 


29. Some evidence of this can be shown by including government receipts as a % of GDP 
in the models of Table 2. Receipts is significant if health spending is not in the model. 
However, if health spending is in the model, receipts attenuates to non-significance, while 
health spending is significant. Thus, health spending may reflect the larger government 
budgets and higher taxation needed to finance expensive welfare state services. Unlike 
simply measuring the size of the government budget, however, health spending more closely 
measures redistributiveness. 


30. Unlike the public health spending measure, the poverty measures do not incorporate in- 
kind benefits. Hence, it is possible that the public health spending * welfare regime interactions 
may be less appropriate for testing Esping-Andersen. However, this concern does not apply 
to social security transfers. 


References 


Alderson, Arthur S., and Francois Nielsen. 2002. “Globalization and the Great U-Turn.” 
American Journal of Sociology 107:1244-99. 





Alderson, Arthur S., and Francois Nielsen. 1999. “Income Inequality, Development and 
Dualism: A Reconsideration.” American Sociological Review 64:606-31. 


Alesina, Alberto, and Roberto Perotti. 1997. “The Welfare State and Competitiveness.” American 
Economic Review 87:921—39. 


Atkinson, Anthony B. 1987. “On the Measurement of Poverty.” Econometrica 55:749-64. 
Bane, Mary Jo, and David T. Ellwood. 1994. Welfare Realities. Harvard University Press. 
Banfield, Edward C. 1970. The Unheavenly City. Little, Brown and Company. 


Beck, Nathaniel. 2001. “Time Series-Cross Section Data: What Have We Learned in the Past 
Few Years.” Annual Review of Political Science 4:271—93. 


Beck, Nathaniel, and Jonathan N. Katz. 2001. “Throwing Out the Baby with the Bath Water.” 
International Organization 55:487—95. 


Beck, Nathaniel, and Jonathan N. Katz. 1996. “Nuisance Over Substance.” Political Analysis 6:1-36. 





Bergmann, Barbara R. 1997. Saving Our Children From Poverty. Russell Sage Foundation. 


Bernstein, Nina. 1999. “Giuliani Proclaims Success on Pledge to Curb Welfare.” The New York 
Times (12 December):A1, A22. 


Betson, David M., and Jennifer L. Warlick. 1998. “Alternative Historical Trends in Poverty.” 
American Economic Review 88:348—51. 


Bhargava, Alok, and J.D. Sargan. 1983. "Estimating Dynamic Random Effects Models from 
Panel Data Covering Short Time Periods." Econometrica 51:1635—60. 


Blank, Rebecca M. 2000. “Fighting Poverty: Lessons From Recent U.S. History.” Journal of 
Economic Perspectives 14:3-19. 





Blank, Rebecca M. 1997. It Takes a Nation. Princeton University Press. 


Blank, Rebecca M. (ed.). 1995. Social Protection Versus Economic Flexibility: Is There a 
Tradeoff? University of Chicago Press. 


Blank, Rebecca M. 1994. “The Employment Strategy: Public Policies to Increase Work and 
Earnings.” Pp. 168-204 in Confronting Poverty, edited by Sheldon H. Danziger, Gary D. 
Sandefur and Daniel H. Weinberg. Harvard University Press. 


Brady, David. 2004. “Reconsidering the Divergence Between Elderly, Child, and Overall 
Poverty.” Research on Aging 26(5):487-510.





1360 / Social Forces 83:4, June 2005 





Brady, David. 2003a. “Rethinking the Sociological Measurement of Poverty.” Social Forces
81:715-52.


Brady, David. 2003b. “The Poverty of Liberal Economics.” Socio-Economic Review 1(3):369-409.


Brady, David. 2003c. “The Politics of Poverty: Left Political Institutions, the Welfare State and
Poverty.” Social Forces 82:557-88.


Burtless, Gary. 1994. “Public Spending on the Poor: Historical Trends and Economic Limits.” 
Pp. 51-84 in Confronting Poverty, edited by Sheldon H. Danziger, Gary D. Sandefur, and 
Daniel H. Weinberg. Russell Sage Foundation. 








Burtless, Gary, and Timothy M. Smeeding. 2001. “The Level, Trend, and Composition of 
Poverty.” Pp. 27-68 in Understanding Poverty, edited by Sheldon Danziger and Robert 
Haveman. Russell Sage Foundation and Harvard University Press. 


Cantillion, Bea. 1997. “The Challenge of Poverty and Exclusion.” Pp. 115-166 in Social 
Policy Studies No. 21: “Family, Market, and Community: Equity and Efficiency in Social 
Policy.” OECD.


Casper, Lynne M., Sara S. McLanahan and Irwin Garfinkel. 1994. “The Gender-Poverty
Gap: What We Can Learn From Other Countries.” American Sociological Review
59:594-605.


Christopher, Karen, Paula England, Timothy M. Smeeding, and Katherine Ross. 2002. “The
Gender Gap in Poverty in Modern Nations: Single Motherhood, the Market, and the 
State.” Sociological Perspectives 45:219-242. 


Citro, Constance, and Robert T. Michael. 1995. Measuring Poverty: A New Approach. National 
Academy Press. 


Clayton, Richard, and Jonus Pontusson. 1998. “Welfare State Retrenchment Revisited: 
Entitlement Cuts, Public Sector Restructuring, and Inegalitarian Trends in Advanced 
Capitalist Societies.” World Politics 51:67-98.


Conley, Dalton, and Kristen W. Springer. 2001. “Welfare State and Infant Mortality.” American 
Journal of Sociology 107:768-807.


Currie, Janet, and Aaron Yelowitz. 2000. “Health Insurance and Less Skilled Workers.” Pp. 
233-61 in Finding Jobs, edited by David E. Card and Rebecca M. Blank. Russell Sage 
Foundation. 


Cusack, Thomas R. 2004. Data on public employment and wages for 21 OECD countries, 
Science Center Berlin, May. 


Danziger, Sheldon D., Robert H. Haveman, and Robert Plotnik. 1981. “How Income 
Transfers Affect Work, Savings and Income Distribution.” Journal of Economic Literature
19:975-1028.


Danziger, Sheldon H., and Daniel H. Weinberg. 1994. “The Historical Record: Trends in Family
Income, Inequality, and Poverty.” Pp. 18-50 in Confronting Poverty, edited by Sheldon 
H. Danziger, Gary D. Sandefur, and Daniel H. Weinberg. Russell Sage Foundation. 


Darity, William A. Jr, and Samuel L. Myers, Jr. 1987. “Do Transfer Payments Keep the Poor 
in Poverty?” American Economic Review 77:216-22.


DeFina, Robert H., and Kishor Thanawala. 2001. “The Impact of Taxes and Transfers on 
Alternative Poverty Indexes.” Review of Social Economy 59:395-416. 


Welfare State and Poverty / 1361 


Ellwood, David T., and Lawrence Summers. 1986. “Poverty in America: Is Welfare the Answer 
or the Problem.” Pp. 78-105 in Fighting Poverty, edited by Sheldon Danziger and Daniel 
Weinberg. Harvard University Press. 

Ellwood, David T., and Elisabeth D. Welty. 2000. “Public Service Employment and 
Mandatory Work: A Policy Whose Time Has Come and Gone and Come Again?” Pp. 
299-372 in Finding Jobs, edited by David E. Card and Rebecca M. Blank. Russell Sage 
Foundation. 


Esping-Andersen, Gosta. 2003. “Why No Socialism Anywhere? A Reply to Alex Hicks and 
Lane Kenworthy.” Socio-Economic Review 1:63-70.





Esping-Andersen, Gosta. 1999. Social Foundations of Postindustrial Economies. Oxford University Press.





Esping-Andersen, Gosta. 1996. Welfare States in Transition. Sage.





Esping-Andersen, Gosta. 1990. The Three Worlds of Welfare Capitalism. Princeton University Press.


Foster, James E. 1998. “What is Poverty and Who are the Poor? Redefinition for the United
States in the 1990s: Absolute Versus Relative Poverty.” American Economic Review
88:335-41.


Freeman, Richard B. 2001. “The Rising Tide Lifts...” Pp. 97-126 in Understanding Poverty,
edited by Sheldon Danziger and Robert Haveman. Russell Sage Foundation and Harvard 
University Press. 





Freeman, Richard B. 1999. The New Inequality. Beacon Press.





Freeman, Richard B. 1995. “The Large Welfare State as a System.” American Economic Review 85:16-21.


Freeman, Richard B., Robert Topel, and Birgitta Swedenborg. 1997. The Welfare State in 
Transition: Reforming the Swedish Model. University of Chicago Press. 


Gans, Herbert. 1995. The War Against the Poor. Basic Books. 

Gilbert, Neil. 2002. Transformation of the Welfare State. Oxford University Press. 
Gilder, George. 1981. Wealth and Poverty. Basic Books. 

Glazer, Nathan. 1988. The Limits of Social Policy. Harvard University Press. 

Gordon, David M. 1972. Theories of Poverty and Underemployment. Lexington Books. 


Gornick, Janet C., and Jerry A. Jacobs. 1998. “Gender, the Welfare State, and Public
Employment: A Comparative Study of Seven Industrialized Countries.” American
Sociological Review 63:688-710.


Greene, William H. 1990. Econometric Analysis. Macmillan. 


Griffin, Larry J., Joel A. Devine, and Michael Wallace. 1982. “Monopoly Capital, Organized
Labor, and Military Expenditures in the United States, 1949-1976.” American Journal
of Sociology 88 (Supplement):S113-S153.


Griffin, Larry J., Michael Wallace, and Joel Devine. 1982. “The Political Economy of Military
Spending: Evidence from the United States.” Cambridge Journal of Economics 6:1-14.


Gustafsson, Bjorn, and Mats Johansson. 1999. “In Search of Smoking Guns: What Makes 
Income Inequality Vary Over Time in Different Countries?” American Sociological 
Review 64:585-605.


Hagenaars, Aldi J.M. 1991. “The Definition and Measurement of Poverty.” Pp. 134-156 in
Economic Inequality and Poverty: International Perspectives. M.E. Sharpe. 




Hall, Peter A., and David Soskice. 2001. Varieties of Capitalism. Oxford University Press. 


Hanratty, Maria J., and Rebecca M. Blank. 1992. “Down and Out in North America: Recent
Trends in Poverty Rates in the United States and Canada.” The Quarterly Journal of
Economics 107:233-54.

Hausman, J.A. 1978. “Specification Tests in Econometrics.” Econometrica 46:1251-71.

Haveman, Robert, and Jonathan Schwabish. 1999. “Economic Growth and Poverty: A Return
to Normalcy?” Focus 20:1-7.

Hernandez, Raymond. 1999. “More Who Leave Welfare Rolls in New York Are Found to Get
Jobs.” New York Times (5 August):A19.

Hicks, Alexander, and Lane Kenworthy. 2003. “Varieties of Welfare Capitalism.” Socio-Economic 
Review 1:27-61.

Hooks, Gregory. 1991. Forging the Military Industrial Complex. University of Illinois Press. 

Hout, Michael. 1984. “Occupational Mobility of Black Men: 1962 to 1973.” American
Sociological Review 49:308-22.

Hsiao, Cheng. 2003. Analysis of Panel Data, 2nd Ed. Cambridge University Press. 

Huber, Evelyne, John D. Stephens, Charles Ragin, David Brady, and Jason Beckfield. 2004.
Comparative Welfare States Data Set, University of North Carolina, Northwestern
University, Duke University, and Indiana University.


Huber, Evelyne, and John D. Stephens. 2001. Development and Crisis of the Welfare State. 
University of Chicago Press. 





Huber, Evelyne, and John D. Stephens. 2000. “Partisan Governance, Women's Employment, and the Social Democratic
Service State.” American Sociological Review 65:323-42.


Jantti, Marcus, and Sheldon Danziger. 2000. “Income Poverty in Advanced Countries.” Pp.
309-78 in Handbook of Income Distribution, edited by Anthony B. Atkinson and Francois
Bourguignon. Elsevier. 


Katz, Michael B. 1989. The Undeserving Poor. Pantheon. 


Kenworthy, Lane. 2002. “Corporatism and Unemployment in the 1980s and 1990s.” American 
Sociological Review 67:367-88.


Kenworthy, Lane. 1999. “Do Social Welfare Policies Reduce Poverty? A Cross-National Assessment.”
Social Forces 77:1119-40.





Korpi, Walter, and Joakim Palme. 1998. “The Paradox of Redistribution and Strategies of
Equality: Welfare State Institutions, Inequality, and Poverty in the Western Countries.”
American Sociological Review 63:661-87.


Krugman, Paul. 1994. The Age of Diminished Expectations. The MIT Press. 


Leisering, Lutz, and Stephan Leibfried. 1999. Time and Poverty in Western Welfare States. 
Cambridge University Press. 


Lichter, Daniel T. 1997. “Poverty and Inequality Among Children.” Annual Review of Sociology
23:121-45. 


Lichter, Daniel T., Diane K. McLaughlin, and David C. Ribar. 1997. “Welfare and the Rise of
Female-Headed Families.” American Journal of Sociology 103:112-43.


Lindbeck, Assar. 1998. “The Welfare State and the Employment Problem.” American Economic 
Review 84:71-75.







Lindbeck, Assar. 1995. “Hazardous Welfare-State Dynamics.” American Economic Review 85:9-15.


Long, J. Scott, and Laurie H. Ervin. 2000. “Using Heteroscedastic Consistent Standard Errors 
in the Linear Regression Model.” The American Statistician 54:217-24. 


Manza, Jeff, and Clem Brooks. 1999. Social Cleavages and Political Change. Oxford University 
Press. 


Mare, Robert, and Christopher Winship. 1984. “The Paradox of Lessening Racial Inequality 
and Joblessness Among Black Youth: Enrollment, Enlistment, and Employment, 1964-1981.”
American Sociological Review 49:39-55.


Mayer, Susan E. 1997. What Money Can’t Buy. Cambridge, MA: Harvard University Press.


McFate, Katherine, Roger Lawson, and William Julius Wilson. 1995. Poverty, Inequality and 
the Future of Social Policy. Russell Sage Foundation. 


Mead, Lawrence M. 1986. Beyond Entitlement. Free Press. 


Mintz, Alex, and Alexander Hicks. 1984. “Military Keynesianism in the United States, 
1949-1976: Disaggregating Military Expenditures and Their Determination.” American 
Journal of Sociology 90:411-17.


Moller, Stephanie, David Bradley, Evelyne Huber, Francois Nielsen, and John D. Stephens. 
2003. “Determinants of Relative Poverty in Advanced Capitalist Democracies.” American 
Sociological Review 68:22-51.


Moffitt, Robert. 2000. “Welfare Benefits and Female Headship in U.S. Time Series.” American 
Economic Review 90:373-77.


Mullahy, John, and Barbara L. Wolfe. 2001. “Health Policies for the Non-Elderly Poor.” Pp. 
278-313 in Understanding Poverty, edited by Sheldon Danziger and Robert Haveman.
Russell Sage Foundation and Harvard University Press. 


Murray, Charles. 1984. Losing Ground. Basic Books. 
Newman, Katherine S. 1999. No Shame in My Game. Russell Sage Foundation and Knopf. 


Nickell, Stephen. 1981. “Biases in Dynamic Models with Fixed Effects.” Econometrica 
49:1417-26. 


Nolan, Brian, Richard Hauser, and Jean-Paul Zoyem. 2000. “The Changing Effects of 
Social Protection on Poverty.” Pp. 87-106 in Welfare Regimes and the Experience of
Unemployment in Europe, edited by Duncan Gallie and Serge Paugam. Oxford University 
Press. 


O'Connor, Alice. 2001. Poverty Knowledge. Princeton University Press. 

OECD. 2000a. OECD Labor Force Statistics. OECD. 

OECD. 2000b. OECD Laborforce Statistics. OECD.

OECD. 2000c. OECD Main Economic Indicators. OECD.

OECD. 1998. OECD Health Data CD-Rom. OECD.

Okun, Arthur M. 1975. Equality and Efficiency: The Big Tradeoff. The Brookings Institution.


Osberg, Lars, and Kuan Xu. 2000. “International Comparisons of Poverty Intensity: Index
Decomposition and Bootstrap Inference.” The Journal of Human Resources 35:51-81. 











Page, Benjamin I., and James R. Simmons. 2000. What Government Can Do. University of 
Chicago Press. 




Pear, Robert. 1999. “Clinton Hears Success Stories of Ex-Welfare Recipients.” New York Times 
(4 August):A10.


Raftery, Adrian. 1995. “Bayesian Model Selection in Social Research.” Sociological Methodology 
25:111-63. 


Reingold, David A., Gregg G. Van Ryzin and Michelle Ronda. 2001. “Does Urban Public 
Housing Diminish the Social Capital and Labor Force Activity of Its Tenants?” Journal 
of Policy Analysis and Management 20:485-504.


Sampson, Robert J., and John H. Laub. 1996. “Socioeconomic Achievement in the Life Course 
of Disadvantaged Men: Military Service as a Turning Point, Circa 1940-1965.” American 
Sociological Review 61:347-67.


Sanders, Jimy M. 1990. “Public Transfers: Safety Net or Inducement into Poverty?” Social 
Forces 68:813-34.


Sen, Amartya. 1992. Inequality Reexamined. Harvard University Press. 


SIPRI. 1999. Armaments, Disarmament and International Security: Stockholm International 
Peace Research Institute Yearbook. Oxford University Press. 


Smeeding, Timothy M., Lee Rainwater, and Gary Burtless. 2001. “U.S. Poverty in Cross- 
National Context.” Pp. 162-89 in Understanding Poverty, edited by Sheldon Danziger
and Robert Haveman. Russell Sage Foundation and Harvard University Press. 


Stephens, John D., Evelyne Huber, and Leonard Ray. 1999. “The Welfare State in Hard 
Times.” Pp. 164-93 in Continuity and Change in Contemporary Capitalism, edited by 
Herbert Kitschelt, Peter Lange, Gary Marks and John D. Stephens. Cambridge University 
Press. 


Teachman, Jay, Greg J. Duncan, W. Jean Yeung, and Dan Levy. 2001. “Covariance Structure 
Models for Fixed and Random Effects.” Sociological Methods and Research 30:271-88. 


Tomaskovic-Devey, Donald. 1991. “A Structural Model of Poverty Creation and Change: 
Political Economy, Local Opportunity, and U.S. Poverty, 1959-1979.” Research in Social 
Stratification and Mobility 10:289-322. 


Western, Bruce, and Kieran Healy. 1999. “Explaining the OECD Wage Slowdown: Recession
or Labour Decline?” European Sociological Review 15:233-49.


Wright, Erik Olin. 2004. “Introduction.” Politics and Society 32:3-6.


Wolfe, Barbara. 1994. “Reform of Health Care for the Nonelderly Poor.” Pp. 253-88 in 
Confronting Poverty, edited by Sheldon H. Danziger, Gary D. Sandefur and Daniel H. 
Weinberg. Russell Sage Foundation. 


Ziliak, James P., David N. Figlio, Elizabeth E. Davis, and Laura S. Connolly. 2000. “Accounting 
for the Decline in AFDC Caseloads.” Journal of Human Resources 35:570-86.


Reforming a Breadwinner Welfare State: Gender,
Race, Class, and Social Security Reform* 


PAMELA HERD, University of Texas at Austin 





Abstract 


A key challenge facing western welfare states is that they offset income risks faced 
by those in breadwinner families. Social Security is an excellent example. It best 
protects individuals with lengthy work histories or individuals who get married, 
stay married, and are never employed. Most women fit neither model. Thus, I 
analyze “women-friendly” approaches (benefit improvements for parents or those 
divorced) and a social democratic approach (minimum benefit) to reform. Benefits 
disconnected from marital status (parent and minimum benefits) are most effective 
at insuring against the new risks women face. Women-friendly approaches, however, 
do tend to reinforce inequality among women. Ultimately, my results emphasize that 
analyses must account for the intersection of gender, race, and class to understand 
how the state shapes stratification. 


A key challenge facing western welfare states in the 21st century is the “revolution” 
in families and the workplace (Esping-Andersen 1999, 2002). Most western 
welfare states best offset the income risks faced by families where a male 
breadwinner worked for 30 years in the same job while his wife raised children 
at home. But this family has irrevocably changed, and consequently the risks it 
faces have also changed. This issue is exemplified by current challenges to the 
U.S. Social Security system. The “breadwinner” system best protects individuals
who either have consistent lifetime work histories, through contributory worker
benefits, or individuals who get married, stay married, and do no paid work, 
through noncontributory spousal and widow benefits. Most women, however, 
fit neither model. While they work more than their mothers, their child-rearing 
responsibilities mean their work histories still do not match their fathers’.
Moreover, they have a lesser chance of getting and staying married compared 
to their parents. The result is that women currently in their 20s and 30s will be 
just as likely to be poor as are older women today (Smeeding, Estes, and Glasse 
1999). 

This study uses Social Security as a case study to examine how welfare 
states should cope with the changes in women's work and family lives. Given 
the magnitude of Social Security, and the potential issues arising because of 
its outdated benefit structure, it has received surprisingly little attention in 
the scholarly literature, particularly in comparison to the 1996 welfare reform 
and the debates preceding it. Comparatively, however, Social Security dwarfs 
traditional welfare in its overall impact on society and particularly on women. 
No other policy has the capacity to shape social relations to the extent that Social 
Security could (Esping-Andersen 1990). It accounts for almost one-quarter 
of federal spending, outranking any other single area of the budget including 
defense, compared to about 1% spent on TANF (traditional welfare). And even
more significantly, Social Security constitutes two-thirds of the average woman's 
retirement income and is the sole source of income for one in five elderly women 
(Porter, Larin, and Primus 1999). 

Given these issues, I use the 1992 Health and Retirement Study with linked 
Social Security earnings data to analyze three strategies that, for
over 20 years, have been repeatedly proposed to update the system. Despite reform
discussions, there has been little analysis of how reforms would impact women 
beneficiaries. Will reforms offset new family and work risks? Each reform is also 
linked to a key ideological rationale for how the state should redistribute resources 
to reduce gender and class inequality; though policy research is considered 
applied, the roots of policy debates can always be traced to larger theoretical 
questions. Two proposals reflect characteristics of a “women-friendly” welfare
state. One offsets the costs of divorce and the other offsets the costs of raising 
children. The third proposal, a minimum benefit, is a common approach in social 
democratic welfare states to reduce class inequities. 

A key contribution of this study is its emphasis on inequality among women. 
How do different ways of updating the welfare state impact race and class 
inequality among women? While other studies have examined how changing the 
welfare state to cope with demographic change affects gender or class inequality, 
they do not address the intersection of gender, race, and class inequality. Further, 
this intersection remains little examined in the welfare state literature more 
broadly, with the significant exception of historical welfare state analyses. 

Changing Risks across the Life Course: How Should Policy Cope?


A WOMEN-FRIENDLY WELFARE STATE APPROACH 


The current Social Security structure is a breadwinner model of distribution. 
Women who marry and are never employed reap the highest rewards from the 
system. How could Social Security be altered to reflect dramatic changes that 
have taken place in the family and workplace? One option is to adhere to a 
women-friendly design (Hernes 1987; Sainsbury 1996). As Ann Orloff (1993:315) 
argues, “Men make claims as worker citizens to compensate for failures in the 
labor market: women make claims as workers, but also as members of families, 
and they need programs especially to compensate for marriage failures and for 
the need to raise children.” 

Feminists have long argued that the state needs to recognize women’s unpaid 
care work (Acker 1990; Herd and Harrington Meyer 2002; Hobson 2000; Lister 
1997; Mink 1998; Orloff 1993; Sainsbury 1994). Even mainstream political 
economy theorists now agree that the state should offset children’s impact on 
women’s employment and their (as well as their families’) consequent income 
security to reduce gender stratification (Esping-Andersen 2002; Korpi 2000). 
The costs of raising children should be shifted off the backs of women and onto 
society through the welfare state (Cancian and Oliker 2000; Knijn and Kremer 
1997). This is the case for care credits in Social Security, which would reward
women for raising children (Crittenden 2001).

The debate about how to offset the cost of raising children has evolved over 
the past 20 years. A primary concern has been how to reward care work without 
discouraging women’s employment (Fraser 1994). The most recent approach, 
the dual breadwinner model, is best represented by Sweden’s welfare state, which 
assumes both parents will be breadwinners while the state provides support 
through universal day care, paid family leave, and parental income supplements 
(Gornick and Meyers 2001). But while feminists and welfare state scholars 
emphasize that family policy should encourage women’s employment, analysts 
do expect some employment lapses (Esping-Andersen 1999; Orloff 1993). Hence, 
the inclusion of paid family leave and parental income supplements. The care 
credit proposal I analyze is in this spirit, particularly compared to current benefits 
where women who do no paid work fare best. 

Another fundamental source of gender inequality is women’s economic 
dependence on men, which is strikingly revealed through their economic 
vulnerability after divorce. Given that almost one in two marriages will now end in 
divorce, marital dissolution has an enormous impact on women’s income security 
and gender inequality more broadly. Men benefit more than women during 
marriages because women provide most of the child and domestic labor thus 
allowing for men’s increased earnings. When marriages dissolve, as they inevitably 
do in death or divorce, women’s financial resources and paid-work prospects are 
shakier (Hoffman and Duncan 1988; Holden and Smock 1991). Indeed, despite
women's increased employment, poverty rates among older women in the United 
States are expected to be the same in the twenty-first century as they were in 
the twentieth century, partially due to high divorce rates (Smeeding, Estes, and 
Glasse 1999). Given how many women will divorce during their lives, many argue 
the state should insure against this risk. Women should not be stuck in bad, and 
most especially abusive, marriages because of economic hindrances (Orloff 1993). 
Thus, policies should offset the financial costs of divorce. 


THE SOCIAL DEMOCRATIC APPROACH 


In contrast to the women-friendly approaches, a classic social democratic 
solution to the risks associated with the labor market and consequent income 
loss is a universal income guarantee within a universal social insurance policy, 
such as Social Security (Esping-Andersen 1999; Myles 2002). It should be noted, 
however, that though women-friendly and social democratic approaches represent 
different underlying ideologies and specific policies can be one or the other, social 
democratic welfare state regimes are, generally, women friendly as well. Income 
guarantees are a common feature of Esping-Andersen's (1990) social democratic 
regimes, which are characterized by very low poverty rates. Recently, they have 
come back in vogue. Many U.S. Social Security privatization proposals have 
included them, with varying degrees of generosity, to allay concerns about the 
fate of poorer beneficiaries in a riskier scheme. 

Universal minimum benefits provide beneficiaries with a guaranteed base- 
level income to insure against all risks associated with employment. They can 
insure against chronic job loss, unemployment, or simply the inability to access a 
reasonably high-paying job. While unemployment and disability insurance could 
also insure against these risks, a minimum benefit is a very broad approach that 
catches any risk that would reduce an individual's earnings. 

Cross-national studies show that the systems that combine a universal 
minimum benefit with an earnings-related benefit most effectively reduce poverty 
and inequality as compared to countries with just one or the other (Korpi and 
Palme 1998).¹ Countries like the United States, which have only an earnings-related
benefit, cannot protect many low-earning residents. Low lifetime earnings
lead to a low benefit in old age. Contrastingly, countries that only have a minimum 
benefit lack political support to keep a generous benefit. Higher earners look to 
markets instead of the state to secure retirement income, thus weakening state 
pensions (Korpi and Palme 1998). 

WOMEN-FRIENDLY POLICIES, SOCIAL DEMOCRATIC POLICIES, AND
INEQUALITY AMONG WOMEN 


But how would these approaches affect inequality among women? Inequality looks 
different among women than it does among men. For example, a man with low 
lifetime earnings is almost certainly poor, while for many women low lifetime 
earnings signal a well-off husband subsidizing her exit from paid work. In fact, 
black and poor mothers have historically been more likely to work than married 
white women (Goldin 1977). This remains true; among new black mothers 66%
work compared to 53% of new white mothers (Bachu and O'Connell 2000).

Given the different ways inequality is manifested among women as compared 
to men, the effect of the women-friendly approach versus the social democratic 
approach to reform is unclear. Minimum benefits were designed with the lives 
of poor men in mind. Consequently, a social democratic approach will not 
necessarily target the poorest women. A disproportionate amount of these benefits 
could end up with wealthy married women who do no paid work. Certainly, some 
portion of these benefits will go to wealthy women; the issue is how much? 

Contrastingly, while women-friendly approaches are designed with women's 
lives in mind, they are not necessarily designed to reach the poorest women. 
Because most women reduce their employment after having children, proposals 
that fill in these gaps would be beneficial for most women. The question is to 
which women would it be most beneficial? On the one hand, a care credit proposal 
tied to paid labor force participation may reinforce inequities among women 
generated in the labor market. On the other hand, given that poor, black, and 
single women are significantly more likely to work, care credits that meld with 
a benefit tied to earnings may prove advantageous for these women. Proposals 
to increase benefits for divorced women could go either way as well. Black and 
low-educated women are more likely to divorce than white and well-educated 
women, but black and low-educated women are much less likely to marry in 
the first place. 

Ultimately, it is unclear how a women-friendly versus traditional social
democratic approach will affect the intersection of gender, race, and class
inequality. Stratification manifests itself differently among women than it does 
among men. While we can hypothesize how these different approaches would 
affect certain “types” of women, only a generalizable population-based study 
can tease out how these approaches actually would affect women. But before 
addressing policy alternatives that represent each of these theoretical approaches, 
what are the specifics of the current eligibility structure and why is it problematic 
for women? 

THE SPECIFICS OF SOCIAL SECURITY AND DEMOGRAPHIC CHANGES


Social Security eligibility is based on work or marriage. Though many Americans 
think that Social Security only distributes benefits to workers, over 40% of new
beneficiaries are the wives and widows of workers. Worker benefit eligibility
requires 10 years of earnings. The eligibility criterion for spousal and widow
benefits is marriage. A worker's spouse receives a benefit that is 50% of her
husband's worker benefit. If he dies she receives the full benefit. Around 98% of
spousal and widow beneficiaries are women (Harrington Meyer 1996). Divorced 
beneficiaries qualify for spousal and widow benefits with a minimum 10-year 
marriage. Beneficiaries eligible for the worker and the spousal/widow benefit 
receive the higher of the two. 
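
The eligibility rules just described can be restated as a small calculation. What follows is a minimal sketch in Python, not the Social Security Administration's actual benefit formula. The function name `monthly_benefit` and all of its parameters are hypothetical, and the dollar amounts of the underlying worker benefits are treated as given inputs rather than computed.

```python
from typing import Optional


def monthly_benefit(
    own_worker_benefit: float,
    years_of_earnings: int,
    spouse_worker_benefit: Optional[float] = None,
    spouse_deceased: bool = False,
    divorced: bool = False,
    years_married: float = 0.0,
) -> float:
    """Return the higher of the worker benefit and the spousal/widow benefit.

    Illustrative only: every name here is an assumption made for this sketch.
    """
    # Worker benefit eligibility requires 10 years of earnings.
    worker = own_worker_benefit if years_of_earnings >= 10 else 0.0

    family = 0.0
    if spouse_worker_benefit is not None:
        # Divorced beneficiaries need a marriage that lasted at least 10 years.
        eligible = (not divorced) or years_married >= 10
        if eligible:
            # Spousal benefit: 50% of the worker's benefit; widow benefit: 100%.
            share = 1.0 if spouse_deceased else 0.5
            family = share * spouse_worker_benefit

    # A beneficiary eligible for both receives the higher of the two.
    return max(worker, family)


print(monthly_benefit(400.0, 20, spouse_worker_benefit=1000.0))  # 500.0
print(monthly_benefit(400.0, 20, 1000.0, spouse_deceased=True))  # 1000.0
print(monthly_benefit(400.0, 20, 1000.0, divorced=True, years_married=8))  # 400.0
```

The third call shows the pattern the critics point to: a woman divorced after eight years of marriage falls back entirely on her own, lower worker benefit.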

It is spousal and widow noncontributory benefits that have come under 
increasing fire for being outdated (Burkhauser and Duncan 1991; Burkhauser 
and Holden 1982; Harrington Meyer 1996; Holden 1979; O'Grady-LeShane 
1993; Smeeding, Estes, and Glasse 1999; Steuerle, Spiro, and Carasso 1999). The 
criticism is based on the fact that these benefits most highly reward women 
who get married, stay married, and do no paid work, though few women fit 
this mold anymore. 

A main justification for either improving divorced women's benefits or shifting 
away from marriage as an eligibility criterion, with care credits or minimum 
benefits, is the “retreat from marriage.” From 1970 to 2000 the percentage of 
women married dropped from 60% to 52%, the percentage of women divorced 
rose from 6% to 13%, and the percentage of families headed by single mothers 
rose from 12% to 26% (U.S. Census Bureau 2000). But while marriage is a 
prerequisite for these benefits, the most vulnerable beneficiaries in the coming 
decades will be single women, particularly former single mothers. Poverty in early 
and midlife negatively affects savings, pensions, and Social Security benefits in old
age. Around one-third of single mothers live below the poverty line, compared to 
less than 10% of single women without children (U.S. Census Bureau 2000). 

A key argument for care credits is the change in women’s work patterns.
Spousal and widow benefits were designed for women who were never employed, 
but growing numbers of women are employed, albeit with continued breaks in 
their participation and lower earnings than men. Between 1960 and 1997 the 
percentage of married mothers in the workforce rose from 28% to 71% (Castro 
1998). Still, women do reduce employment to care for children. Thirty percent 
of mothers are not employed and half of new mothers do not work (Bachu and 
O’Connell 2000). 

Finally, a prime justification for both minimum benefits and care credits is that 
they move the system away from benefits tied to marriage. Spousal benefits, in 
particular, reduce the program’s progressivity (Harrington Meyer 1996; Liebman 
2001). With the worker benefit, the lower an individual’s average lifetime earnings, 
the higher the percentage of earnings the benefit replaces. Someone earning 
$6,000 a month across his or her life would have a benefit that replaces 31% of
this level ($1,866 a month), while an individual earning $1,000 a month would 
have a benefit that replaces 68% of this level ($679 a month). But for married 
couples, the spousal benefit changes the replacement rates for total household 
income significantly; the replacement rate for the $6,000 monthly earner rises 
to 47% after including his wife’s spousal benefit. 
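
The replacement rates quoted in this paragraph can be checked with simple arithmetic. The sketch below is illustrative only: the function name `replacement_rate` and the constant `SPOUSAL_SHARE` are hypothetical, and the household figure assumes a wife with no worker benefit of her own who receives the spousal benefit of 50% of her husband's worker benefit.

```python
def replacement_rate(monthly_benefit: float, monthly_earnings: float) -> float:
    """Share of pre-retirement monthly earnings replaced by the benefit."""
    return monthly_benefit / monthly_earnings


# Assumption for this sketch: the spousal benefit is 50% of the worker benefit.
SPOUSAL_SHARE = 0.5

high_earner = replacement_rate(1866, 6000)
low_earner = replacement_rate(679, 1000)
# Household benefit = worker benefit plus the wife's spousal benefit.
couple = replacement_rate(1866 * (1 + SPOUSAL_SHARE), 6000)

print(f"High earner alone: {high_earner:.0%}")   # 31%
print(f"Low earner alone:  {low_earner:.0%}")    # 68%
print(f"High-earner couple: {couple:.0%}")       # 47%
```

The spousal benefit raises the high earner's household replacement rate by roughly half, which narrows the gap with the low earner and so weakens the progressivity of the worker benefit.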


POLICY PROPOSALS 


As a consequence of these demographic and structural issues, for the past 20 years 
applied policy researchers and policy makers have been discussing three types of 
reforms to alter Social Security family benefits, which each represent a different 
ideological approach to distributing welfare state benefits. The proposals are 
interesting at a theoretical level, in terms of thinking about how women-friendly 
versus social democratic approaches affect inequality among women, but also at 
a real policy level, in terms of the effects of insuring against differing social risks. 
Even so, there has been little empirical analysis of these proposals.’ 

The options most often referenced include two of the primary women-friendly 
approaches to addressing gender inequality, (1) instituting care credits to offset 
the costs of raising children and (2) altering the current marital status benefits 
to better protect divorced women, and the primary and classic social democratic 
approach to reducing class inequality, (3) creating a universal minimum benefit. 
The care credit and minimum benefit proposals share one common feature: 
they move away from insuring against specific risks associated with marriage. 
Consequently, these proposals eliminate spousal benefits. Eliminating spousal 
benefits also allows for these changes to be basically revenue neutral. Widow 
benefits, however, are not altered under any of these proposals.’ In contrast, 
the proposal that buffers divorced women’s benefits preserves spousal benefits. 
This change does entail additional costs. The specific proposals analyzed are the 
following: 


Option 1: Divorce Benefit. This option improves benefits for divorced 
women (Hartmann and Hill 1999). Presently, divorced women are eligible only 
for a regular spousal benefit (50% of their ex-spouse’s benefit). 
This reform increases the spousal benefit for divorced women from 50% 
to 75% of their ex-spouse’s benefit. Unlike the care credit and minimum 
benefit, no one receives a decrease in benefit under the divorce benefit. 


Option 2: Care Credit. The second policy allows individuals to substitute 
up to half of the U.S. median annual wage ($15,000), for up to five years 
if they have one child and up to nine years if they have more than one child, 
within the 35 earnings years used to calculate their worker benefit (Gore 
2000). Of course, the only years substituted would be those in which 
earnings fell below $15,000.* Spousal benefits are removed. 


Option 3: Minimum Benefit. The minimum benefit would be set at the 
current federal Supplemental Security Income (SSI) level ($545 a month 
in 2001). It is strictly an individual benefit. Residents qualify regardless 
of marital status or work history. The minimum benefit is better than 
SSI for two reasons. First, unlike SSI, there would be no income or 
asset tests, which dramatically limit the number of poor elderly eligible 
for it (Advisory Council on Social Security 1979; General Accounting 
Office 1996; Harrington Meyer 1996; Steuerle, Spiro, and Carasso 1999). 
Moreover, a minimum benefit would not have SSI’s eligibility and 
bureaucratic difficulties that keep half of those eligible from receiving it 
(McGarry 2000). Spousal benefits are removed. 
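
The care credit rule in Option 2 can be illustrated with a short Python sketch. The function name and the toy earnings record are hypothetical. The sketch also assumes that the lowest-earning years are credited first, which the proposal as summarized here does not specify, and that earnings are already expressed in comparable dollars.

```python
def apply_care_credit(earnings, n_children, credit=15_000):
    """Return a copy of the earnings record with care credits applied.

    Assumption: among years with earnings below the credit, the
    lowest-earning years are credited first.
    """
    if n_children == 0:
        max_years = 0
    elif n_children == 1:
        max_years = 5
    else:
        max_years = 9

    # Indices of years with earnings below the credit, lowest earnings first.
    eligible = sorted(
        (i for i, e in enumerate(earnings) if e < credit),
        key=lambda i: earnings[i],
    )
    credited = list(earnings)
    for i in eligible[:max_years]:
        credited[i] = credit
    return credited


# Toy record of 35 earnings years: 7 years out of the labor force,
# 3 low-earning years, and 25 years above the credit level.
record = [0] * 7 + [10_000] * 3 + [30_000] * 25
one_child = apply_care_credit(record, n_children=1)
two_children = apply_care_credit(record, n_children=2)
```

With one child, five of the seven zero-earnings years are raised to $15,000; with two children, all seven zero-earnings years and two of the $10,000 years are raised.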


A NOTE ABOUT NONCONTRIBUTORY BENEFITS 


Throughout this article I refer to spousal benefits under the current system, 
as well as each of these proposals, as noncontributory benefits. Basically, these 
benefits are unrelated to individuals’ contributions through their employment 
taxes to Social Security. But many women receive a combination of contributory 
and noncontributory benefits under both the current system and the proposed 
reforms. Imagine a woman who receives a $200 monthly worker benefit, but her 
husband has a $1,600 monthly worker benefit. Her spousal benefit gives her a 
total $800 benefit. My calculations show her as having $600 in noncontributory 
benefits, though she gets an $800 benefit whether she had worked or not. The 
same premise applies for the three reforms. Care credits might buffer her $200 
worker benefit to $400, depending on her earnings and number of children. The 
minimum benefit raises her benefit by $345 to the $545 minimum.° 
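
The split between contributory and noncontributory benefits in this example can be written out as a minimal Python sketch. The function names are illustrative, and the rules are simplified to the two cases described in the text: a spousal benefit equal to 50% of the husband's worker benefit, and a $545 minimum benefit.

```python
MINIMUM_BENEFIT = 545  # proposed monthly minimum (2001 SSI level, per the text)


def spousal_split(worker_benefit, husband_benefit, spousal_share=0.5):
    """Total benefit and its noncontributory part under the spousal rule."""
    total = max(worker_benefit, spousal_share * husband_benefit)
    return total, total - worker_benefit


def minimum_split(worker_benefit, minimum=MINIMUM_BENEFIT):
    """Total benefit and its noncontributory part under the minimum benefit."""
    total = max(worker_benefit, minimum)
    return total, total - worker_benefit


# The woman in the example: $200 worker benefit, husband with $1,600.
total, noncontributory = spousal_split(200, 1600)
min_total, min_top_up = minimum_split(200)
```

Here the spousal rule yields an $800 total with $600 counted as noncontributory, and the minimum benefit yields a $545 total with a $345 noncontributory top-up.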


Research Questions 


This project examines how three reforms to update family benefits, or rather 
noncontributory benefits, will offset the new family and employment risks future 
older women will face. Further, how will these approaches affect class and race 
inequality among women? Noncontributory benefits require no tax contributions 
or earnings from beneficiaries. Consequently, to understand how the state shapes 
stratification it is important to understand which groups receive the bulk of these 
benefits. The ties between earnings-related benefits and gender, class, and race 
stratification are much more self-evident. Those who earn more (men, whites, 
and the highly educated) receive greater benefits. Thus, the first question is how 
do benefit reforms affect the overall distribution of noncontributory benefits; how 
much do different groups of women gain under the reforms? Currently, almost 
40% of all the dollars devoted to noncontributory benefits are distributed to 
women in the top household asset quartile. How do the reforms change this? 

The second set of research questions is more familiar to traditional policy 
analyses. First, who receives benefit increases and decreases? Second, how do 
the policy proposals impact the size of individuals’ benefits, or the magnitude 
of benefit change? I answer the second question in three ways. What are the 
percentage sizes of the benefit increases and decreases? How do the reforms affect 
replacement rates, individuals’ monthly benefit as a percentage of their average 
monthly lifetime earnings? And finally, how do the reforms affect average benefit 
sizes as a percentage of the U.S.-defined 2001 poverty level? 


Data and Methods 


With the 1992 Health and Retirement Study (HRS) and the 2000 Current 
Population Survey (CPS), I examine how women-friendly proposals (care credits 
and the divorce benefit), which insure against specific risks women are more likely 
to face, versus a social democratic class-based proposal (minimum benefit), which 
insures against a broad array of risks, will affect women reaching age 62 from 
2020 to 2030. The reforms’ effects on future older women’s benefits are obtained 
with a simplified nonparametric approach. In short, I readjust individual weights 
in the HRS based on information from women in the 2000 CPS born between 
1960 and 1970. The results are robust and comparable to more sophisticated 
parametric simulations. 


SAMPLE 


I use the 1992 HRS with restricted-access Social Security earnings history data 
and the 2000 CPS to analyze how implementing care credits, a minimum benefit, 
or improvements in benefits for divorced beneficiaries would affect benefit 
distribution among American women retiring between 2020 and 2030. The HRS 
is a unique nationally representative study of over 9,800 Americans born between 
1931 and 1941 (spouses are also interviewed). There are currently five waves of 
data collected from 1992 to 2000. It is the most comprehensive survey of aging 
Americans, with extensive questions on income, current and past employment, 
and health. 

The HRS has some unique features that make it the most appropriate data set 
for this analysis. First, it is one of just a few studies that can be linked to Social 
Security earnings and benefit records, which given my need to calculate benefits 
is imperative. Second, because it interviews spouses of age-eligible respondents 
I have the necessary information needed to calculate spousal benefits. Further, 
it asks extensive marital history questions of its respondents, which is another 
necessary component to correctly estimating spousal benefits. The sample size 
is 3,427 after dropping men, age-ineligible women, and those who did not give 
permission to access their Social Security records. I use the HRS-restricted Social 
Security weight to offset missing data for those who did not give access to their 
Social Security records. 

There are some problems with the HRS data, however. First, while there is 
information on spouses for currently married women, there is not information 
on former spouses for divorced and widowed women. Consequently, I employed 
a multiple imputation matching approach to assign former spouses to divorced 
and widowed women so that I could properly calculate their potential spousal 
and widow benefits. Multiple imputation 
permits the introduction of appropriate random error into the imputation 
process thus allowing for relatively unbiased parameter estimates. Unlike single 
imputations, standard errors are adjusted to account for errors associated with 
imputation (Rubin 1987). 

I also projected earnings to age 62 because individuals were ages 51-61 in 
wave 1, though imputed income was needed for limited numbers of individuals 
and years due to income reported in waves 2, 3, 4, and 5. Finally, I created a Social 
Security benefit calculator to calculate current and proposed benefits using the 
restricted access Social Security earnings data file. 

A demographic simulation to project what women born between 1960 and 
1970 would look like at age 62 was the last stage of data preparation. To avoid 
all of the complications (and some of the potential pitfalls) associated with 
a full-blown microsimulation, I use a nonparametric "splicing" method. The 
conceptual underpinning of this approach is that lifetime work and family profiles 
of younger women can be approximated by those of older cohorts of women. 
This is the same strategy employed in much larger (and more expensive in time 
and dollars) simulations, such as the Modeling Income in the Near Term (MINT) 
model funded by the Social Security Administration, which also use a "synthetic 
cohort" analysis. Thus, just as in more complicated models, I use the observed 
histories of older women to simulate the unobserved future of younger women 
(Iams and Sandell 1997). 

The underlying assumption is that there are younger women who look like 
older women, but the composition of the female population changes over time 
with respect to certain characteristics. For example, there have always been 
married women who work, but in current cohorts there are greater percentages 
of those women. Thus, I used the 2000 CPS to develop adjusted weights that 
account for how a younger cohort of women differs from the HRS cohort. I 
first take the HRS cohort of women (using the Social Security adjusted weight) and 
categorize them into descriptive cells that represent race, marital status, 
labor force participation, education, and parenting status based on what these 
women looked like in 1971 when they were between the ages of 30 and 40 years 
old. I then used women ages 30-40 in the 2000 Current Population Survey and 
Fertility Supplement (using the CPS personal level weight) to categorize women 
as I did in the HRS sample, by race, marital status, labor force participation, 
education, and parenting status. The younger women in the CPS provided the 
information to reweight the cells in the HRS so that the results reflect how 
reforms would impact future cohorts of women. 
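
The cell-based reweighting described above can be sketched in a few lines of Python. This is an illustration under simplifying assumptions, not the procedure actually run on the restricted data: the function names are hypothetical, the toy cells use only two characteristics, and the weights are made up.

```python
from collections import defaultdict


def cell_shares(records):
    """Weighted share of each cell; records are (cell, weight) pairs."""
    totals = defaultdict(float)
    for cell, weight in records:
        totals[cell] += weight
    grand_total = sum(totals.values())
    return {cell: w / grand_total for cell, w in totals.items()}


def reweight(hrs_records, cps_records):
    """Scale each HRS weight by the ratio of CPS to HRS cell shares."""
    hrs = cell_shares(hrs_records)
    cps = cell_shares(cps_records)
    return [
        (cell, weight * cps.get(cell, 0.0) / hrs[cell])
        for cell, weight in hrs_records
    ]


# Toy data: cells are (marital status, employed) pairs.
hrs_sample = [(("married", False), 3.0), (("married", True), 1.0)]
cps_sample = [(("married", False), 1.0), (("married", True), 3.0)]
adjusted = reweight(hrs_sample, cps_sample)
new_shares = cell_shares(adjusted)
```

After reweighting, the weighted composition of the older sample matches that of the younger sample, while each woman's observed work and family history is left untouched.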


DEPENDENT VARIABLE 


The dependent variable is the current benefit subtracted from each of the 
proposed benefits. It is continuous (natural log of the absolute benefit change) 
but is also altered into an ordinal form where 0 is a decrease in benefit, 1 is no 
change in benefit, and 2 is an increase in benefit. 
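
As a minimal sketch, the two forms of the dependent variable can be computed as follows. The function name is illustrative, and returning `None` for the log when there is no change is an assumption made here because the log of zero is undefined.

```python
import math


def benefit_change(current, proposed):
    """Return (ordinal category, natural log of the absolute change).

    Category: 0 = decrease, 1 = no change, 2 = increase.
    The log is None when the benefit does not change.
    """
    change = proposed - current
    if change < 0:
        category = 0
    elif change == 0:
        category = 1
    else:
        category = 2
    log_abs = math.log(abs(change)) if change != 0 else None
    return category, log_abs


increase = benefit_change(current=200, proposed=545)
decrease = benefit_change(current=800, proposed=545)
unchanged = benefit_change(current=600, proposed=600)
```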


INDEPENDENT VARIABLES 


The major demographic variables include birth year and race. Race is categorized 
as black and white, with white as the reference category. This is a critical variable 
given the substantial differences in marriage patterns between white and black 
women over the last 30 years. 

Family variables include marital and parental status, both of which are 
coded as dummies. The reference category for marital status is married women, 
compared to divorced, widowed, and never married women. Parental status is 
ever had or never had a child. Having had a child is the reference category. Both 
variables are critical given new marriage and fertility patterns. 

Socioeconomic variables, which include education, asset quartiles, and benefit 
size, are important in determining whether benefit proposals meet programmatic 
adequacy goals. Education is categorized as no high school degree and college 
degree and above, with having a high school degree as the reference category. The 
reference category for asset quartiles is the top quartile. Benefit size is a continuous 
variable and helps infer whether benefit increases and decreases are targeted 
toward those with the lowest or highest benefits under the current rules. 

Work history variables include timing and number of zero-earnings years. The 
number of zero-earnings years indicates whether the proposals meet programmatic 
equity goals. How closely tied are benefits to the extent to which people are 
employed? The number of zero-earnings years is an ordinal variable with 0-5 
zero-earnings years in a woman's 20s, 30s, and 40s as the reference category, and 
then 6-10, 11-15, 16-20, 21-25, and 26-30 zero-earnings years each coded as 1. 
The model also controls for the timing of these zero-earnings years. The reference 
category is having zero-earnings years in a woman's 20s, 30s, and 40s (111). 
Alternatively, individuals could have zero earnings years in their 20s, but not in 
their 30s and 40s (100); in their 20s and 30s, but not in their 40s (110); in their 
30s, but not in their 20s or 40s (010); or not in their 20s, but in their 30s and 40s 
(011). Do periods in which women leave the labor force affect their benefits? 
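
The three-digit timing codes can be generated with a small Python sketch. The function name and the input format (a list of ages at which a woman had zero earnings) are assumptions made for illustration.

```python
def timing_pattern(zero_earnings_ages):
    """Code whether any zero-earnings year falls in a woman's 20s, 30s, and 40s.

    Returns a three-character string such as "110": the first digit is for
    the 20s, the second for the 30s, and the third for the 40s.
    """
    decades = [(20, 29), (30, 39), (40, 49)]
    return "".join(
        "1" if any(lo <= age <= hi for age in zero_earnings_ages) else "0"
        for lo, hi in decades
    )


reference = timing_pattern([24, 33, 41])       # zero-earnings years in all three decades
early_leaver = timing_pattern([22, 23, 35])    # 20s and 30s only
thirties_only = timing_pattern([31, 32])
```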


EMPIRICAL MODEL 


The goal of the regression analyses is to isolate the key correlates associated with 
both who would receive increases and decreases under each reform and the size 
of those increases and decreases. The respondent's benefit under the current rules 
is the basis for comparison. For the care credit and minimum proposals, women 
could have benefit increases, no change, or cuts due to the elimination of spousal 
benefits, while under the divorced benefit, they could have an increase or no 
change. Of course, many beneficiaries will have no change resulting in a zero. 

Consequently, I use an altered form of the two-part model to investigate 
correlates of benefit change under each reform proposal (Manning et al. 1987). 
The first equation in the model is an ordered probit for the care credit and 
minimum benefit proposals, where y* is the underlying tendency of the observed 
phenomenon and ε is assumed to be standard normally distributed. β′ is the 
vector of parameters, and x is the vector of demographic, socioeconomic, family 
status, and work history individual characteristics. 


\[ y^{*} = \beta' x + \varepsilon \]


The dependent variable is ordinal with categories corresponding to a decrease 
in benefit (y = 0), no change in benefit (y = 1), and an increase in benefit (y = 
2) for the care credit and minimum benefit. For the divorced benefit, the first 
equation is a probit with a dependent variable where there is no change in benefit 
(y = 0) and an increase in benefit (y = 1). The μs are threshold parameters. The 
first parameter is set to zero, so there are two threshold parameters (μ₁ and μ₂) 
for the minimum and care credit benefits. Thus, the model is the following: 


\[
\begin{aligned}
y = 0 &\iff y^{*} \le \mu_1 \ (= 0), \\
y = 1 &\iff \mu_1 < y^{*} \le \mu_2, \\
y = 2 &\iff \mu_2 < y^{*}.
\end{aligned}
\]

Of course, the divorced benefit has only one threshold parameter. The 
second equation is an ordinary least squares regression of individuals' increases 
in benefits given that they are in the increase category in the first equation. The 
third equation, which applies only to the care credit and minimum benefit, is 
an ordinary least squares regression of individuals’ decreases in benefits among 
those experiencing a decrease. The dependent variable in the latter two equations 
is the natural log of the absolute benefit change.’ 
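
To make the first part of the model concrete, the following Python sketch computes the three outcome probabilities implied by an ordered probit with a standard normal error. It assumes the first threshold is fixed at zero and calls the second threshold `mu2`; the function names and the example values are illustrative and are not estimates from this analysis.

```python
import math


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def ordered_probit_probs(xb, mu2):
    """Probabilities of a decrease, no change, and an increase in benefit.

    xb is the linear index (parameters times characteristics); the first
    threshold is fixed at 0 and mu2 (> 0) is the second threshold.
    """
    p_decrease = normal_cdf(0.0 - xb)
    p_no_change = normal_cdf(mu2 - xb) - p_decrease
    p_increase = 1.0 - normal_cdf(mu2 - xb)
    return p_decrease, p_no_change, p_increase


# Hypothetical index and threshold, chosen only to show the mechanics.
probs = ordered_probit_probs(xb=0.0, mu2=1.0)
```

The second and third parts of the model would then be fit only on the women in the increase and decrease categories, respectively.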


Results 


The majority of these results focus on women turning age 62 from 2020 to 2030 
to address how the proposed Social Security reforms would affect younger cohorts 
of women, as opposed to women currently retiring, given the demographic 
differences between these groups. 

Table 1. Changes in Composition of Women Reaching Age 62 between 2020 
and 2030 

                          Birth Year 
Variable              1931-41    1960-70 
Marital status 
  Married               70         60.4 
  Divorced              13.9       16.1 
  Widowed               12.2       11.5 
  Never married          4         12 
Worker benefit 
  Eligible              80.2       88.7 
  Not eligible          19.8       11.3 
Spousal benefit 
  Eligible              92.9       82.3 
  Not eligible           7.1       17.7 
  Drawing               42.2       26.4 
  Not drawing           57.8       73.6 

Table 1 shows the major differences between current (women born between 
1931 and 1941 or the current HRS women) and future (synthetic cohort of 
women, which was created by adjusting the HRS weights based on women born 
between 1960 and 1970 in the 2000 CPS) cohorts of elderly women with regard 
to family, work, and benefit eligibility. One of the most significant differences 
between younger and older cohorts regards marital status. The percentage of 
married older women will drop by 10 percentage points, whereas the percentage 
of never married women will triple from 4% to 12%. These changes lead 
to a 10 percentage-point reduction in the proportion of women eligible for 
noncontributory spousal benefits. But increases in women's paid work experience 
lead to an almost 9 percentage-point increase in women eligible for the worker 
benefit. Ultimately, the combination of women working more and marrying 
less reduces the overall percentage of women drawing on spousal benefits from 
42% to 26%. While any woman currently married or with a minimum 10-year 
marriage is eligible for 50% of her (ex)husband's benefit, if her worker benefit is 
higher than this level she would not draw on the spousal benefit. Thus, Table 1 
shows not only that fewer women are eligible for the spousal benefit but also that 
there was a large drop in the percentage of women drawing on spousal benefits. 
The rest of the analyses are based on women turning age 62 between 2020 and 
2030. Table 2 provides an overview of the policy proposals. 

Table 2. Reform Proposals 

Proposal          Who's Eligible?          Benefit Design              Theoretical Origin 

Divorce benefit   Divorced beneficiaries   Raises spousal benefit      Women-friendly 
                                           from 50 to 75 percent 
                                           of former spouse’s 
                                           worker benefit 

Care credit       Parents                  Replaces 5-9 of the 35      Women-friendly 
                                           earnings years used to 
                                           calculate the Social 
                                           Security benefit with 
                                           $15,000 if earnings fell 
                                           below this level; 
                                           eliminates spousal 
                                           benefits 

Minimum benefit   All U.S. residents       Guaranteed flat benefit     Social democratic 
                                           of $545 a month; 
                                           eliminates spousal 
                                           benefits 





DO THE REFORMS CHANGE WHO RECEIVES THE BULK OF 
NONCONTRIBUTORY BENEFITS? 


The first question I consider is how the reforms alter the overall distribution of 
noncontributory benefits. Noncontributory benefits require no tax contributions 
on the part of the beneficiary. They are pure subsidies from the state. In theory, 
they are intended to improve benefit adequacy. Consequently, to understand how 
the state shapes stratification it is important to understand which groups receive 
the bulk of these benefits. The ties between earnings-related benefits and gender, 
class, and race stratification are much more self-evident. Those who earn more 
(men, whites, and the highly educated) receive greater benefits. 

Figures 1 and 2 display how much poor and black women, as groups, gain 
from the reforms. That is, how is the total sum of noncontributory spousal 
benefits distributed among women, reaching age 62 between 2020 and 2030, by 
household asset quartile and race? The first bar of Figure 1 shows that under the 
current rules just 14% of all noncontributory spousal benefits will be distributed 
to women in the lowest asset household quartile, while 37% will be distributed 
to the wealthiest women (those in the top quartile). 

Figure 1. Distribution of Noncontributory Benefits, by Household 
Asset Quartile, under Current Rules and Reforms for 
Beneficiaries Reaching Age 62 in 2020 to 2030 

Note: Asterisk indicates percentage of women receiving noncontributory benefits. 

Figure 1 also shows, however, how minimum benefits, care credits, and 
improvements for divorced beneficiaries could alter this distribution for younger 
cohorts of women. All three increase the percentage of noncontributory benefits 
distributed to women in the bottom asset quartile. Minimum benefits, however, 
are more successful than care credits or the divorce proposal at shifting resources 
toward poor women. Minimum benefits raise the percentage of benefits 
distributed to those in the bottom quartile to 28%, care credits to 23%, and 
reforms for divorced beneficiaries to 19%. Nonetheless, the percentage that 
remains in the pockets of women in the top asset household quartile elucidates 
the extent to which many wealthy women do not work. Under the divorced 
proposal, 35% of these benefits go to women in the top asset quartile. Under 
the minimum benefit and care credit proposals, 28% of these benefits will go to 
women in the top asset quartile. So while the minimum benefit clearly shows 
the strongest improvement over the current system in terms of benefiting poor 
women, it still reflects a targeted benefit design modeled on men’s, as opposed 
to women’s, lives. 

Figure 1 (the percentages underneath each benefit model) also shows the 
percentage of women receiving any noncontributory benefits under current rules 
and each of the reforms. All the reforms raise the percentage of women receiving 
noncontributory benefits. Most strikingly, 36% of women will have such low 
lifetime earnings that they qualify for the minimum benefit. 

Figure 2 shows how these benefits are distributed by race. Though black 
women represent about 20% of the population, they receive just 11% of 
noncontributory spousal benefits. This would change little under the divorce 
proposal, rising to 13%. But the care credit and minimum benefit proposals leave 
about 20% of noncontributory benefits in the hands of black women. 

Figure 2. Distribution of Noncontributory Benefits, by Race, under Current 
Rules and Reforms for Beneficiaries Reaching Age 62 in 2020 to 
2030 


HOW DO THE REFORMS CHANGE WOMEN'S TOTAL BENEFITS? 


While Figure 1 and Figure 2 give a sense of the overall distribution of 
noncontributory benefits, they do not show how women's individual benefits change. 
How different would one's monthly Social Security benefit be under the reforms 
as compared to current rules? Descriptive statistics in Table 3 show how groups 
of women who have provoked particular concern (i.e., unmarried women) 
reaching age 62 between 2020 and 2030 fare under these proposals. Table 3 
shows who receives benefit increases and decreases, the size of those increases 
and decreases, and the original benefit size. Overall, while 6% of women receive 
increases under the divorce proposal, 45% and 14% receive increases under care 
credits and minimum benefits respectively. 

The size of benefit increases is inversely related to the percentage of individuals 
receiving increases. Thus, women receive the largest benefit increases under the 
minimum benefit (56%) and divorce (35%) proposals compared to the care credit 
proposal (9%). About 20% of women receive benefit decreases under the care 
credit and minimum benefit proposals. The average benefit decrease under the 
care credit proposal was 30% compared to 16% under the minimum benefit. The 
divorce proposal did not eliminate spousal benefits; thus, it had no benefit cuts. 
In general, those with the lowest benefit under current rules fared best under 
the minimum proposal. Under the minimum benefit, those receiving increases 
had a $350 benefit under current rules, under the care credit, $770, and under 
the divorce benefit, $710. 

Table 3 also allows for distributional comparisons. Though black and white 
women fare relatively equally under the divorce proposal, black women fare 
better than white women under the care credit and minimum benefit proposals. 
Compared to 12% of white women receiving a 65% benefit increase, 24% of black 
women received an average 51% increase under the minimum benefit proposal, 
while 8% of black women versus 24% of white women receive an average benefit 
decrease of 16%. Under the care credit proposal, 51% of black women versus 
43% of white women have benefit increases of 12% and 8%, respectively, while 
12% of black women versus 22% of white women receive benefit decreases of 
about 29%. 

The minimum benefit, as compared to the divorce and care credit proposals, 
most heavily benefits those in the bottom income quartile, where 30% of those 
beneficiaries receive an average 72% benefit increase. Their current rule benefits 
were just $318. About 8% of women in the top asset quartile receive a 64% 
increase in benefits from a $329 benefit. Those in the bottom quartile receiving 
increases under the divorce and care credit proposals had current rule benefits 
that were $671 and $621, respectively. While relatively equal percentages of women 
receive benefit increases in the bottom and top asset quartiles under the care credit 
proposal (45%), those in the bottom quartile receive a 15% increase in benefits 
compared to a 7% increase for those in the top quartile. While 14% and 3% of 
those in the bottom and top asset quartiles respectively receive benefit increases 
under the divorce proposal, they both receive a 38% increase. 

Even though, overall, as Figure 1 shows, the minimum benefit and care credit 
proposals shift more resources toward those women in the bottom quartile of 
household assets, Table 3 shows that some of these women receive benefit cuts. 
About 15% and 13% of low-asset women under the care credit and minimum 
benefit proposals received benefit cuts. The care credit proposal leads to an average 
30% lower benefit (from a $578 current rule benefit) compared to 15% (from a 
$666 current rule benefit) under the minimum benefit. No one can receive lower 
than a $545 monthly benefit under the minimum benefit proposal. 

Finally, how would single mothers fare? This is a particular concern given 
these women will likely represent an increasing percentage of poor older women 
among new cohorts of the elderly. Never married mothers fare quite well under 
care credits and minimum benefits but receive no increases under the divorce 
proposal. Under the care credit reform, 70% of these women receive an average 
21% increase in their $553 current rule benefit. Under the minimum benefit, 
34% of never married mothers receive a 167% increase in their benefit from their 
$167 current rule benefit. None receive benefit decreases. Divorced mothers fare 
best under the divorced and minimum benefit proposals. While 21% of these 
women receive a 35% increase from a $720 current rule benefit under the divorce 
proposal, 13% receive a 51% increase from a $360 current rule benefit under the 
minimum benefit. Though about 17% of divorced mothers receive an 18% benefit 
cut under the minimum, their benefits never fall below $545 a month (in 2001 
dollars). Under the care credit, while around 73% of divorced mothers receive 
a 9% benefit increase, 16% do receive an average 25% decrease from their $674 
current rule benefit. Unlike the minimum benefit, however, care credits set no 
floor below which benefits cannot fall. 

Table 4. Replacement Rates 

                            Current   Divorce   Care   Minimum 
Single parents 
  Divorced mothers             50        52      56       51 
  Never married mothers        50        50      59       57 
Single parents 
  Bottom asset quartile        60        67      66       71 
  Second asset quartile        48        50      50       49 
  Third asset quartile         45        47      46       46 
  Top asset quartile           45        46      45       45 
Marital status 
  Married                      60        60      63       58 
  Divorced                     50        52      52       51 
  Widowed                      70        70      77       78 
  Never married                44        44      45       47 
Asset quartiles 
  Bottom                       64        67      71       74 
  Second                       54        55      58       55 
  Third                        50        50      53       50 
  Top                          53        53      55       53 
Race 
  Black                        54        54      58       56 
  White                        55        54      58       56 

Table 3 shows who gained and lost and the magnitude of those gains and 
losses. The next two tables show how different groups of women fared on average. 
Like the findings based on Table 3, the findings in Tables 4 and 5 generally show 
that poor and black women fare best under minimum benefits and care credits 
respectively. Neither group fares well under the divorce proposal. Another way to 
think about the magnitude of benefit change is to look at the replacement rates, 
which are the proportion of average lifetime earnings Social Security benefits 
replace. The monthly Social Security benefit is divided by average monthly lifetime 
earnings. Table 4 shows the median replacement rates for differing demographic 
groups. The divorce proposal has little impact on replacement rates because it 
benefits so few women (6%), though the replacement rate for divorced mothers 
in the bottom asset quartile does rise from 60% to 67%. In contrast, the care 
credit and minimum benefit proposals improve replacement rates for single 
parents and poor women generally. The replacement rate for never married 
mothers rises from 50% to 59% and 57% for care credits and minimum benefits, 
respectively. Moreover, the replacement rate for single mothers in households 
within the bottom asset quartile rises from 60% to 66% and 71% for care credits 
and minimum benefits, respectively. The minimum benefit is most progressive, 
raising the replacement rates from 64% to 74% for all women in the bottom 
household asset quartile. 

Table 5. Average Benefits as a Percentage of the Poverty Levelᵃ 

                            Current   Divorce   Care     Minimum 
                            Benefit   Benefit   Credit   Benefit 
Single parents 
  Divorced mothers            123       138      126       127 
  Never married mothers       101       101      114       118 
Single parents 
  Bottom asset quartile       106       121      112       114 
  Second asset quartile       141       142      144       140 
  Third asset quartile        156       156      158       156 
  Top asset quartile          154       154      156       154 
Marital status 
  Marriedᵇ                    229       229      227       228 
  Divorced                    125       138      126       126 
  Widowed                     164       164      164       165 
  Never married               128       128      151       140 
Asset quartiles 
  Bottom                       98       103      101       106 
  Second                      120       122      119       119 
  Third                       137       138      135       135 
  Top                         130       131      127       128 
Race 
  Black                       112       115      116       119 
  White                       126       128      124       125 

ᵃ U.S.-defined poverty level in 2001. 
ᵇ This is the combined husband and wife benefit. 

Finally, Table 5 assesses the magnitude of benefit change by showing how 
benefits, as a percentage of the U.S. poverty level, are affected by the reforms. 
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Table 6. Predicted Probabilities under Varying Scenarios* 
Working Nonworking 


Married Married 
Benefit Increase Low Asset” High Asset” Mother: Mother: 
Divorce benefit 47 2622 0 0 
Care credit 78 77 87 40 
Minimum benefit 45 25 92 44 


*Based on ordered probit analyses that are available from the author upon request. 

^ Married white mother; 6-10 zero earnings years distributed throughout her 30s, 40s, and 50s; high 
school graduate; 1931 birth year; $400 current benefit (for care and divorce). 

‘Married white mother; low asset; 16-20 zero earnings years distributed throughout her 20s, 30s, and 
40s versus 0-6 zero earnings years distributed in 20s and 30s; high school graduate; 1931 birth year; 
$400 current bxenefit (for care and divorce). 


The divorce proposal improves benefits for divorced single mothers, from 123%
to 138%, but does nothing for never married mothers whose benefits under the
current rules are just 101% of the poverty level. Contrastingly, the care credit and
minimum benefit proposals raise benefits for both divorced and never married
mothers, though more so for never married mothers. Never married mothers'
benefits rise from 101% of the poverty level to 114% and 118% with the care
credit and minimum benefit proposals, respectively. The benefits of divorced 
mothers in the lowest asset quartile rise from 106% to 121% of the poverty level 
with the divorce proposal. All single mothers’ benefits rise from 106% to 112% 
and 114% under the care and minimum benefit proposals, respectively. 
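
The percentage-point gains quoted in this paragraph can be tabulated directly. The following is a minimal sketch in Python that uses only the figures reported above for never married mothers and divorced single mothers; the dictionary layout and the function name `point_gain` are illustrative assumptions and are not part of the original analysis.

```python
# Benefits as a percentage of the U.S. poverty level, as reported in the text.
# The data structure and names are hypothetical; only the numbers come from the article.
benefits = {
    "never married mothers": {
        "current": 101,
        "divorce": 101,          # the divorce proposal leaves this group unchanged
        "care credit": 114,
        "minimum benefit": 118,
    },
    "divorced single mothers": {
        "current": 123,
        "divorce": 138,
    },
}


def point_gain(group: str, proposal: str) -> int:
    """Return the percentage-point change relative to current rules."""
    row = benefits[group]
    return row[proposal] - row["current"]


# Print the gain for every group and proposal for which the text gives a figure.
for group, row in benefits.items():
    for proposal in row:
        if proposal != "current":
            print(f"{group}, {proposal}: {point_gain(group, proposal):+d} points")
```

Expressing the changes in percentage points, rather than as relative increases, keeps them comparable across groups that start from different baselines.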

Also displayed in Table 5 is the combined household benefit for married 
couples as a percentage of the poverty level. Most benefit cuts are among married 
women. Thus, it is worth noting that the combined benefits of husbands and 
wives are at over 200% of the poverty level. 

While the previous descriptive analyses show how different groups of women 
fare, the results can only control for a few factors at a time. The following analyses 
are derived from the two-part regression-based model.¹⁰ They show how wealth
and work histories affect the probability of benefit increase. Women in low-asset
households had the highest probability of benefit increase with care credits. But 
women in low-asset households fared relatively better than women in high-asset 
households under the divorce and minimum proposals. All else equal, with the 
divorce proposal, divorced women in low-asset households have a 47% probability 
of benefit increase, while those in high-asset households had a 22% probability. 
The comparable figures for the care proposal were 78% and 77%, respectively. 
For the minimum proposal, they were 45% and 25%, respectively. 

Women who work more fare best under care credits as compared to the divorce 
and minimum benefit proposals. A working married mother, regardless of her 
earnings, has an 87% probability of benefit increase with care credits compared 
Table 7. Average Benefit Increases and Decreases: Ordinary Least Squares Estimates

                               Divorce          Care Credit              Minimum Benefit
                               Benefit
                               Increase     Increase     Decrease     Increase     Decrease
Assets (0 = top quartile)
  Bottom quartile               .2041        .0081        .1845        .282†       −.439**
                               (.3517)      (.0770)      (.1201)      (.172)       (.165)
  Second quartile              −.0394        .0182        .0465       −.295*     [illegible]
                               (.3456)      (.0662)      (.0932)      (.162)       (.090)
  Third quartile                .1023        .0210       −.0305        .087        −.121
                               (.3508)      (.0650)      (.0798)      (.497)       (.079)
Number of zero-earnings years
in 20s, 30s, and 40s
(reference = 0–5)
  6–10                          .2265        .5361       −.1605        .260         .335†
                               (.4731)      (.5537)      (.4040)      (.511)       (.204)
  11–15                         .2701     [illegible]     .4443        .904*        .343†
                               (.4737)      (.0731)      (.4056)      (.520)       (.199)
  16–20                         .4512     [illegible]     .8263*       .774         .472*
                               (.5546)      (.0942)      (.4021)      (.524)       (.202)
  21–25                         .3191        .2354       1.36**       1.079*        .610**
                               (.4805)    [illegible]    (.3996)      (.476)       (.205)
  26–30                         .2986        .2250     [illegible]    1.206*        .516*
                               (.4711)      (.1303)      (.4049)      (.531)       (.209)
R 19.79 660TA —.9559 —31.5000 —76.2*** 
73.82 4311 9295 
122 43 38 oy. .08 


Notes: Full models include age, race, education, timing of zero earnings years, and original benefit (for 
care and divorce). Standard errors are in parentheses. 


†p<.10 *p<.05 **p<.01 ***p<.001 (two-tailed)


to 42% with the minimum benefit and no probability with the divorce proposal. 
A similar woman with limited employment has a 59% probability of benefit
increase with the minimum benefit and 40% probability with care credits.

Table 7 shows how the size of benefit increases and decreases is affected when 
controlling for multiple factors. With regard to improving benefits for poor women,
minimum benefits clearly produced the largest relative increases and the smallest
relative decreases. Household assets were not significantly predictive for either
the care credit or the divorce proposal. Under the minimum benefit proposal,
however, those in the bottom quartile had a 28% larger increase than those in
the top quartile. Furthermore, those in the bottom household asset quartile had
a 44% smaller benefit decrease than those in the top asset quartile.

Table 7 also shows how work history impacts the size of benefit increases and 
decreases when controlling for multiple factors. Most telling, work history has 
no significant impact on the size of women's benefit changes under the divorce
proposal. Women with a more significant work history fared better under the
care credit proposal than either the divorce or minimum benefit proposals. For
example, women with 21–25 zero-earnings years had a 61% larger benefit decrease
than women with 0–5 zero-earnings years with the minimum benefit, while these
same women had a 136% larger benefit decrease with care credits. In general,
women with more zero earnings years had significantly higher benefit increases 
and significantly smaller benefit decreases under the care proposal as compared 
to the divorce and minimum benefit proposals. 


Conclusion 


New patterns of work and family life have yet to be institutionalized in the policies 
that help shape economic well-being. This is one, if not the, key challenge facing 
Social Security, and more broadly western welfare states (Esping-Andersen 1999, 
2002). As new cohorts of women approach retirement, their lives look different
from their mothers', but still do not match their fathers'. They are more likely to
combine employment and childrearing than their mothers, but are not able to 
match the labor force participation and earnings of their fathers. Unlike both their 
parents, they are far more likely to move in and out of marriages. The kinds of 
risks women face today, such as gaps in labor force participation due to raising 
children (as opposed to no labor force participation) and single parenthood, are 
not incorporated in most policy. 

Yet, there have been very few empirical studies to examine different welfare 
state distribution schemes that better reflect the life courses of younger generations 
of women. Further, little attention has been paid to how new distribution schemes 
will affect inequality among women. Consequently, I analyzed the three most
common strategies to "update" the system: improving benefits for divorced
women, creating care credits, and instituting a minimum benefit, which each
reflect the primary underlying ideologies for how the state should reduce gender 
and class inequality. 

One clear outcome is that linking noncontributory benefits to marital status, 
even when policies try to account for marital dissolution, reinforces race and 
class inequality among women by failing to protect many younger women 
against the risks of things like single parenthood and intermittent labor force 
exits, as opposed to not working at all. The proposal to improve benefits for 
divorced women, popular among feminist advocacy organizations, leaves 34%
of noncontributory benefits in the hands of women in the top household asset 
quartile and just 18% of these benefits in the hands of women with households 
in the bottom asset quartile (Hartmann and Hill 1999). Moreover, while black 
women will represent over 20% of the population, they receive just 12% of those 
benefits. Though black women are more likely to divorce than white women, 
their greater likelihood of being employed and of never marrying at all makes
this reform less useful for them. In general, this proposal benefited women who
had comparatively higher benefits to begin with. For example, while divorced 
mothers’ benefits were 123% of the poverty level under current rules, they rose 
to 138% of the poverty level, while never married mothers’ benefits remained at 
just 101% of the poverty level. 

The care credit proposal, the other gender-based reform, was more successful 
than the divorce proposal at reducing race and class inequalities and offsetting 
the employment and family risks younger women face. Care credits increased 
the amount of noncontributory benefits going toward women in the lowest asset 
households from 14% to 23% and for black women from 12% to 19%. Moreover, 
71% of never married mothers received an average 21% increase in their benefits. 
This was a meaningful boost given that their benefits under current rules were just 
at the poverty level. And almost half of all women in the lowest asset households 
received a 15% benefit increase. Ultimately, the care credit proposal, as compared 
to the minimum benefit, divorce proposal, and current system of spousal benefits, 
created the tightest link between employment and benefits. Women with very 
weak labor force participation do not fare well. In general, working women were 
twice as likely to receive benefit increases, while nonworking women (only married 
and divorced women eligible for spousal benefits could have benefit cuts) were 
twice as likely to receive benefit cuts. 

Minimum benefits are more effective than the divorce and care credit proposals 
at reducing class inequality among women and are equivalent with care credits 
in their effect on race inequality. Minimum benefits also better protect women 
against new family risks, such as single parenthood, than marital based benefits, 
but they are more suited to women who are rarely employed than women who 
have intermittent withdrawals from employment. They increase the percentage 
of noncontributory benefits delivered to the poorest women from 14% under 
the current structure to 28%. And like care credits, minimum benefits raise the 
portion of noncontributory benefits directed at black women to 19%. Single 
poor parents fare especially well under minimum benefits. Their replacement 
rate rises to 71% from 60% under current rules. Of course, minimum benefits 
also heavily benefit those with limited employment. Women who had 26-30 
zero-earnings years had benefit increases that were 120% higher than women 
with up to 5 zero-earnings years. 

Care credit and minimum benefit approaches, though significantly more 
redistributive than marital based benefits, still have problems. They divert about 
28% of noncontributory benefits to wealthy women. This happens because 
inequality among women manifests itself differently than inequality among men. 
While a man with limited labor force participation is almost certainly poor, this is 
not the case for women. And both proposals, to greater and lesser extents, devote 
benefits towards women who are not consistently employed. Thus, many wealthy 
women benefit from care credits and minimum benefits. This is not necessarily 
problematic, however. Any system that too heavily targets resources towards the
poor, particularly in the U.S., may be politically vulnerable. 

Another problem with care credits and minimum benefits is that some 
women receive benefit cuts. And though most receiving those cuts are married 
and reasonably well off, there are some groups of women, particularly divorced 
women, who are more vulnerable. Ideally, benefit cuts would be offset for those 
who can least afford them. But if they are not offset, it is important to keep 
in mind that the current system disadvantages many women purely because of 
their marital status. Imagine two women with the exact same earnings history. 
Why should a never-employed divorced mother receive an $830 benefit based 
on her ex-husband's earnings, while a low-paid never-married mother receives a 
$400 benefit based on her low-earnings work history? Care credit and minimum
benefit proposals level the playing field for these women. Declaring that those changes
should not be made because some will end up worse off than before condemns
the state to perpetually reinforcing existing inequality. 

Though feminist scholars have carefully elucidated the link between gender 
and class inequality (e.g., Acker 1990), these results show the theoretical limitations
of gender-based welfare state theory. “Women-friendly” welfare state policies are 
not necessarily friendly to poor and black women. Benefits connected to marital 
status, even those intended to offset the costs of marital dissolution, still favor 
wealthier white women. Further, rewarding care does more to soften gender 
inequality than class inequality. The care credit proposal’s strength is that it links 
noncontributory benefits more tightly to paid labor force participation than did 
spousal benefits. But this strength is also its weakness. It then reflects labor market 
inequities among women. 

These results also show the limitations of the class-based social democratic 
minimum income approach. While an income floor helps protect the poorest 
women, it cannot offset the income losses most women face due to motherhood 
and labor force discrimination. Almost 36% of women will qualify for a policy 
intended to catch the lowest-earning citizens. Of women retiring between 2020 
and 2030, 36% will have average yearly lifetime incomes of less than $9,000. 

Ultimately, research that focuses on the intersection of gender, race, and 
class needs to be pursued more vigorously in the welfare state literature. More 
analysis is needed to sort out how varying welfare state regimes and policies 
affect stratification among women. This is particularly true given the extent to 
which inequality among women is evolving over time. Whereas once very few 
women worked, leaving the majority dependent on a male breadwinner, women’s 
increased labor force participation has altered the kinds of stratification that 
exist among women. For example, black women have always been more likely to 
work than white women, but wage disparities between black and white women 
have risen over the past 20 years as more white women have entered the labor 
force. Furthermore, while race is difficult to analyze in a comparative fashion, it 
remains startlingly underanalyzed. Increasing immigration in Europe will make 
it even more pertinent over the coming years. These trends make it ever more
important for those interested in how the state shapes inequality to look at how 
it shapes the intersection of gender, race, and class inequality. 

Finally, this kind of study raises the question of political feasibility, and thus
the relevance of the analysis. What is the point of the study if policy makers are 
unlikely to change the system in the first place? First, analyses of proposed policies, 
just like past and current policies, can lend insights into broader theoretical 
questions. This study sheds light on the ramifications of gender- and class-based 
approaches to benefit distribution on race and class inequality among women. 

But examining proposed policies is similar to examining current policies 
in another way. Both types of studies must be done with the assumption of a 
possibility for change. A major reason it is interesting to ask, for example, whether 
welfare reform (or social policy more generally) has increased or decreased 
poverty is because it lends insights into how we might alter policies based on the 
results. Social scientists would not be discouraged from exploring the question 
if the current political climate indicated that policy change was unlikely. It is 
important to understand the consequences not only of what we do (current 
and past policy) but also of what we do not do (proposed policies), regardless 
of whether we think policy makers will act. 


Notes 


1. It should be noted that pension policies in the Nordic countries have changed in the past 
few years. For example, though Norway still has a traditional first-tier citizenship minimum 
pension, Sweden's and Finland's minimum pensions are tested against their entitlement 
pensions. This is similar to the minimum pension that used to be a part of the United States’ 
Social Security program (it was eliminated in the early 1980s). Denmark further tests its 
minimum pension against both pension income and other forms of income. 


2. For exceptions see Iams and Sandell (1994) and Favreault and colleagues (2002). 


3. Though widow benefits have some of the same problems as spousal benefits, widows have 
disproportionately high poverty rates, while married women who are the primary recipients 
of spousal benefits have very low poverty rates. 


4. In married couples, the lower of the two earners would receive the benefit. But because 
my data include only women, I do not carry this out in my analysis. Moreover, it is
still relatively rare for a woman to earn more than her husband or ex-husband. 


5. Some might object to the term noncontributory benefits, particularly in relation to care 
credits. One argument for care credits is that women contribute to society through raising 
children in the same way that others contribute through paid work. Raising children is just 
as much a “contribution.” I want to make clear, however, that I am not evaluating the societal 
worth of raising children, maintaining a marriage, or simply being a resident of the United 
States (the minimum benefit). I am simply examining how these different approaches to the 
distribution of resources through the welfare state affect inequality among women. 


6. The Survey of Income and Program Participation (SIPP) also collects household data 
and is merged to Social Security administrative records. But the SIPP does not collect 
fertility information on women over age 62. Given the care credit proposal, I needed fertility
information. Consequently, I would have had to use a sample of women from the SIPP 
similar to the age of women in the HRS, making it no more advantageous than the HRS for 
the purposes of this study. 


7. I also had to drop 546 married women whose spouses would not allow their earnings 
records to be accessed. I readjust the Social Security file weight to account for their absence. 
In basic comparisons, these married women did not appear to be significantly different than 
their counterparts. 


8. After comparing my projections with the Social Security Administration's population 
projection, my adjustments were comparable, with similar increases in the percentage of 
divorced and never married women. One issue that did arise with this approach, however, 
was that it could not account for improved mortality rates. Thus I slightly overestimated 
the percentage of women who are widowed and slightly underestimated the percentage of 
married women. 


9. As is standard with multiple imputation, the standard errors are adjusted to account for 
the error associated with imputation. The details of this adjustment are in the discussion of 
the imputation method under the data and methods section. 


10. Table 6 reports predicted probabilities based on probit analyses that are available from the
author on request. 
Lessons from Biafra: The Structuration of
Socially Relevant Science in the Research and
Production Directorate*


CHIKWENDU CHRISTIAN UKAEGBU, University of Wyoming 





Abstract 


Africa's dismal economic performance is directly attributable to its weakness in the 
production and use of modern technology. Even Nigeria, a country with immense 
human and material resources, coupled with significant scientific infrastructure, 
has not yet been able to manage the all-important technological leap forward. 
The situation was different in Biafra (1967-70), when indigenous scientists 
and engineers performed socially relevant science without the preconditions 
conventionally perceived as necessary for technological development. Anchored in 
structuration theory, this article explores the sociology of scientific and technological 
practice in Biafra, outlines the achievements of Biafran scientists and engineers, and 
offers explanations of why the Biafran technological success has not been replicable 
in post-civil war Nigeria. Discussion concludes with a suggestion for development- 
driven geopolitical restructuring. 


The literature on development, especially from the 1960s through the 1980s, 
has emphasized several perceived obstacles to technological development in 
developing countries, including a weak base of indigenous science and technology,
scarcity of research and development institutions and activities, low public 
investment in research and development, poor training facilities, low manpower 
in science and technology, brain drain, and the absence of a capital goods sector 
that would motivate engineering designs and fabrications (Fabayo, Odejide, and 
Alade 1995). Another assumed constraining factor is that because industries arrive 
in the Third World as turnkey operations, Third World scientists and engineers 
will likely be employed in such industries to perform only routine tasks. This type 
of work environment, it is argued, does not provide local scientists and engineers 
the opportunity to learn by doing. It has also been argued that the content of 
scientific and technological education that Third World scientists and engineers 
receive abroad is so far removed from the problems of their native countries that 
they are unable to do socially relevant science at home. 

The Biafran experience challenges these assumptions. All these perceived 
handicaps existed in Biafra, yet through the wartime organization known as 
Research and Production (RAP) Biafran scientists, engineers, and technicians 
managed to perform socially relevant science, sustain their efforts through the 
three-year Nigerian-Biafran war, and put Biafra on the path to technological 
development, had the young nation survived. Thus I suggest that technological 
development is driven more by the effective harnessing of human agency, nurtured 
by appropriate sociopolitical conditions, than by the presence of glamorous 
technological preconditions. That proposition is examined here in the context 
of scientific and technological practice in prewar Nigeria, wartime Biafra, and 
postwar Nigeria. I conclude with lessons and suggestions for socially relevant 
science in Nigeria. By socially relevant science I mean the use of educational skills 
and knowledge in science and technology to solve problems of society while 
continuing to advance that knowledge to improve and perfect technological 
devices. Where scientists, engineers, and technicians cannot rise to that challenge 
in this age of science-based production, their work degenerates into social 
irrelevance. 

Science and technology comprise an important source of new knowledge 
that, when injected into the economy, fuels development in social organizations 
such as education, health care, communication, and manufacturing enterprises 
(Geisler 2001). Teece and Pisano (1998) use the term dynamic capabilities (of 
science and technology) to define a firm's ability to integrate existing conceptual 
and empirical knowledge toward facilitating prescription. When extended to 
society in general, dynamic capabilities denotes a society's ability to utilize all 
its skill, knowledge, and resources to adapt to the ever-changing environment. 
Consequently, the knowledge produced by firms, universities, research institutes, 
and other organizations becomes diffused into the economy to the extent that it 
will be absorbed by relevant organizations, contribute to their success, and thus 
propel socioeconomic progress (Geisler 2001). 
Science and technology are human, not superhuman, activities. Their
success depends on the circumstances in which the potential or actual actors 
in scientific/technological and political spheres of society find themselves, their 
recognition that their roles can advance the cause of societal development, and 
their determination to enact those roles for that cause. Four action parameters 
inform much of the enterprise of scientific and technological advancement: basic 
research, applied research, development, and technology. 

The United States’ National Science Foundation (NSF) (as cited in Geisler 
2001) defines basic research as an activity that has as its objective a fuller 
knowledge or understanding of the subject under study, rather than a practical 
application thereof. It is research that advances scientific knowledge but does 
not have specific, immediate commercial objectives. By contrast, applied research 
is aimed at gaining knowledge or understanding in order to determine the 
means by which a specific, recognized need may be met. Development is the 
systematic use of the knowledge or understanding gained from research toward 
the production of useful materials, devices, systems, or methods, including the 
design and development of prototypes and processes. While science produces 
new knowledge, technology applies knowledge to new ways of doing things 
(NSF in Geisler 2001). Although Biafran scientists performed both basic and 
applied research, they placed more emphasis on the development component in 
response to the circumstances of the war, which demanded immediate solutions 
to the urgent problem of national survival. In fact, these scientists mostly invoked 
and recalled existing knowledge from basic and applied research and used that 
knowledge to tackle the business of producing devices, materials, and systems 
for the society. 

The series of events that led to the onset, prosecution, and end of the Nigerian- 
Biafran war (1967-70) has been documented in an impressively voluminous 
literature on that episode of Nigerian history (e.g., de St. Jorre 1972; Forsyth 
1969; Jacobs 1987; Madiebo 1980; Nwankwo 1972; Stremlau 1977). For now, 
suffice it to say that the war was the final stage of a conflict that had started with 
the military coup of January 15, 1966. Igbo military officers were the primary 
actors in that coup, which took the lives of some leading political and military 
figures of non-Igbo origin. The people of Northern Nigeria, in response to the 
loss of their prominent personalities, carried out several massacres of Igbos in 
some cities of the region. The crisis forced thousands of Igbos and other citizens 
of Eastern Nigeria out of the North and back to the East, making them refugees 
in their homeland. The situation culminated in an intense national tension. 

Fluctuating agreements and disagreements, conflict and consensus, promises
and disappointments, and trust and distrust between the leaders of then Eastern 
Nigeria (led by Lt. Col. Chukwuemeka Odumegwu Ojukwu) and the Federal 
Nigerian Government (led by Lt. Col. Yakubu Gowon) eventuated in the 
blockade of Eastern Nigeria. According to Jacobs (1987), instead of implementing 
agreements reached by the two parties in Aburi, Ghana, in early January 1967, the
federal government enacted a series of measures that led the people of Eastern 
Nigeria to believe that they were being expelled from the Nigerian federation. 

Jacobs recounts the measures as follows. The federal government, which held 
the revenues of Nigeria (especially those flowing from oil exports), would not 
help the Eastern Region with its large burden of refugees. Regular remittances to 
which the East was entitled were withheld. Government employees in the East 
were denied their wages. Federal supplies of equipment and material to agencies 
in the East were cut off. Nigeria Airways flights were suspended. All airports 
in the Eastern Region were closed to outside traffic. Eastern assets in Nigeria 
were frozen, as were those owned jointly with Nigeria abroad. Foreign currency 
exchange was cut off. Eastern seaports were closed to shipping, and export of 
Eastern produce was banned except through Lagos. The blockade of Biafra had 
begun, three months before there was a Biafra (Jacobs 1987). 

Furthermore, sensing the possibility of secession by Eastern Nigeria, Lt. Col. 
Gowon issued a decree on May 27, 1967 dividing Nigeria into twelve states. The 
minority ethnic groups in Biafra may have favored that measure. But breaking the 
Eastern Region into three states removed the major oil deposits and installations 
from Igbo control (Jacobs 1987). On May 30, the Eastern legislature, under Lt. 
Col. Ojukwu, seceded from Nigeria by declaring the region an independent and 
sovereign state known as the Republic of Biafra. On July 6, the Nigerian army 
attacked Biafra from the latter's northern borders. Thus began the Nigerian- 
Biafran war. 

Such was the historicostructural context in which Biafran scientists, engineers, 
and technicians (henceforth termed Biafran scientists) operated. Blockaded from 
sea, land, and air, Biafrans were isolated from other parts of Nigeria and the 
outside world. Most accounts of the war make passing mention of the ingenuity 
of Biafran scientists during that conflict. Ogbudinkpa (1985) is one of the very few 
who has thoroughly examined scientific practice in Biafra and the prospects of its 
transferability to national development in postwar Nigeria. A lucid, authoritative, 
and firsthand descriptive account of the role played by Biafran scientists during 
the war has more recently been published by E.O. Arene, a professor of chemistry 
who was a founding member of the Biafran Science Group. Through a series of 
events that group became an organization known as the Research and Production 
Directorate, popularly and fondly known as RAP. 

Arene's book The Biafran Scientists: The Development of an African Indigenous 
Technology (1997) was a courageous undertaking. After the war, in which 
Biafra was defeated, many key leaders and personnel of RAP could not reveal 
their roles for fear of possible reprisal from the federal military government 
of Nigeria (Ogbudinkpa 1985). That fear is understandable, because although 
RAP was comprised of people from different ethnic groups in Biafra, the Igbo 
were predominant. Their very conspicuous presence in RAP was not deliberate 
but rather the direct result of demographic realities. Igbos formed the majority
population in Biafra, as they had when it was Eastern Nigeria. It was common
knowledge that the educated population was higher in Igboland than in other 
parts of Biafra. Also, several non-Igbo ethnic groups in coastal and riverine areas 
fell to the Nigerian army early in the war, stranding a large body of non-Igbos 
behind enemy lines until after the war ended on January 15, 1970 (Madiebo 
1980). Moreover, the conflict that eventuated in the Nigerian-Biafran war was 
essentially a confrontation between the Igbo and the people of Northern Nigeria, 
owing to the conspicuous role of Igbo army officers in the plan and execution of 
the coup of January 15, 1966, in which key leaders of Northern and Southwestern 
Nigeria had died. It is therefore not surprising that Igbo scientists would prefer 
anonymity after losing a war that had been sustained through the maximum 
application of their professional skills and moral support. Arene's book was in 
fact first produced in 1987 as a restricted document. Clearance to publish it for 
public reading came a decade later. 

Arene's magnificent account of RAP is richly descriptive. My intention here 
is to place RAP in an analytical and explanatory context, bringing sociological 
theory to bear on the dynamics of an organization that rallied the best of human 
ingenuity in a society that lacked all the so-called preconditions of technological 
development. Putting Biafra’s RAP in the context of sociological theory helps 
to achieve two objectives. First, it suggests some reasons why scientific practice 
in postwar Nigeria has not enjoyed the same degree of dynamism and social 
relevance that its counterpart in wartime Biafra exhibited. Second, by anchoring 
RAP and its story in structuration theory in particular, we can see that Western 
sociological theory can be used to explain events in non-Western societies. 


Scientific and Technological Practice in Biafra: A Conceptual Framework 


Any number of sociological viewpoints could be claimed as the appropriate tool 
for explaining the social relevance of RAP. At first thought, plain structuralism, 
which sees structure as the singular determinant of action, appears a strong 
candidate. Upon critical reflection, however, plain structuralism offers only an 
incomplete understanding of the phenomenon. A fuller explanation can be gained 
from structuration theory, which (according to Giddens 2001) defines structure as 
both a medium and an outcome of social action. By this definition, structure and 
agency are interactive rather than separate from each other, and equal emphasis 
may be placed on both the subjective and objective dimensions of action. 
Some early observers of the Nigerian-Biafran war even indirectly interpreted 
the role of Biafran scientists from a psychological perspective. Psychologism, 
according to Wallace (1969), seeks to explain social behavior through the 
psychological characteristics of the participants themselves. In this sense, 
explanations of the properties of populations rest on reference to the individuals 
who compose the populations (Schnore 1961). Emphasis thus fastens on the 
attitudinal or innate characteristics of actors. Frederick Forsyth's Biafra Story 
(1969) provides a good example: 


Biafra was variously described as the Japan, the Israel, the Manchester, 
and the Kuwait of the African continent. The comparison with Japan 
refers to the population. Rarely among Africans, they [Biafrans] have the 
gift of unceasing hard work. In factories the workers turn in more man- 
hours per year than elsewhere, and in the farms the peasants produce 
more yield per acre than in any other country. It may be that nature 
has necessarily bred these traits, but they are also backed by the ancient 
traditions of the people. In Biafra, personal success has always been 
regarded as meritorious. Igbos are avid for education and particularly 
for qualification in one of the technical professions. Other characteristics 
are adduced to explain the antipathy they manage to generate; they are 
pushful, uppity and aggressive say the detractors; ambitious and energetic 
say the defenders. Throughout Africa one will find Arab traders (Lebanese 
or Syrians) or Indians. These peoples have wandered across the world 
with talent for trade undercutting local traders and driving them to the 
wall. But they will never be found where the Biafrans operate. (105–6) 


In the same vein, Ottenberg (1971) observes that the Igbo are known for their 
high achievement skills, an enterprising disposition, a strong emphasis on an 
open status system, and a high degree of receptivity to change: failure to achieve 
is regarded with a certain pity, almost with disgust (see also LeVine 1966). Such 
characterizations imply that there is something in the Biafran, especially Igbo, 
personality, that positions the individual for success and innovativeness. This 
personality is characterized by hard work, entrepreneurship, creativity, thrift, 
perseverance, and pursuit of excellence. 

But if these attributes, which could indeed have been inherent in the Biafran 
personality, were the primary force for the achievements of RAP, scientific and 
technological innovations should have flourished in Eastern Nigeria before the 
creation of Biafra, or even in other parts of Nigeria where Biafran scientists 
had a conspicuous presence. And the speed of technological innovation led 
by indigenous scientists and engineers in Biafra should have been replicated 
in Eastern Nigeria after the war, especially in Igbo states, which constitute the 
majority population in the region. Granted, evidence of invaluable technical 
creativity continues to be recorded among illiterate blacksmiths, roadside 
technicians, designers, and metal fabricators in parts of former Biafra such as 
Aba, Awka, and Nnewi. The products of these roadside designers and fabricators 
have contributed immensely to productive activity in other sectors, including the 
manufacturing sector. But the characteristics and activities of RAP were different 
from those of roadside designers and fabricators. 

Science and technology activities in RAP were led and performed by people 
with high levels of education and specialization in various fields of science 
and engineering. Most of their activities were conducted in formal or quasi- 
laboratory situations. In most cases, support staff (laboratory and workshop 
technicians, technologists) had an intermediate-level education, secondary to 
two or more years of postsecondary. In a world dominated by science-based 
production, it is only natural to expect highly educated scientists and engineers 
to collaborate in linking science and engineering knowledge to the productive 
sector, thereby contributing to economic growth. Figure 1 shows the spark of 
extraordinary creativity that existed in Biafra during the war. By contrast, science 
and technology activity was characterized by a state of ordinariness in pre- and 
postwar Nigeria. 

Figure 1. Scientific Practice in Nigeria and Biafra 

Various students of science and society in postwar Nigeria (Anya 1993; 
Chatelin et al. 1998; Ogbimi 1990; Tiffin and Osotimehin 1988; Ukaegbu 
1985, 1991, 1995) have doubted that Nigeria's available science and technology 
manpower can make appreciable contributions to the country's technological 
advancement. This includes manpower in the part of Nigeria that was Biafra, 
where scientists performed socially relevant science. Because scientists of the 
former Biafra did not continue their technological leap forward after the war, the 
attribution of their wartime feats solely to psychobehavioral properties inherent 
in a Biafran personality or agency is an unsatisfactory explanation. 

G.O. Ezekwe, who headed a leading engineering group in RAP, puts this point 
very succinctly in his foreword to Arene's book: 


The record of the achievements of indigenous experts in military 
equipment and essential commodities production shows that even under 
the prevailing atmosphere of a severe economic depression in Nigeria, 
which is none-the-less more congenial than the civil war environment, 
Nigerian scientists and engineers can transform the country into a newly 
industrialized nation in a tolerable time period, if the conditions are 
right. (1997:4–5) 

Ezekwe does not indicate what those right conditions are or should be. 
Underlying his statement, however, is a structuralist orientation, which asserts 
that human achievements are determined by the social environment in which 
actors find themselves. 

Structuralism, Waters (1994) observes, considers the material structures that 
sustain individuals, societies, and cultures and traces their effects on thought and 
action. Human beings are thus believed to be influenced by their socioeconomic 
and historical location. For a structuralist, the unleashing and utilization of 
human potential in Biafran scientists (their resourcefulness, inventiveness, endurance, 
and indomitable will to solve the immediate problem) would be attributable 
solely to the new geopolitical environment, Biafra, in which they worked. Or 
the new structure could be said to have led to a situationally induced release 
of dormant human efficacy, with which the scientists achieved socially relevant 
results. Structure is thus held to be the determining factor that underlies surface 
appearances. Everyday social experience and the beliefs that sustain it are held 
to be a gloss that masks a genuine but hidden reality beneath the level of 
consciousness. The sociologist's task is then to elucidate the connection between 
action and structure in such a way as to render action the transparent product 
of structure (Waters 1994). 

The secession of the Eastern Region from Nigeria led to a new geopolity, 
Biafra, but the secession also left Nigeria with a new structure made up of the 
rest of Nigeria without the East. A structuralist argument might propose that the 
scientific communities in the two societies responded to the situation differently 
because Biafra was blockaded but Nigeria was not. War and its preparations have 
probably equaled economic considerations as factors in the history of technology 
(Mackenzie and Wajcman 1999): war and the threat of war act coercively to 
force technological change, with defeat the anticipated punishment for those left 
behind; hence military concerns have often shaped the development pattern and 
design details of new technologies. From the structuralist perspective, Biafran 
scientists were coerced to action by a new social environment to which they 
responded with an unprecedented quantum and speed of scientific creativity. 

But structuralist arguments, Sewell (1992) rightly notes, assume a far too 
rigid causal determinism in social life, reduce actors to the status of programmed 
automatons, and fail to recognize the agency of social actors. Secessionist Biafra 
was the result of the interaction between structure and human agency rather than the 
sole handiwork of structure. Human beings created Biafra out of Nigeria, the 
latter a tension-ridden and amorphous geoethnic structure readily susceptible 
to intergroup conflict. Biafra thus was the unintentional consequence of human 
action. In his critiques of structural determinism, Giddens (1984, 1987) argues 
that human agents are not blind and dull objects that carry out whatever is 
predetermined by structures. They are agents who know a lot about their social 
circumstances, the conditions of their agency, and its outcomes. And they use 
their knowledge to carry out their agency. Agency itself is more than an isolated 
action. It is an action that is part of reproducing society, thereby making history 
(Giddens, cited in Munch 1994). On the other hand, structure has the dual 
capacity to enable and to constrain action. And the duality of structure and agency 
finds expression in the processes by which structures are constituted through 
action and action is constituted structurally (Giddens 1976, 1984, 2001). Agents 
create structures, which in turn influence action. Thus one of the hallmarks of 
structuration theory, as Giddens terms it, is that rather than maintaining a dualism 
between structure and agency, as in constructionist and structuralist sociologies, 
structuration views structure and agency as interactive and co-influential. As actors 
draw upon structures in order to provide guidance for their own actions, they 
produce or reproduce structures (Waters 1994). 

Giddens (2001) defines structure not as patterns of social relationships but 
as rules and resources recursively implicated in social reproduction. Resources 
consist of any kind of advantages or capabilities that actors may draw upon to 
affect the outcome of the process of interaction. These include meaning (the 
stock of knowledge), morals (value systems), and power (patterns of domination 
and divisions of interest). Biafra was the outcome of antagonistic agency of the 
federal and Eastern political leaders, with support from their constituencies. The 
new and independent geoethnopolity plus the resultant war in an environment 
of domestic and international blockade jointly provided the rules and resources 
which Biafran scientists took advantage of. Resources included the collective 
feeling and appreciation of independence, a collective sense of urgency for 
survival, a sense of shared identity, the collective desire to control group destiny, 
and a pool of bright and skilled science and engineering manpower. Also new 
rules of organizational participation, which placed a high emphasis on merit 
and recognition of competence, motivated Biafran scientists to invest the best 
in themselves in the application of abstract scientific knowledge to productive 
activity. 

The coup of January 15, 1966, was an action by a small group of the Nigerian 
military against the perceived inefficiencies of existing political leaders as 
individuals. It was not intended to prompt the fragmentation or dissolution of 
the country. But in response to the deaths of their leaders Northern Nigerians 
invoked the existing interpretive schemes of an antagonistic multiethnicity and 
responded to the situation as a structural issue by unleashing extreme violence 
on Igbos, an act that unintentionally led to the creation of Biafra. Put more 
formally, the interaction between the coup (agency) and the rules and resources 
of antagonistic multiethnicity (structure) led to the creation of Biafra and the 
war (another structure). The latter, in turn, provided the social capital, and 
the varieties of attributes of human agency or “agentic” outcomes, with which 
Biafran scientists worked to make history by way of an unprecedented leap toward 
technological development (see Figure 2). 

Figure 2. Structure, Agency, and Scientific Practice in Biafra 

Social capital, defined as the networks and norms of trust, and the reciprocity 
that arises from them, has been found to be beneficial for individual and group 
success (Aguilera and Massey 2003; Marsh 2003; Putnam 2000). Biafra as an 
independent political entity had the power to allocate material resources to 
its constituencies. But it did not have a lot of material on hand because of the 
constraints caused by the blockade. Therefore, social capital was the greatest 
resource at its disposal. 

Thus far I have tried to place the story of scientific practice in Biafra in a 
theoretical context. Every account of the political economy of the war directly or 
indirectly makes reference to the activities and contributions of the Biafran science 
and technology community, especially the organization known as Research and 
Production (RAP). What did the scientists, engineers, and technicians actually 
do to merit this attention? That question is addressed below. 


Biafran Research and Production: Activities and Accomplishments 


According to Ezekwe (in Arene 1997), the response of Biafran scientists during 
the nation's brief life can truly be described as a feat, because there was no 
preexisting science and engineering infrastructure worth mentioning. RAP 
began its journey around April 1967 as an informal group known as Biafran 
Scientists, with its nucleus at the University of Nigeria, Nsukka (which became the 
University of Biafra once sovereign statehood was declared). The core population 
of this group was comprised of academic scientists and their support staff. As 
the possibility of war loomed, Biafran Scientists coalesced into a more tangible 
entity known as the Biafran Science Group, which was inaugurated in June 1967 
in Enugu, the then capital of Biafra. The Biafran Science Group later became 
transformed into RAP, with a more definable and institutionalized structure, 
indeed an amorphous bureaucracy. But that huge bureaucracy hardly exhibited 
the negative attributes characteristic of bureaucracies in general and Nigerian 
bureaucracies in particular. 

Strong support from the Biafran government allowed the early informal entity 
Biafran Scientists to convert the laboratories of the Ministry of Commerce into war 
chemical laboratories. The workshops of the Nigerian Railway Corporation and 
the Nigerian Coal Corporation were converted into war engineering workshops. 
The Biafran Scientists were also mandated to build another chemical laboratory 
at Iva Valley. All these facilities were located in Enugu (Arene 1997). The Biafran 
Scientists as a diffuse entity worked in small groups, each group toying and 
experimenting with any ideas that came to mind. When scientists, engineers, and 
technicians in other towns and cities heard of the official inauguration of the 
Biafran Science Group in Enugu, they soon formed similar groups in Aba, Port 
Harcourt, and other places. These groups were made up mainly of scientists and 
engineers from the private sector. Thus, the practical world of industrial scientists 
and engineers merged with the theoretical world of their academic counterparts 
(Arene 1997) to produce socially relevant science. 

Enugu fell to the federal army in October 1967. The Biafran Science Group 
moved to Umuahia, consolidated, and changed its name to Research and 
Production Directorate. The office of Head RAP was initially at Umudike. It was 
later relocated to the Defense Headquarters, around the community of Afara at 
the outskirts of the city of Umuahia. Except for the accountant and his deputy, 
none of the principal administrators of RAP had a degree in management or 
organization studies. B.C.E. Nwosu, who headed the organization, was a nuclear 
physicist. The administrative officer was Steve Emejuaiwe, a microbiologist. The 
scientific officer was Charlie Okafor, with a doctorate in chemistry. The air defense 
unit was managed by F.N.C. Oragwu, a nuclear physicist. Yet RAP headquarters 
and its numerous units scattered across the country were run very efficiently. 
Hardly was any action or decision bogged down in red tape. The middle- and 
low-level staff at the head office was a highly efficient group of workers. The 
typists and secretaries among them produced memos and schedules with dazzling 
speed and accuracy. The accounts clerks, some of whom had never done any 
accounting job or office work prior to their employment in RAP, executed their 
functions with utmost efficiency. The welfare officer, Mrs. Wikina, an indefatigable 
woman of incredible energy, toured centers to search for relief materials for RAP 
employees. The various units of RAP in different towns and villages of Biafra were 
headed by scientists and engineers, some of them lacking prior administrative 
experience but motivated by the desire to contribute their best for the successful 
prosecution of the war. 

One of the admirable attributes of RAP as an organization was its ability to 
adjust quickly to changing circumstances. Arene (1997) clearly documents how 
speedily the Biafran Science Group regrouped and metamorphosed into RAP 
immediately after the fall of Enugu in October 1967. RAP fared similarly after 
the fall of Umuahia in 1969. The office of Head RAP, in particular, quickly rented 
and moved to an uncompleted and rickety building in Isu, a rural town on the 
Anara-Nkwere-Orlu road. Within a very short time after the fall of Umuahia, 
the business of administering RAP had stabilized. The various units of RAP on 
the Umuahia, Umudike, and Uzuakoli axis quickly relocated and continued the 
business of producing for the war effort. The resettlement process and resumption 
of work were swift and efficient, indeed almost magical. 

From its earliest beginnings, when it was known as Biafran Scientists, RAP 
worked on war-related projects. Scientists, engineers, and technicians combined 
their different areas of knowledge and expertise to produce different kinds of 
weapons. The chemical groups worked on incendiaries, smoke signals, detonators, 
napalm, primers, rocket fuels, cocktails, and bombs. Some engineering groups 
produced grenade and rocket casings, mortar shells, and bullets; others fabricated 
armored cars from tractors and trucks. One of the best-known weapons made 
by Biafran scientists was the ogbunigwe (mass killer or destroyer), which became 
a formidable antipersonnel mine (Arene 1997; Ogbudinkpa 1985). Biersteker 
(1978) notes that RAP also demonstrated the capability to produce petroleum 
and fuel oil on a large scale in numerous and widely distributed locations, and 
without the assistance and supervision of expatriate technicians. 

Madiebo provides further evidence of this extraordinary versatility: 


As soon as Port Harcourt fell in May 1968, and with it, most of the oil 
fields and the refinery, shortage of fuel was felt all over the country. A 
Petroleum Management Board (PMB) was established to control what 
was available as well as find ways of effecting replenishments. The 
Board designed and built a sizeable and efficient refinery at Uzuakoli. 
What they produced was not sufficient for the needs of a nation at war, 
thereby making petrol rationing imperative. The Research and Production 
Directorate, which considered no problem impossible to solve, soon stepped in 
to assist. It designed and built several refineries and produced petrol and 
diesel at a considerably fast rate. With its assistance also, all major armed 
forces units and formations as well as civilian organizations set up oil 
refineries. Products of these numerous refineries were generally fair and 
satisfied the urgent fuel needs of the nation even after the main refinery 
at Uzuakoli was lost to the enemy. (1980:114–15, emphasis added) 


Madiebo's account of localized petroleum refineries in Biafra is consistent 
with the early history of oil refining in advanced countries. For instance, small 
refineries that used a simple distillation process were ubiquitous in nineteenth- 
century America (History Channel 1997). In Biafra, because building conventional 
fractionating towers was not possible at the time, the method of simple laboratory 
distillation did the job: fuel (petrol) was made through this simple process 
(Arene 1997). It is ironic, though, that Nigeria failed to exploit this indigenous 
capability after the war. Instead it preferred to invest billions of dollars in the 
acquisition of complex refineries that, in many cases, have disappointed the 
public due to incessant, prolonged, and excruciating bouts of fuel scarcity. Sadly, 
Nigeria presently spends huge sums of scarce foreign exchange to import petrol 
even though the empirical case of Biafra shows that domestic capacity exists 
to produce the commodity through simpler processes. Rather than integrating 
Biafra's oil refining capability, Nigeria's postwar petroleum policy marginalized 
it to the disadvantage of the nation. 

That RAP “considered no problem impossible to solve” is an excellent example 
of group self-efficacy. According to Bandura (1986), “self-efficacy” refers to the 
beliefs people have about their abilities to execute a course of action required 
to attain designated types of performance. Self-efficacy is thus the power and 
confidence to produce an intended effect. In discourse on development, national 
self-efficacy denotes the confidence of a nation’s citizens that societal problems 
and obstacles are soluble and surmountable. Self-efficacy is the belief that the 
individual or group can exert considerable control over the environment, rather 
than surrendering life and the future to external forces, human or superhuman 
(Inkeles and Smith 1998). It means that a people takes its destiny in its own hands 
and avoids falling into a state of helpless dependence on outsiders. 

National self-efficacy is energized, nurtured, and sustained by the presence 
of appropriate social capital. Networks of social trust create a sense of shared 
identity, mutual understanding, common objectives, and solidarity that promotes 
group commitment and positive responses to development efforts (Krishna and 
Shrader 1999; Thin 2002). Hence it is logical to state that social capital brings 
out the best in human capital. In particular, Igbo solidarity toward Biafra was 
significant and therefore contributed to the expression of the highest levels of 
motivation among scientists, which moved them to perform to the best of their 
potential. 

Because of the blockade, consumer items were very scarce in Biafra. Using its 
store of individual and institutional self-efficacy, as it had with petrol production, 
RAP's chemical, engineering, and microbiological units combined their skill and 
knowledge, exploited and refined the saline resources of the Abakaliki axis, and 
mass-produced salt (Arene 1997). RAP also produced alcoholic drinks such as 
wines, gin, whisky, and brandy, as well as soap and other consumables. Nafziger 
(1982) rightly attributes to RAP the credit for applied research and the production 
of replacements for goods from abroad that had been cut off by the blockade 
imposed by the Nigerian government. 

When the Biafran government inaugurated the Land Army to mobilize the 
people to produce more food, biologists, botanists, and zoologists in various 
units of RAP joined professors and lecturers in agriculture and practicing field 
officers. The Land Army did a tremendous job, cultivating all available land in 
the Biafran hinterland (Arene 1997). 

Nor should it be forgotten that Uli Airport, in the heartland of Biafra, 
was built without any foreign technical assistance. As the war progressed, this 
indigenously designed and constructed airport was the only link between Biafra 
and the outside world. 

This short summary of the activities of Biafran scientists clearly approximates 
the definition of socially relevant science. The scientists were versatile: they 
tackled nearly every problem that they were confronted with, whether in military 
outfitting or in consumables for the civilian population. Two further attributes 
stand out: their effective transfer of skills and knowledge, and their dedicated 
work ethic. Transfer of knowledge is the ability to adapt specialized knowledge 
and training toward efforts in other fields. In one instance, a scientist formerly 
employed in a leading multinational brewery that had closed down because of the 
war became a central figure on the shop floor of one of the wartime makeshift 
oil refineries. Arene (1997) also describes a case where professors and lecturers 
in the agricultural sciences, motivated by a desire to solve the immediate societal 
problem, undertook to make explosives. Some sustained injuries from accidental 
discharge or incorrect composition of their devices. 

The work ethic in RAP was indeed dedicated. Scientists, engineers, and 
technicians worked round the clock, in the actual sense of the phrase. I myself 
observed many instances when laboratory technicians in A.N.U. Njoku- 
Obi’s microbiology unit took turns maintaining an overnight watch on their 
experiments. Many high- and low-level scientists and engineers virtually lived 
in their laboratories and workshops. 

Where that energy, collaborative spirit, work commitment, altruism, and 
hardihood came from during the war, and where they have disappeared to since 
the end of the war, deserve research attention. Those qualities no longer exist in 
government work organizations in the postwar Eastern Region, the enclave that 
was Biafra. Nor are they found in other parts of Nigeria, for that matter. 

At the end of the war in 1970, RAP was somewhat reconstituted as a federal 
research institute known as the Project Development Agency (PRODA). It was 
later renamed the Project Development Institute but retained the initial acronym. 
PRODA was expected to be a replica of RAP. G.O. Ezekwe (mentioned above), a 
renowned engineer who had suffered serious injuries from accidental explosions 
in his RAP workshop during the war, was the first director of PRODA, and 
a significant proportion of PRODA's pioneer technical and clerical staff was 
likewise drawn from RAP. Some breakthroughs have since been credited to 
PRODA, including the establishment of a scientific equipment factory. But it is 
no exaggeration to state that in more than three decades of existence since the 
end of the war PRODA has never achieved the degree of societal relevance and 
visibility recorded by RAP in the three short years of secessionist Biafra. 
During a study of PRODA that I conducted in 1980 its employees reported that 
they were only somewhat effectively utilized relative to scientists in other research 
institutes (Ukaegbu 1987). But the “creative spark,” the high level of excitement 
for innovation, and the round-the-clock attachment and commitment to work 
that had characterized RAP did not exist to the same degree in PRODA, even 
though a good number of its personnel had previously served in RAP. Studies of 
industries, universities, and other research institutes in postwar Nigeria (Ukaegbu 
1985, 1991, 1995; Ukaegbu and Agunwamba 1995) have also revealed a science 
and technology system that has been unable to rise to the challenges posed by a 
developing society. Instead the system has elicited complaint and reproach. 


Scientific and Technological Practice in Postwar Nigeria: The Complain/Blame Syndrome 


Since the end of the Nigerian-Biafran war, politicians, academic commentators, 
research administrators, and the ordinary public have generally expressed 
disappointment with and complaints about the failure of the science and 
technology sector to contribute to Nigeria’s technological development. 
Dissatisfaction has focused on obstacles and inadequate technological 
preconditions, as described earlier. Babatunde Thomas, adviser on human 
resources, science, and technology to President Olusegun Obasanjo and a seasoned 
scholar of technological development, has articulated what he calls “the vicious 
circle of technological backwardness in Nigeria” (2001:6). The character of this 
vicious circle as he describes it is an exact replica of the presumed causes of 
technological underdevelopment that I outlined in the introductory remarks to 
this present article. In other words, problems that existed upon independence in 
1960 still exist intact more than forty years after. 

The phrase “technological backwardness” implies that no appreciable 
incremental change has occurred in the science and technology sector by way of 
socially relevant science. This culture of complaint (indeed, lament) is not limited 
to Nigeria alone. Wakhungu (2001) presents a similar list of handicaps in her 
discussion of science and technology policy in Africa as a whole. To generalize 
for all of Africa, however, implies that all the countries of the continent are 
handicapped by the same degree of deficiencies and deprivations. The relatively 
ample science and technology institutions and infrastructure that we see in 
Nigeria simply do not exist in Gambia. Kenya is home to a number of domestic 
and international scientific research agencies whose equivalents are absent in 
Angola. For Nigeria to continue to blame its failures on social and infrastructural 
conditions that existed more than four decades ago means that the country has 
wasted all those years. 

Biafra's leap forward in science and technology occurred in those same 
conditions, actually in even direr conditions of resource deprivation. For Biafra 
there were no established capital goods industries such as iron and steel and 
machine tools to aid engineering designs and fabrications. The Biafran experience 
even challenges the argument by dependency theory that the content of Western 
science and technology education received by Third World scientists and engineers 
is so removed from the problems of their countries that their educational skills 
are useless to their local environments. Dependency theory further states that 
Third World scientists and engineers, after studying with sophisticated equipment 
in advanced countries, become disoriented and discouraged upon return to their 
home countries because only inadequate and obsolete equipment is available 
to them there. These may be valid observations. But the Biafran experience 
neutralizes their generalizability. Most of the Biafran scientists and engineers 
were first-generation professionals who had received their education in European, 
American, and first-generation indigenous institutions of higher education with 
a strong Western presence. Yet when the challenge of survival came, they used 
their knowledge acquired abroad to solve the problems at home. 

In the years since the end of the Nigerian-Biafran war, more Nigerian scientists 
and engineers have been trained at home than abroad. If domestic training were 
the magic bullet, science and technology should have flourished since the end of 
the war. After all, immediately after World War II Japan sent its young citizens 
to acquire scientific and technological education in Western Europe and North 
America, with great success. That Western scientific and technological education 
could be relevant in Japan, China, South Korea, Malaysia, and Singapore, but not 
in Nigeria, defies logic. 

The only noteworthy research institute in Biafra at the time of its origin, 
the Agricultural Research Institute at Umudike, was not designed to cater to the 
variety of war needs. Neither were the workshops of the Nigerian railway and 
coal corporations and the laboratories of the Ministry of Commerce designed 
for the omnibus activities and tasks performed in them by Biafran scientists. 
It took human ingenuity, determination, confidence, self-efficacy, and will to 
recreate and redirect the minimal infrastructure to serve the needs of the time. 
This effort demonstrates my contention, stated above, that science and technology 
are human, not superhuman, activities. Human agents create the structures 
and infrastructures of science and society, which either enable or constrain the 
performance of socially relevant science. 

According to Diamond (1968), there were some 500 medical doctors, 700 
lawyers, and 600 engineers in Biafra at the time of secession from Nigeria. 
Although the proportion of engineers to other professionals was far above that 
in many African countries at the time, 600 engineers in a country of 14 million 
people is nevertheless inadequate. Engineers may have been relatively few in Biafra, 
but they received high-quality education, whether at home or abroad. Since the 
end of the war Nigeria has also made significant investments in education and the 
production of high-level manpower. It presently has more than 40 universities, 
including several that specialize in science and technology education. It has 
many polytechnic schools and colleges of education. Enrollment in science and 
technology constituted 30% of the total enrollment in Nigerian tertiary education 
in 1992 (UNDP 1992). Nigeria has always lamented its inability to achieve a much 
higher rate of enrollment (60%) in science. But that 30% is consistent with the 
situation in other countries around the world at that fairly recent date. Of the 
total world enrollment in tertiary education, the average enrollment in science 
was 39% in industrialized nations; the average for developing countries was 32%. 
In fact, of the 2.1 million degrees awarded in the United States in 1993, only 24% 
were in science and engineering (UNESCO 1998). It would appear that Nigeria's 
30% figure is not significantly low at all. Overall, these data show that production 
of a large quantity of scientific and technological manpower, while desirable, is 
not a sufficient condition for technological development. Much depends on the 
quality of the manpower produced and the extent to which such manpower is 
put to effective use. 

Commentators on technological development who lament that Nigeria still 
suffers an acute shortage of trained manpower should also refocus their attention 
toward the quality of education received and the extent of its utilization by those 
already trained. Research has shown that by 1985, graduates in engineering 
and allied disciplines (e.g., architecture and surveying) in Igbo states of Nigeria 
were seeking employment as secondary school teachers because they could not 
find jobs related to their education (Ukaegbu 1995). In recent years blame has 
shifted from the numerical inadequacy of manpower to the eroding quality 
of education and the international outmigration of Nigerian scientists. The 
costs of this departure of highly trained professionals, and the resultant brain 
drain, are undoubtedly high for a developing country. In the complain/blame 
game, however, society forgets that one serious constraint on socially relevant 
science is the absence of committed, visionary, knowledgeable, and courageous 
leadership that can work to erase the economic, social, and personal insecurity 
that induces the outmigration of scientists. After all, China, South Korea, Taiwan, 
and Singapore have lost and continue to lose highly trained professionals to the 
more advanced countries of the West but continue to make technological and 
economic progress on their own. 

To hinge Nigeria’s lack of technological development on the international 
outmigration of native scientists is also to insult the equally brilliant and 
capable professionals who remain at home, for such blame implies that they are 
incompetent to achieve the desirable results attributed to their absent peers. A 
case in point is Brino Gilbert, a physicist who resides in Edo State, who stirred 
international interest at the Invention and New Product Exposition in the United 
States in May 2003. His “counter-collision gadget” (CCG) won a silver medal in 
the manufacturing category, a bronze medal in aeronautics, and a trophy as the 
best invention from Africa. He had spent years seeking support from governments, 
groups, and individuals in Nigeria, without success (Williams 2003). 

It is appropriate to presume that in the many years since independence Nigeria 
would have taken visible steps in the direction of socially relevant science, and 
thus toward technological advancement. Yet one veteran observer of scientific 
practice in Africa remarked in 1979, nearly two decades after independence, that 
“increase in the number of Nigerians with scientific training has not resulted 
in an increased amount of scientific activity” (Eisemon 1979:506). Nearly two 
decades farther along came an article provocatively titled “The Nigerian Scientific 
Community: The Colossus with Feet of Clay” (Chatelin, Gaillard, and Keller 
1997). And in 2001 Nigerian Presidential Adviser Babatunde Thomas commented 
with cautious hope that “burdened by the increasing deterioration recorded in 
her rating in the field of science and technology despite her array of intellectuals, 
Nigeria may soon come out with a new policy that is human-factor based” 
(2001b). These independent opinions expressed by influential international and 
domestic observers of science and development in Nigeria indicate that scientific 
manpower has not risen to the challenge of performing socially relevant science. 
It is true that some research institutes and universities have worked on prototypes 
and devices awaiting application by willing entrepreneurs. And some agricultural 
research institutes have successfully worked on improved seed varieties. But until 
dissemination occurs with appreciable speed and in large quantity, Nigerian 
society will not feel the presence of a socially relevant scientific community. 

The description of the Nigerian scientific community as a “colossus with 
feet of clay” is particularly instructive. It is a metaphor for ineffectiveness. 
“Colossus” signifies an object or entity of immense size, scope, power, strength, 
and preeminence over other entities. For a colossus to have feet of clay, despite its 
apparently towering attributes, means that such an object is in reality feeble and 
undependable. Despite a gigantic human and material infrastructure in science 
and technology as well as large quantities of publications in science, the Nigerian 
scientific community is still on the margins of Nigerian society because it has not 
made a visible impact on the country’s quest for technological development. 

But Nigerian scientists have held their ground in the arena of production and dissemination 
of scientific knowledge. Chatelin, Gaillard, and Keller (1997) used a pancontinental 
bibliometric comparison to show that Nigerian scientists have the highest 
publication rate in Black Africa. When Africa is considered in total, they rank next 
to South Africa and Egypt. In agriculture and medicine, in particular, Nigerian 
scientists publish more than their South African and Egyptian counterparts. 
Infrastructurally, Nigeria boasts of more than 30 scientific research institutes, 
supported by a considerable number of technological incubation and engineering 
centers distributed across the country. Most of these agencies are under the 
power, supervision, and administration of the federal government. There is also 
a national academy of science, whose exact contribution to the development of 
science and technology is difficult to assess. Added to these resources are a series of 
public policy documents on science and technology, which hardly see the light of 
implementation. By contrast, Biafran technological achievement occurred without 
a science policy and without a national academy of science. The steel-rolling mills 
that have been built since the end of the war did not exist for Biafra. Polytechnic 
schools now produce lower- and middle-level technical manpower. Compared to 
the situation in Biafra, the scientific and technological infrastructure in postwar 
Nigeria is indeed gigantic. Yet scientific activity in Biafra was known for its social 
relevance, whereas that of Nigeria draws only complaints. 

Two questions arise. Why has PRODA, the designated postwar successor 
to RAP, failed to achieve the dynamism of its predecessor? And why has the 
Nigerian science and technology sector in general been unable to make a visible 
contribution to the nation's socioeconomic development despite its relatively huge 
infrastructure and the enormity of societal problems that demand scientific and 
technological skills? Answers to these questions are suggested below: 


Constraints to Socially Relevant Science in Postwar Nigeria: Some 
Concluding Remarks 


Arene states flatly that “RAP was effective within Biafra because of the stress of 
war. Remove the stress of war and the Biafran scientists would be no different 
... [than] they were before Ojukwu pronounced the former Eastern Nigeria an 
independent state of Biafra” (1997:73). RAP was the preeminent organization 
charged with and capable of applying scientific and technological knowledge to 
the problems of the time. Biafran scientists saw that they were needed and that 
the results of their efforts were recognized and used for solving practical problems. 
In addition, they were given specific assignments and challenges by a government 
that respected and valued their skills and knowledge. Employees of RAP were also 
accorded respect in the communities in which their units were located. And the 
Biafran government matched its demands on RAP with resources to enable RAP 
to perform its scientific activities. All these culminated in what may be termed 
“the right conditions.” By contrast, PRODA is in competition with numerous 
other research institutes for governmental resources and public attention. In a 
society unconstrained by blockades and war, governments and citizens can satisfy 
their material needs by using imported goods. Consequently, postwar Nigeria 
with its high import orientation marginalizes domestic research activities and 
innovations to the point of irrelevance. Forrest’s succinct observation of Biafra 
is relevant here: “Under conditions of autarchy in war-time Biafra, scientists and 
engineers demonstrated the capacity to innovate and adapt existing technology. 
Although the value of this experience was recognized, attempts to carry over the 
achievements to peacetime were largely unsuccessful. The scientific personnel 
and engineers involved, quickly scattered” (1994:236). 

Biafra thus vindicates dependency theory, which holds that the more a country 
is cut off from the world system the more likely it is to realize its full human 
potential, which is the hallmark of national development. The blockade of Biafra 
was a de facto severance from the world system. It was a condition in which 
necessity became the mother of invention. But dependency theory also argues 
that foreign-trained scientists are so demoralized by the lack of available work 
resources that their knowledge acquired abroad is of little value to the solution 
of problems at home. This apparent contradiction in dependency theory can 
be reformulated to better effect according to the tenets of structuration theory, 
in which my arguments here have been anchored: namely, that skilled, capable, 
and determined agents can achieve national development by taking advantage of 
the rules, resources, and social capital provided by independence from the world 
system. That was the case in Biafra. 

If the creative spark in Biafra was ignited by the stress of war, as Arene has 
said, does it mean that Nigeria must fight another war before a similar spark 
occurs? If so, with whom will that war be fought? Granted, the sense of collective 
struggle for survival and possible victory was an enabling environment for 
the revolutionary application of scientific knowledge by Biafran scientists. But 
postwar Nigeria has had, and continues to have, plenty of war to fight in the 
social sphere. Since the end of the Nigerian-Biafran war, the majority of Nigerians 
have continued to suffer from poverty, destitution, disease, and varying forms of 
deprivation. These conditions should have inspired the kind of energetic national 
political leadership that kindles a sense of a shared identity, mutual understanding, 
common objectives, solidarity, collective commitment (social capital), and the 
resulting group motivation, determination, and effort characteristic of a society 
confronted with a real war situation. If challenge is what is required, the challenges 
are present. 

One major constraint, among others, is that postwar Nigeria lacks a sense of 
collective national ownership, with the attendant social capital and maximum 
harnessing of human agency that existed in Biafra, especially among the Igbo. Hence 
the science and technology system in Nigeria remains impervious to any form of 
material fortification and administrative mobilization. And hence the recognized 
and desired linkage between scientific research endeavors and the productive 
sector remains elusive more than four decades after independence and three 
decades after the civil war. 

The irony of scientific and technological practice in Nigeria came to light at 
the Sixth USA-Africa Conference on Manufacturing Technology held in Abuja 
in July 2002. Some forty Nigerian scientists and engineers in diaspora, most of 
them academics domiciled in the United States and Britain, convened to discuss 
how to improve manufacturing technology in Nigeria. They presented papers on 
science policy as well as on their bench work in science and engineering. The 
sophistication of the diaspora scientists' research, the clear connections between 
this research and specific functions in manufacturing, and the fact that some 
prestigious manufacturing firms in Britain and the United States were waiting to 
use this research made the diaspora Nigerians seem like visitors from Mars. The 
efficacy, relevance, and the immediate utility of their scientific activities in their 
countries of residence were unquestionable. Some of these conferees had practiced 
in Nigeria before they went abroad. That they could not achieve the same 
successes in Nigeria, nor can their equally qualified and brilliant counterparts who 
have remained at home there, attests to the entrenched imperviousness toward 
scientific and technological mobilization mentioned above. 

Thus Nigerian science and technology professionals shine and perform socially 
relevant science in their countries of residence abroad but cannot do the same 
at home. Their effectiveness in diaspora is not a function of pressures of war. 
Rather, they are operating in sociopolitical environments that recognize excellence 
and competence through adequate material and honorific rewards. Structuralism 
would argue that Nigerian scientists in diaspora perform socially relevant work 
because they operate in structures conducive to professional success. Such an 
argument should recognize that those structures were the outcome of human 
agency, whether intentionally or unintentionally designed. 

In a recent study of private manufacturing enterprises owned by indigenous 
entrepreneurs in Igbo states (Ukaegbu 2002), I found that most entrepreneurs 
and their managers did not know much about scientific research institutes in 
Nigeria. Some of them, in fact, could not name any of the institutes. Those who 
had some knowledge of the existence of the institutes did not think of them as 
potentially useful to their firms' operations. What has become of all the scientific 
and technological knowledge acquired by Nigerians both before and after the 
Nigerian-Biafran war? And where did the extraordinary scientific creativity 
exhibited by Biafran scientists go? Research institute employees, especially since 
the 1990s, are often idle. Many items of research equipment are also idle and 
covered with dust, clearly indicating lack of use. And many research facilities 
that break down are simply abandoned to rot away (Ukaegbu and Agunwamba 
1995). As Arene (1997) has pointed out, the fusion of the theoretical knowledge 
of academic scientists and engineers with the practical knowledge and skills of 
their counterparts in the productive sector was highly instrumental to Biafra's 
technological success. In contrast, the two sectors are decoupled and severely 
disarticulated in peacetime Nigeria. 

Informal conversations in July 2002 with a group of highly educated science 
and engineering personnel in positions and roles related to technological research 
and applications showed a visible frustration, throughout the group, with the 
lack of linkage between research and production in Nigeria. Government was 
the immediate target of blame, even though the respondents were themselves 
high-level government employees. But some also blamed an abstract entity they 
called the “Nigerian factor,” which on further probing turned out to signify the 
carefree and nonchalant attitude (a kind of national lackadaisical syndrome) 
of both the political leadership and the general citizenry toward the country's 
development. The government itself blames the multitude of constraining factors 
described earlier. To put it in the most ordinary parlance, everybody blames 
everybody else. 

This generalized blame syndrome implies three scenarios. First, the political 
leadership and the scientific and technological elite, past and present, have not 
known what to do to move the country onto a path of technological development. 
From that perspective the problem must seem very daunting. Hence political 
response has been either to shift the problem to the periphery of governmental 
and national attention, as if it does not exist, or to pay lip service to the need 
for change. Second, the leadership has been afraid of taking the risks involved 
in technological development. And third, the political class, with its mentality of 
dependence on outside assistance, has been unable to muster the will to move 
Nigeria toward taking its technological destiny into its own hands. 

In sum, the Nigerian scientific community should not be the primary focus 
of attributed culpability. Socially relevant ideas and knowledge may and indeed 
do emanate from the nation's scientists. But the actualization of such ideas is 
more often a function of intelligent and visionary public policy — not political 
corruption and complaisance toward the accelerated depletion of the country's 
strategic human and natural resources by the forces of globalization. 

Geisler (2001) aptly notes that the more knowledge-driven a society, the more 
it relies on new knowledge to maintain its institutions. Nigeria today is not a 
knowledge-driven society, its numerous educational institutions and bright, highly 
educated workforce notwithstanding. It seems not to be driven by anything, not 
even by a set of ideas cherished by the general public or even a majority of its 
citizens. Or if driven by anything at all, the motive force is a geoethnoreligious 
structure that renders political institutions incapable of mobilizing the general 
society for the promise of economic development. Every political group that 
assumes leadership of the country becomes suffocated in the entanglements of 
managing this complex and debilitating ethnocultural diversity. 

Perhaps Nigeria as presently configured is best described as a geoethnopolity 
of antiprogress. The rules and resources emanating from the nebulous principle 
of federal character regarding employment, political appointments, and elective 
office contravene the merit principle (recognition of competence), a core 
ingredient of success for Biafra's RAP. An overcentralized political economy makes 
the constituent states so dependent on the center that state leaders are unable to 
furnish their respective citizens with the societal visions and conditions that are 
conducive to the creative sparks and the strong social capital that could propel 
technological and economic development at regional and local levels. Similarly, 
research institutions and activities in Nigeria are so dependent upon and focused 
on the central government that state governors, commissioners, and state houses 
of assembly hardly think or talk about technological development as one of the 
challenges that demand their urgent attention. This is not to say that politicians 
at the central level of government seriously think and talk about it, either. For 
example, the majority of campaign speeches for federal and state positions during 
the 2003 elections, including speeches by all the presidential candidates, contained 
no clear plan for long- or short-term strategies for a forward leap in the areas of 
science, technology, and production. 

Short of war, an appropriate way to apply what we have seen in Biafra's RAP 
— to spark creativity and inspire human capital toward socially relevant science 
and technology in present-day Nigeria — would be to decentralize the complex 
Nigerian geoethnopolity in a manner more far-reaching than the current structure 
allows. This intentional agency might generate new rules and resources that 
would enable each of the resulting constituencies to confront the challenges of 
its own environment. Implicit in the idea of disentangling the geoethnopolity are 
the controversial concepts of restructuring, political decentralization, and true 
federalism or even confederalism, all of which have been commonly used in recent 
years to describe possible paths for Nigeria's political future. Most proponents 
of geopolitical restructuring have based their arguments on the premise that 
political centralization in an amorphous ethnocultural environment is the direct 
cause of incessant intergroup conflict in Nigeria. This perspective presupposes 
and advocates that a political decentralization more far-reaching than presently 
in force will reduce ethnoreligious territorial tensions, restore or create relative 
ethnoreligious homogeneity in the resulting regions/geopolitical zones, and 
promote peace and stability among divergent groups. 

Sociologically speaking, this basic premise that geopolitical decentralization 
will promote stability by reducing ethnoreligious tensions is seriously undermined 
by empirical conditions, because conflict is ubiquitous in human interaction. 
Recent troubles in Nigeria illustrate the point. The people of Umuleri and Aguleri 
in Anambra State have traditionally been perceived by outsiders as agnates (a 
patriclan/ummunna). But several years ago they engaged in a fratricidal war that 
took many lives and destroyed much property on both sides. Yet they are Igbo 
who live in close geographic proximity and have shared the same language and 
culture for ages. An intense Ife-Modakeke conflict in Yorubaland captured the 
attention of the Nigerian public for several years in the 1990s and showed signs 
of recurrence in 2003. Members of the maitasine group in Northern Nigeria 
staged attacks against fellow Muslims in the early 1980s. Similar intraregional 
conflicts have been documented in other parts of Nigeria. We must conclude that 
ethnocultural homogeneity is not a sufficient condition for peace in a community, 
or in the macro-society for that matter. 

A developmental argument for restructuring is superior to the ethnocultural 
alternative. In this case the quest for far-reaching politicoeconomic decentralization 
is based on the idea that the more the constituent parts take their destinies in their 
own hands, the more they will rise to the resulting challenges. Hence they will 
more readily tap the social capital and creative energies of their own populations 
for technological and economic development. In a Nigeria restructured without 
war, the expected outcome of intentional agency, the source of challenge, will be 
new rules and resources for which political leaders and their constituencies are 
largely responsible — and thus accountable to themselves — in terms of success 
or failure. So long as the central government remains the focus of immense 
political and economic power, the satellite constituencies will continue to depend 
on it. And the center, though itself historically inept as an agent of development, 
will remain a handy scapegoat on which to blame constituents’ failure to rise 
to the challenges of transforming their domains socially, economically, and 
technologically. 

In a country where individual and subgroup sentimental attachments 
reside more in the ethnocultural enclave than with national allegiances, the 
state is incapacitated to make and implement development-oriented policies. 
Kenny (2003) observes that because ethnic groups in Africa command far more 
loyalty and solidarity than the state, the state lacks legitimacy, is weakened, and 
therefore becomes unable to make and implement bold policies for growth 
and development. Kenny further notes that the longer the people of a diverse 
society stay together in their evolution as a nation-state, the more they develop 
the solidarity that can propel development. In other words, Nigeria is faced 
with a crucial choice: to reconfigure its geopolitical structure with the aim of 
accelerating development, or retain the constricting status quo and wait for a 
time that may never come. 

I would argue that the intractable and intense ethnic and religious divisions 
that persist in Nigeria stifle cohesive political vision and constrain politicians 
from implementing prodevelopment policies. Further, the country’s economic 
burden is presently shouldered by a handful of states, because the current 36- 
state structure has resulted in many economically unviable states that continue in 
existence for the sake of an illusory political stability. These two factors forestall 
emergence of the political rationality, courage, and clarity of purpose that leaders 
need in order to make viable policies and mobilize the public toward the path of 
development. Much time and effort is expended on balancing ethnic claims and 
maintaining a superficial political stability. Superficial national stability combined 
with a dismally declining quality of life and fragmentary societal infrastructure 
is the hallmark of a failed political project. In other words, rather than enabling 
development, the present structure constrains it. 

The failure of Biafra has had several aftereffects. From the federal government's 
perspective, especially, the defeat of Biafra returned Nigeria to the desired status 
quo of a unified country, with the gains typically anticipated from cultural 
diversity and political economies of scale. But those gains have not materialized 
in the appalling political, economic, and social conditions that have plagued 
Nigeria since the end of the war. To ignore this negative fallout, especially on the 
Biafran side, is thus to engage in a biased intellectual enterprise. The failure of 
Biafra aborted the foundations it had laid, as well as its readiness, for a takeoff in 
technological development. That takeoff would have shown that Black Africa was 
capable of technological development even if only at a crude and rudimentary 
level. The history of technology shows that societies quite often proceed rapidly 
from crude to polished stages of development. Hence, Black Africa lost a society, 
a golden opportunity, that not only might have served as a technological role 
model but could also have kindled the intraregional imitation and competition 
historically known as catalysts of technological advancement. As Onyeani (2003) 
emphatically states, Biafra would have become the most powerful black country in 
the world had it been given the chance to survive and continue with the process 
of manufacturing virtually everything it needed. 

Two years into the war, many Biafran citizens had understood the capabilities 
of their scientists, internalized their effectiveness, and reposed confidence 
in them as problem solvers. Given their breakthroughs, societal recognition 
of their work, and the positive reinforcement of seeing their inventions and 
innovations applied to practical uses, Biafran scientists would have continually 
pushed their activities to higher levels. But the Biafran scientist who created 
something from nothing could not be replicated in postwar Nigeria. The current 
Nigerian geopolitical structure has created a societal mentality of individual and 
subgroup indifference toward the nation's development. Excessive centralism in 
postwar Nigeria has precluded the emergence of the kind of social capital that 
could inspire its scientific and technological workforce toward innovation and 
development. This paradox is consistent with the duality of structure that we 
see in structuration theory: structure enabled the practice of socially relevant 
science in Biafra but constrains it in Nigeria. That Biafran scientists achieved 
technological relevance under severe material deprivation vindicates my theory 
that scientific and technological development, or development in general, is 
primarily propelled by maximum deployment of total available human essence 
(agency) whereas technological preconditions and material thresholds only serve 
as desirable accessories. 

Structuration consists of the conditions and the media by which structures 
are transformed into systems (Giddens 1984). The lessons of Biafra show that 
the production and reproduction of its socially relevant science and technology 
system was a dynamic process anchored in reciprocal interaction between the 
social environment and human agency. Postwar Nigeria has enormous human 
and natural resources. But its 36-state structure, enacted through the agency of 
various political actors, draws enormous allocative power and citizen attention to 
the central government, creating a mentality of dependency and role abdication 
among leaders of the states. Ethnoreligious tensions, an intractable foe, rob the 
society of the kind of social capital that could optimize human potential for 
creating viable institutions. Consistent with the assumptions of structuration 
theory, production and reproduction of social forms can be achieved through 
the instrumentality of human agency and structural conditions. Common 
sense therefore suggests that a development-driven geopolitical restructuring 
through deliberate political action is a very necessary precondition for national 
development in general and for the development of science and technology in 
particular. 


Note 


1. I had intended to include many names of leaders and employees of RAP in order to make 
my account vivid and concrete. But I was a very junior employee at the office of Head RAP, 
now more than 30 years ago, and Arene's book (1997) contains many more names than I 
could ever hope to have recollected. For additional detailed information on the composition 
of RAP units, see also Ogbudinkpa (1985). 
Abstract 


This study tested three types of theories of gender inequality in preindustrial societies 
by using half the societies in the Standard Cross-Cultural Sample: militarist, 
Marxian, and non-Marxian materialist theories. The first phase of the research 
used simple cross-tabulations with chi-square as a test of significance and gamma 
as a measure of association. The results from this phase showed no support for 
militarist theories, some support for Marxian theories, and substantial support 
for non-Marxian materialist theories. Since the first phase involved no control 
variables, a second phase was conducted using multivariate analyses. These analyses 
confirmed that militarist theories must be emphatically rejected, and that both 
Marxian and non-Marxian materialist variables help determine gender inequality. 
Non-Marxian materialist variables, however, explain much more of the variance 
in gender inequality than Marxian variables do. 


Gender inequality has been a pervasive feature of human social life for millennia. 
There is widespread, although hardly universal, agreement that all societies have 
been to some extent male-dominated (Goldberg 1993; Rosaldo and Lamphere 
1974). Be that as it may, it is clear that there is marked variation in the degree of 
gender inequality across societies. Explaining both the universal and the variable 
aspects of gender inequality is extremely important, but this article seeks to 
explain only the variable aspects. In doing so it tests three competing theories: 
militarist, Marxian, and non-Marxian materialist theories. 

Militarist theories (Collins 1975, 1985; Divale and Harris 1976) hold that 
societies with frequent warfare, or preparation for warfare, are likely to be more 
male dominated than other societies. Societies regularly engaged in warfare train 
males to be fierce and aggressive and denigrate feminine qualities, out of which 
develops a culture of male supremacy. Using half of the societies in the Standard 
Cross-Cultural Sample (SCCS) of Murdock and White (1969), Whyte (1978) 
tested this theory and found that warfare was actually related to a higher rather 
than a lower status for women. 

The first of several Marxian theories was proposed by Engels ([1884] 1970). 
Engels argued that in the earliest forms of human society women were equal with 
men. With the rise of social stratification and the state in social evolution, men 
gained control over private property and this came to be extended to control over 
women; as a result, women’s status plummeted. Karen Sacks (1975) has elaborated 
on Engels’s work. According to her, as societies moved from a communal mode of 
production to an incipient class mode, women were increasingly perceived as the 
property of men and became only indirectly related to the means of production. 
This gradual diminution of their importance to the means of production meant 
a gradual diminution of the economic power that women held in society, and 
hence a declining status. Whyte (1978) tested this overall argument with the 
data in the SCCS and found little empirical support for it. Hendrix and Hossain 
(1988), using the same sample but a larger number of variables, also found very 
little support. 

According to non-Marxian materialist theories, ecological, technological, and 
economic factors are the most important determinants of women’s status. The 
greater the extent to which women are involved in economic production, the 
higher their status tends to be (Blumberg 1984, 1991; Chafetz 1984; Martin and 
Voorhies 1975). Blumberg (1984, 1991) emphasizes that the key factor is women’s 
level of economic power; the greater the extent of this power, the higher women’s 
overall status. Non-Marxian materialist theories are similar to Marxian theories, 
except that the latter tend to emphasize the role of social class and stratification 
and the former do not. A number of Whyte’s (1978) empirical analyses relate to 
this type of argument. For example, he found that the use of the plow strongly 
predicted a low female status, primarily because plow agriculture is one in which 
men predominate and women lose much of their productive role and thus their 
economic power. 

Collins et al. (1993) have attempted to draw into a comprehensive, synthesized 
theory many of the factors of all three types of theory. Their laudable attempt 
results, however, in an extremely complex set of flow diagrams and models that 
would likely prove extraordinarily difficult if not impossible to test, especially 
with the data that are available. Indeed, they provide no empirical tests of their 
models. 
HYPOTHESES 


We formulate three hypotheses that correspond to the three theoretical 
perspectives. Hypothesis 1 is derived from militarist theories: 
Hypothesis 1: The greater the prevalence of warfare, the lower the status 
of women. 


Hypothesis 2 is derived from Marxian theories: 


Hypothesis 2a: The greater the control women have over the products of 
their own labor, the higher their status. 


Hypothesis 2b: The greater the degree of stratification, the lower the status 
of women. 


Hypothesis 3 is derived from non-Marxian materialist theories: 


Hypothesis 3a: The greater the contribution of gathering to the total food 
supply, the higher the status of women. 


Hypothesis 3b: The greater the contribution of women to overall 
subsistence, the higher their status. 


Hypothesis 3c: In agricultural societies, the greater the contribution of 
women to agricultural labor, the higher their status. 


Hypothesis 3d: In agricultural societies, the greater the intensity of 
cultivation, the lower the status of women. 


Hypothesis 3e: In patrilineal societies the status of women will be lower 
than in nonpatrilineal societies. 


Hypothesis 3f: In patrilocal or virilocal societies the status of women will 
be lower than in nonpatrilocal or nonvirilocal societies. 


Data 


The data set used to test these hypotheses is Murdock and White's (1969) Standard 
Cross-Cultural Sample, which contains 186 pre-industrial societies. To make the 
SCCS a representative sample, Murdock and White divided the world into 186 
regions and then selected one society from each of the regions. Region selection 
was intended to solve “Galton’s problem” of cultural diffusion. Assuming that 
diffusion can and will occur when two cultures are in contact with each other, 
Galton’s problem suggests that any sample could be biased by cultural diffusion. 
Murdock and White solved the problem by selecting regions that were separated 
by one or more significant geographical boundaries, boundaries that would limit 
or eliminate contact between neighboring societies. Within each region, the society 
that was selected was the best documented society. This means that the SCCS is, 
strictly speaking, a nonrandom (but nonetheless highly representative) sample. 

Anthropologists, missionaries, and others have recorded the information about 
the societies of the SCCS. All ethnographies were gathered prior to 1950 and in 
most cases before significant impact was made by modern societies. Controversy 
has surrounded the use of such data sets. Data quality differs from one society 
to another because of the different kinds of persons — missionaries, travelers, 
professional ethnographers, etc. — who were collecting the data, their motives 
for doing so, and their levels of competence. Data sets like the SCCS are still, 
however, widely used by anthropologists and some sociologists, and such studies 
have often produced compelling findings. We contend that the various measures 
used in this study have a high degree of face validity. For example, coding whether 
societies engaged in warfare, used the plow, or were patrilineal, and the like, 
are relatively straightforward constructs with meaningful indicators. The more 
problematic issue, in our view, pertains to the reliability of the measures. Our 
basic argument, however, is that, since reliability error attenuates correlation 
coefficients, if statistically and substantively significant findings emerge, then they 
actually underestimate the strength of the relationships discovered rather than 
distort the overall pattern of results. Overall, if the data were completely lacking 
in validity and reliability, then no significant relationships should be found. 
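
The attenuation argument above can be illustrated with a minimal sketch. It assumes the classical (Spearman) attenuation formula, in which the observed correlation equals the true correlation multiplied by the square root of the product of the two reliabilities. The function name and all numeric values are hypothetical and are not taken from the SCCS data.

```python
import math


def attenuated_correlation(true_r, reliability_x, reliability_y):
    """Observed correlation under random measurement error (classical test theory)."""
    for reliability in (reliability_x, reliability_y):
        if not 0.0 < reliability <= 1.0:
            raise ValueError("reliabilities must lie in the interval (0, 1]")
    return true_r * math.sqrt(reliability_x * reliability_y)


# Hypothetical values chosen only for illustration.
true_r = 0.50
observed_r = attenuated_correlation(true_r, reliability_x=0.60, reliability_y=0.70)
print(round(observed_r, 3))
```

Under these assumptions the observed coefficient is always closer to zero than the true one, which is why unreliable measures would be expected to understate rather than inflate the associations.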

In 1978, Whyte created gender status variables for the SCCS. In the interest of 
saving an enormous amount of time and effort, Whyte coded every other society 
starting with Nama Hottentot and concluding with Tehuelche until he had 93 
with which to work (see Whyte 1978 for a list of the 93 societies he coded). Whyte 
then created a set of 52 dependent variables. For purposes of simplification, he 
reduced the number of dependent variables by using cluster analysis. This yielded 
10 dependent variable scales. These scales are the ones Whyte used in his own 
research on women’s status. 


DEPENDENT VARIABLES 


We used three of Whyte’s dependent variable scales as measures of the status of 
women in preindustrial societies. These scales were chosen over Whyte’s seven 
other scales for two reasons. First, they had the strongest correlations with the 
independent variables in Whyte’s own empirical analyses, which suggests that 
they are the three best measures of women’s status. Second, they most clearly 
operationalize the concepts in the theoretical categories that we test. We also 
summed the three measures to create a composite measure of gender inequality. 
The dependent variable scales are the following: 


(1) Domestic authority of women, which is a cluster of three dependent 
variables: final authority over infants, final authority over children, and 
lack of male dominance over wives. This scale has an average interitem 
correlation of .375. It is coded low, medium, and high, where the higher 
the score, the more domestic authority women have. 


(2) Ritualized female solidarity, which is a cluster of five dependent variables: 
exclusively male work organizations, exclusively female work organizations, 
menstrual cycle taboos, female initiation ceremonies, and lack of a belief 
in female inferiority. This scale has an average interitem correlation of 
.247. It ranges from low to high, where the higher the score, the greater 
the ritualized separation between men and women. 


(3) Control over women's marital and sexual lives (referred to in this article as 
“male control over female sexuality”), which is a cluster of four dependent 
variables: lack of a premarital double standard, lack of an extramarital 
double standard, remarriage ease, and relative age at first marriage. This 
index of male control over female sexuality has an average interitem 
correlation of .242 and is dichotomized into stricter and less strict levels 
of control. 


(4) Composite gender inequality variable, which is a summed index of the other 
three dependent variables (Cronbach's alpha = .41). We created this 
variable to strengthen the measurement of gender inequality by creating 
a more sensitive measure of gender status across societies. The validity of 
the composite measure is supported by its rather robust correlations with 
all but two of the Marxian and non-Marxian materialist variables. While 
the Cronbach's alpha is relatively low, even for a three-item measure, the 
result will be to underestimate the statistical effects of the independent 
variables rather than alter the pattern of results. 
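
As a rough aid to reading the average interitem correlations reported for the three scales, the sketch below converts each one into a standardized Cronbach's alpha using the Spearman-Brown form, alpha = k * r / (1 + (k - 1) * r). This is an illustration only: it assumes standardized items, it is not necessarily how the reported alpha of .41 for the composite was computed, and the function name is hypothetical.

```python
def standardized_alpha(n_items, mean_interitem_r):
    """Standardized Cronbach's alpha from the number of items and their mean correlation."""
    if n_items < 2:
        raise ValueError("alpha requires at least two items")
    return n_items * mean_interitem_r / (1 + (n_items - 1) * mean_interitem_r)


# (number of items, average interitem correlation) as reported for each scale.
scales = {
    "domestic authority": (3, 0.375),
    "ritualized female solidarity": (5, 0.247),
    "male control over female sexuality": (4, 0.242),
}

alphas = {name: standardized_alpha(k, r) for name, (k, r) in scales.items()}
for name, alpha in alphas.items():
    print(f"{name}: {alpha:.2f}")
```

Because alpha rises with the number of items, a scale with a lower average interitem correlation can still reach a comparable alpha if it contains more items.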


INDEPENDENT VARIABLES 


Three variables from the SCCS data bank were used to measure warfare: 
Internal warfare: Warfare fought in and around a society's own territories 
(trichotomized into infrequent, frequent, and continual). 


External warfare: Warfare fought at considerable distances from a society's 
own territories (very low, low, moderate, high, very high). 


Overall warfare: Operationalized as the average of internal and external 
warfare (very low, low, moderate, high, very high). 

The Marxian hypothesis was measured by two independent variables: 
Female economic control of the products of their own labor (absent, 
present). 


Class stratification (egalitarian, wealth distinctions only, elite or dual, 
complex). 
The non-Marxian materialist hypothesis and its subtypes were measured by the 
following independent variables: 


Percentage of the contribution of gathering to the food supply (none, less 
than 10%, less than 50% and less than any other, less than 50% but more 
than any other, more than 50%). 


Subsistence type (foraging, shifting cultivation with digging sticks, shifting 
cultivation with metal hoes, intensive agriculture without the plow, 
intensive agriculture with the plow). 


Percentage of the contribution of women to overall subsistence (low, 
moderate, high). 


Percentage of the female contribution to agriculture (0–10%, 20%–40%, 
50%–60%, 70% or more). 


Intensity of cultivation (no agriculture or casual agriculture, extensive 
cultivation or horticulture, intensive agriculture with or without 
irrigation). 


Use of the plow (absent, present). 
Patrilineality (yes, no). 


Patrilocal or virilocal residence (yes, no). 


DATA ANALYSIS 


The data analyses were performed in two stages. First, we used ordinary chi-square 
tests as a test of statistical significance and gamma as a measure of association for 
bivariate analyses. Thirty-nine bivariate cross-tabulations were conducted to test 
the hypotheses, one for each combination of an independent and a dependent 
variable and each hypothesis. To control for potential spuriousness, in the second 
stage of the analysis we conducted a series of multivariate analyses, one for each 
of the original dependent variables. For the two dependent variables that were 
ordinal, domestic authority of women and ritualized female solidarity, we used 
ordered (proportional odds or cumulative) logistic regression. We also computed 
unordered multinomial logistic regression equations for comparison. For all 
models, significance tests suggested that the unordered multinomial logistic 
regression equations were unnecessary. Therefore, the models were recomputed 
using ordinary least squares regression and compared to the ordered logistic 
regressions. Because these models produced identical conclusions, the ordinary 
least squares regressions are shown for ease of interpretation. For the dichotomous 
dependent variable, male control over female sexuality, we used binary logistic 
regression. For the composite gender inequality variable, which ranged in value 
from 3 to 9, we used ordinary least squares regression. 
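
The first-stage statistics can be sketched as follows. The code computes Goodman and Kruskal's gamma and the Pearson chi-square statistic for a single ordered cross-tabulation. The table of counts is hypothetical (rows might stand for plow absent or present, columns for low, medium, and high status) and is not drawn from the SCCS; the function names are likewise illustrative.

```python
def goodman_kruskal_gamma(table):
    """Gamma = (C - D) / (C + D) for a cross-tab with ordered rows and columns."""
    concordant = 0
    discordant = 0
    n_rows, n_cols = len(table), len(table[0])
    for i in range(n_rows):
        for j in range(n_cols):
            for k in range(i + 1, n_rows):
                for m in range(n_cols):
                    if m > j:    # higher on both variables: concordant pairs
                        concordant += table[i][j] * table[k][m]
                    elif m < j:  # higher on one, lower on the other: discordant
                        discordant += table[i][j] * table[k][m]
    if concordant + discordant == 0:
        raise ValueError("gamma is undefined when there are no untied pairs")
    return (concordant - discordant) / (concordant + discordant)


def pearson_chi_square(table):
    """Return the Pearson chi-square statistic and its degrees of freedom."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df


# Hypothetical counts, for illustration only.
hypothetical_table = [[10, 20, 30],
                      [15, 10, 5]]
gamma = goodman_kruskal_gamma(hypothetical_table)
chi_square, df = pearson_chi_square(hypothetical_table)
print(round(gamma, 2), round(chi_square, 2), df)
```

Gamma ignores tied pairs, so it tends to be larger in absolute value than correlation coefficients computed on the same table; the chi-square statistic would be compared against the critical value for the stated degrees of freedom.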
BUILDING THE MODELS 


For all four dependent variables, we ran a series of ordered, hierarchical models 
with the warfare variables entered first into the model, followed by the Marxian 
variables, and then by the non-Marxian variables. Because of the small sample size, 
to preserve degrees of freedom and to minimize problems with multicollinearity, 
we used a stepwise approach by testing the effect of each independent variable 
in each block one at a time. Because of the small sample size and low power, we 
retained any independent variable that achieved a one-tailed significance level of 
p ≤ .10 (see Hosmer and Lemeshow 1989). Variables not achieving a significance 
level of .10 for entry were not included in the final models. Because the bivariate 
analyses gave no support for the warfare hypotheses, we entered warfare variables 
into the models first to give them the greatest opportunity to be included in the 
final models. We entered the Marxian variables next, because we wanted to give 
credit for any mutually (with other materialist variables) explained variance 
to the Marxian variables rather than to the non-Marxian materialist variables. 
This strategy provided the most conservative test of the non-Marxian materialist 
theories. We felt our ordered strategy was the most effective way to pit the three 
theories against one another. In a sense, non-Marxian materialist theories are not 
contradictory to Marxian explanations; they build upon Marxist explanations. 
Therefore, if non-Marxian materialist variables were not predictive above and 
beyond Marxian variables, then this would provide greater support for Marxian 
models and suggest other materialist variables are not necessary to understand 
gender inequality. However, if non-Marxian materialist variables were significant 
determinants of gender inequality after controlling for significant Marxian 
variables, we could conclude that Marxian theories by themselves are inadequate 
to explain gender inequality. 
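
The logic of entering blocks in a fixed order can be sketched with synthetic data. The example fits ordinary least squares first with a single "earlier block" predictor and then with a "later block" predictor added, and reports the increment in R-squared. Everything here is an assumption made for illustration: the variable names, the simulated coefficients, and the small pure-Python solver are not the authors' data or software.

```python
import random


def solve(matrix, rhs):
    """Solve a linear system by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    m = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def r_squared(columns, y):
    """R-squared of an OLS fit of y on an intercept plus the given predictor columns."""
    n = len(y)
    design = [[1.0] + [col[i] for col in columns] for i in range(n)]
    p = len(design[0])
    xtx = [[sum(row[a] * row[b] for row in design) for b in range(p)] for a in range(p)]
    xty = [sum(row[a] * y[i] for i, row in enumerate(design)) for a in range(p)]
    beta = solve(xtx, xty)
    fitted = [sum(b * x for b, x in zip(beta, row)) for row in design]
    mean_y = sum(y) / n
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


# Synthetic data: two correlated predictors that both affect the outcome.
random.seed(0)
n = 80
stratification = [random.gauss(0, 1) for _ in range(n)]
plow = [0.5 * s + random.gauss(0, 1) for s in stratification]
status = [-0.4 * s - 0.5 * p + random.gauss(0, 1) for s, p in zip(stratification, plow)]

r2_block1 = r_squared([stratification], status)        # earlier block only
r2_block2 = r_squared([stratification, plow], status)  # later block added
increment = r2_block2 - r2_block1
print(round(r2_block1, 2), round(r2_block2, 2), round(increment, 2))
```

Because the two predictors are correlated, any variance they explain jointly is credited to whichever block enters first; the increment for the later block reflects only what it adds beyond the earlier one, which is why entering a theory's variables last is the conservative test of that theory.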


Results 
PHASE 1: BIVARIATE RESULTS 


Militarist Hypothesis 


Table 1 shows the bivariate results that test the hypotheses. The results of the 
nine tests of the militarist hypothesis show that it must be emphatically rejected. 
All but one of the associations between warfare and gender inequality are not 
significant, and the one association that is significant is positive (gamma = .54; 
p < .01, between internal warfare and male control of female sexuality), which 
is in the opposite direction from that predicted by the militarist theory. In fact, five of the 
nine associations are in the opposite direction from that which was predicted. 
The assertion by Divale and Harris (1976) and by Collins (1975, 1985) that the 


1432 / Social Forces 83:4, June 2005 


Table 1. Bivariate Tests of the Militarist, Marxian, and Non-Marxian 
Materialist Hypotheses 

Theory/Hypothesis                      Domestic        Control of      Female
                                       Authority       Sexuality       Solidarity
Militarist
  Internal warfare                      .07             .54 (p < .01)   .16
  External warfare                     -.02            -.22             [illegible]
  Overall warfare                      -.10            -.16             [illegible]
Marxian
  Female economic control              [illegible]     -.31             .53 (p < .01)
  Class stratification                 -.41 (p < .01)  [illegible]      [illegible]
Non-Marxian Materialist
  Contribution of gathering             .23             .38*            [illegible]
  Subsistence type                     [illegible]     -.42**          -.54***
  Female contribution to subsistence    .05             .13            -.01
  Female contribution to agriculture    .46***          .32             [illegible]
  Intensity of cultivation             -.29*           -.39*           -.40**
  Use of the plow                      -.51**          -.63**          -.58***
  Patrilineality                       -.22             .05            -.52***
  Patrilocal/virilocal residence       -.44**          -.27            -.43**

Numbers in the table are gammas. Ns range from 67 to 93. 
Values shown with a p level are those reported in the text. 
[Significance legend illegible in source] 


greater the level of warfare, the lower the status of women, has no empirical 
support. It is true that men dominate the means of warfare, but this does not 
necessarily prevent women from having a relatively high degree of domestic 
authority, less strict controls on their sexuality by men, or the ability to form 
solidarity groups. 

Divale and Harris's version of the militarist hypothesis focuses on band and 
tribal societies, whereas Collins is concerned mainly with militarism and warfare 
in large-scale agrarian societies. Therefore, we computed separate correlations for 
three different types of societies (results not shown). For hunter-gatherer societies, 
overall warfare was moderately correlated with female domestic authority 
(r = -.32), but this was the only correlation that supported the militarist hypothesis. 
Overall warfare correlated .24 with female solidarity and .25 with male control 
of female sexuality; and even more striking, external warfare correlated .49 with 
female solidarity and .52 with male control of female sexuality. These correlations 
are in the opposite direction from that predicted by the theory. 

For horticultural societies, female domestic authority and male control of 
female sexuality were uncorrelated with warfare, but ritualized female solidarity 
correlated .45 with overall warfare. Finally, for agrarian or intensive agricultural 
societies, female domestic authority and female solidarity were not correlated with 
warfare, but male control of female sexuality correlated .38 with overall warfare 
and .55 with external warfare. Once again, however, all these correlations were 
in the opposite direction from that predicted by the theory. Thus, there does not 
appear to be an interaction effect between type of society and warfare, driving 
yet another nail into the militarist theory's coffin. 


Marxian Hypothesis 


Table 1 shows six tests of the Marxian hypothesis. The first three tests used the 
independent variable, “female economic control of products of their own labor.” 
Of these tests, only the one using female solidarity as the dependent variable was 
statistically significant and in the direction predicted by the theory (gamma = .53; 
p<.01). The next three tests used class stratification as the independent variable. 
Although the relationships are in the predicted direction, only one (involving 
domestic authority of women) was both statistically significant and of moderate 
strength (gamma=-.41; p<.01). In sum, the Marxian hypothesis is only partially 
supported. Three of the six statistical tests achieved significance, but only two of 
the six relationships were of moderate strength. 


Non-Marxian Materialist Hypothesis 


The 24 tests of the non-Marxian materialist hypothesis are shown in Table 1. 
Sixty-seven percent (16 of 24) of these tests were statistically significant, and in 
12 of the 24 relationships the measure of association was moderate to strong 
(gamma = .40 or higher). The variables that were most highly correlated with the 
dependent variables were use of the plow, subsistence type, female contribution to 
agriculture, patrilocality/virilocality, and intensity of cultivation. Across all three 
dependent variables, the average correlation for these independent variables was, 
respectively, .57, .43, .38, .38, and .36. The only non-Marxian materialist variable 
that was clearly not associated with gender inequality was women’s contribution 
to subsistence (average r=.06). 


Summary of Bivariate Results 


In this preliminary segment of the research, there were 39 total tests of three 
hypotheses (13 independent variables each being associated with 3 dependent 
variables). The militarist hypothesis received no support at all, failing to achieve 
statistical significance in the predicted direction in any of its tests. The Marxian hypothesis had 
some support, with three of the six statistical tests being significant and two of 
the six relationships of at least moderate strength. The non-Marxian materialist 
hypothesis clearly achieved the best support, with 67% of the 24 statistical tests 
being significant and 50% of the relationships of moderate to high strength. We 
can also look at the results in terms of which independent variables were at least 
moderately associated with at least two of the dependent variables. None of the 
independent variables that were part of the militarist or Marxian hypotheses were 
moderately or strongly associated with at least two of the dependent variables, but 
four of the eight non-Marxian materialist independent variables were moderately 
or strongly associated with at least two of the three dependent variables. 


PHASE 2: MULTIVARIATE RESULTS 


Male Control of Female Sexuality 


The multivariate results for male control of female sexuality are presented in 
Table 2, along with the zero-order logistic regression results. As the results 
show, only four independent variables were significant under bivariate analysis. 
Only one independent variable, intensity of cultivation, was significant net of 
the other variables in the model. Societies with a lower intensity of cultivation 
are more likely to give higher status to women, which is supportive of a non- 
Marxian materialist explanation. None of the warfare and Marxian variables 
was an important determinant of male control of sexuality. The pseudo R² was 
low (4%), and the model accurately classified only 64% of the cases, suggesting 
that the ability of the independent variables to explain male control of female 
sexuality was quite modest. 
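
For readers less familiar with logistic regression output, the relation between a coefficient and its odds ratio can be checked with a short sketch. It assumes only the standard identity that the odds ratio equals exp(B); the value -.33 is the final-model coefficient for intensity of cultivation reported in Table 2, and the function name is illustrative.

```python
import math


def odds_ratio(coefficient):
    """Convert an unstandardized logistic regression coefficient to an odds ratio."""
    return math.exp(coefficient)


intensity_b = -0.33  # final-model B for intensity of cultivation (Table 2)
intensity_or = odds_ratio(intensity_b)
percent_change = (intensity_or - 1.0) * 100.0

print(round(intensity_or, 2))    # 0.72
print(round(percent_change, 1))  # -28.1
```

Read this way, each one-step increase in intensity of cultivation is associated with roughly a 28% reduction in the odds of a society falling in the higher-coded category of the dependent variable.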


Domestic Authority of Women 


The multivariate results for domestic authority of women are shown in Table 
3. Results from the ordinary least squares regression are shown because the 
results are quite similar to (and conclusions the same as) the ordered logistic 
regression. Although eight independent variables were significant under bivariate 
analysis, only three variables were significant in the multivariate model. Societies 
with greater class stratification, intensive agriculture, and patrilocal or virilocal 
residence were more likely to have lower levels of domestic authority for women. 
None of the warfare variables was a significant determinant of domestic authority. 
The one significant Marxian variable, class stratification, explained 14% of the 
variance, while the two significant non-Marxian materialist variables, subsistence 
type and locality of residence, explained an additional 11% of the variance. Thus, 
the overall model explained a rather robust 25% of the variance. 


Female Solidarity 


Multivariate results for female solidarity from the ordinary least squares regression 
are shown in Table 4 because they were quite similar to the ordered logistic 
regression models. Although 10 independent variables were significant under 
Table 2. Logistic Regression Results for Male Control of Female Sexuality 
(N = 72) 

                                            Zero-order                Final Model
Independent Variables                       Bᵃ           Odds Ratio   B        Odds Ratio
Warfare Variables
  Frequency of warfare                      -.14          .87         --       --
  Internal warfare                           .01         1.01         --       --
  External warfare                          -.19          .83         --       --
  Pseudo R²                                                           --
Marxian Variables
  Female economic control                   -.65          .52         --       --
  Social stratification                     -.18          .84         --       --
  Class stratification                      -.14          .87         --       --
  Pseudo R²                                                           --
Non-Marxian Materialist Variables
  Percentage contribution of gathering       .59†        1.81         --       --
  Subsistence type                          -.40†         .67         --       --
  Female contribution to subsistence        [illegible]  1.26         --       --
  Female contribution to agriculture         .19         1.21         --       --
  Intensity of cultivation                  -.25†         .78         -.33†    .72
  Use of plow                               [illegible]†  .23         --       --
  Patrilineality                             .09         1.09         --       --
  Patrilocality/virilocality                -.56          .57         --       --
  Pseudo R²                                                           .04
Overall correct classification                                        64%
Model chi-square (df)                                                 [illegible]

ᵃ B = Unstandardized logistic regression coefficient 
† p < .10 (one-tailed)  * p < .05 (one-tailed)  ** p < .01 (one-tailed) 





bivariate analysis, only four were statistically significant in the multivariate 
models. Societies with greater female economic control, lower degrees of class 
stratification, greater female contribution to agriculture, and nonpatrilineal 
descent were more likely to have greater female solidarity. None of the warfare 
variables was a significant determinant of female solidarity. The two Marxian 
variables, female economic control and class stratification, were significant 
determinants of female solidarity, and together they explained a robust 29% 
of the variance. Two non-Marxian materialist variables, female contribution to 
agriculture and patrilineality, were significant determinants of female solidarity. 
Together, they increased the explained variance in female solidarity by 11% to a 
very robust 40% for the overall model. 




Table 3. Zero-order (Ordered) Logistic Regression Coefficients and OLS 
Regression Results for Domestic Authority of Women (N = 71) 

                                                         Model 1           Model 2
Independent Variables                       Zero-order   Bᵃ       Beta     B        Beta
Warfare Variables
  Frequency of warfare                      -.09         --       --       --       --
  Internal warfare                          -.16         --       --       --       --
  External warfare                          -.02         --       --       --       --
  R²                                                     --                --
Marxian Variables
  Female economic control                    .25         --       --       --       --
  Social stratification                     -.41*        --       --       --       --
  Class stratification                      -.42**       -.18**   -.38     -.11*    -.22
  R²                                                     .14
Non-Marxian Materialist Variables
  Percentage contribution of gathering       .40†                          --       --
  Subsistence type                          -.50**                         -.10†    -.19
  Female contribution to subsistence         .12                           --       --
  Female contribution to agriculture         .13                           --       --
  Intensity of cultivation                  -.26†                          --       --
  Use of plow                              -1.22*                          --       --
  Patrilineality                            -.52                           --       --
  Patrilocality/virilocality               -1.05†                          -.43**   -.26
  R²                                                                       .25

ᵃ B = Unstandardized regression coefficient 
Beta = Standardized regression coefficient 
† p < .10 (one-tailed)  * p < .05 (one-tailed)  ** p < .01 (one-tailed) 


Composite Gender Inequality Variable 


Multivariate results from ordinary least squares regression equations for the 
composite gender inequality variable are shown in Table 5. Although nine 
independent variables were significant under bivariate analysis, only two were 
significant in the final multivariate model. Societies with intensive agriculture 
and in which females made a lesser contribution to agriculture had more gender 
inequality. Societies that had more class stratification also had more gender 
inequality, although the Marxian variable was not statistically significant in 
the final model, once non-Marxian materialist variables were allowed to enter. 
None of the warfare variables was a significant determinant of gender inequality. 
A Marxian variable, class stratification, explained 21% of the variation in the 
composite measure of gender inequality. The non-Marxian materialist variables 
Table 4. Zero-order (Ordered) Logistic Regression Coefficients and OLS 
Regression Results for Female Solidarity (N = 54) 

                                                          Model 1               Model 2
Independent Variables                       Zero-order    Bᵃ           Beta     B            Beta
Warfare Variables
  Frequency of warfare                       .17          --           --       --           --
  Internal warfare                           .06          --           --       --           --
  External warfare                          [illegible]   --           --       --           --
  R²                                                      --                    --
Marxian Variables
  Female economic control                   [illegible]   [illegible]  [illegible]  [illegible]  .20
  Social stratification                     -.39**        [illegible]  -.47     -.17**       -.35
  Class stratification                      -.31 (flag illegible)  --  --       --           --
  R²                                                      .29
Non-Marxian Materialist Variables
  Percentage contribution of gathering       .75**                              --           --
  Subsistence type                          -.66**                              --           --
  Female contribution to subsistence        -.01                                --           --
  Female contribution to agriculture         .37**                              .06†         [illegible]
  Intensity of cultivation                  -.36**                              --           --
  Use of plow                              -1.47**                              --           --
  Patrilineality                           -1.28**                              -.41*        -.26
  Patrilocality/virilocality               -1.02**                              --           --
  R²                                                                            .40

ᵃ B = Unstandardized regression coefficient 
Beta = Standardized regression coefficient 
† p < .10 (one-tailed)  * p < .05 (one-tailed)  ** p < .01 (one-tailed) 





increased the explained variance by 17 percentage points to a very robust 38% 
in the final model. 

When pairwise deletion of data was used (results not shown), the results 
were very similar, although use of the plow and intensity of cultivation were also 
statistically significant and the total explained variance was 52%. 


Summary of the Multivariate Results 


The multivariate results support the conclusion that warfare variables do not 
determine dimensions and levels of gender inequality. Marxian variables, on the 
other hand, do explain modest amounts of the variance in gender inequality. Non- 
Marxian materialist variables consistently contribute significant and substantial 
amounts to the explanation of gender inequality. 
Table 5. Zero-order Correlation Coefficients and OLS Regression Results 
for Composite Gender Inequality Variable (N = 53) 











Model 1 Model 2 
Independent Variables Zero-order B Beta B Beta 
Warfare Variables 
Frequency of warfare —.002 — — — — 
Internal warfare .05 — — — — 
External warfare .003 — — — — 
R² — — 
Marxian Variables 
Female economic control —.17 — — — — 
Social stratification .38* ote .46 .09 Jn 
Class stratification 568 — —- — — 
R² .21 
Non-Marxian Materialist Variables 
Percentage contribution of gathering —.36** — — 
Subsistence type 30 15 em 38 
Female contribution to subsistence —.03 — — 
Female contribution to agriculture —46** —13* —24 
Intensity of cultivation 9275 — — 
Use of plow 44** — — 
Patrilineality dd M — — 
Patrilocality/virilocality dns — — 
R² .38 


aB = Unstandardized regression coefficient 
Beta = Standardized regression coefficient 


†p < .10 (one-tailed) *p < .05 (one-tailed) **p < .01 (one-tailed) 


To provide a more definitive test of the relative effects of the Marxian versus 
the non-Marxian materialist variables, and to ensure that there were no suppressor 
(spurious nonassociation) effects (especially for the militarist variables), an 
additional set of multivariate analyses was performed. In this case the non- 
Marxian materialist variables were entered first, the Marxian variables second, and 
the militarist variables at the end. With respect to women’s domestic authority, 
the non-Marxian materialist variables accounted for 11% of unique variance, 
the Marxian variables for 3% of unique variance, and the two sets of variables 
mutually explained 11% of the variance. In looking at the female solidarity 
variable, the non-Marxian materialist variables accounted for 11% of unique 
variance, Marxian variables accounted for 12%, and they mutually explained 
17% of the variance. Finally, in looking at the composite gender variable, the 
non-Marxian materialist variables accounted for 17% of unique variance and 
the two sets of variables mutually explained 20% of the variance. The Marxian 
variable class stratification explained only 1% of unique variance, and this was 
not statistically significant. When the militarist variables were entered at the very 
end, they failed to explain any of the variance in the gender variables, ruling out 
the possibility of suppressor effects. 
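
The unique and mutually explained components reported above can, in principle, be recovered from the R² values of nested regressions: the unique share of one block is the gain in R² when that block is entered after the other, and the mutual share is whatever remains. The following Python sketch illustrates that logic on synthetic data. It is not the authors' procedure or data; the function names, the use of a single predictor per block, and the simulated variables are assumptions made purely for illustration.

```python
import random

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def partition_variance(a, b, y):
    """Split the R^2 of y on (a, b) into unique and shared parts."""
    r_ya, r_yb, r_ab = pearson(y, a), pearson(y, b), pearson(a, b)
    r2_a, r2_b = r_ya ** 2, r_yb ** 2
    # R^2 of the two-predictor OLS model, from the three correlations.
    r2_ab = (r2_a + r2_b - 2 * r_ya * r_yb * r_ab) / (1 - r_ab ** 2)
    return {
        "unique_a": r2_ab - r2_b,      # gain from entering a after b
        "unique_b": r2_ab - r2_a,      # gain from entering b after a
        "shared": r2_a + r2_b - r2_ab, # variance the two explain jointly
        "total": r2_ab,
    }

# Synthetic, hypothetical data: two correlated predictors and an outcome.
rng = random.Random(0)
common = [rng.gauss(0, 1) for _ in range(200)]
a = [c + rng.gauss(0, 1) for c in common]
b = [c + rng.gauss(0, 1) for c in common]
y = [ai + 0.5 * bi + rng.gauss(0, 1) for ai, bi in zip(a, b)]

parts = partition_variance(a, b, y)
```

By construction the three components sum to the total R², which is why the order of entry changes how the explained variance is attributed but not how much is explained overall.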

The results thus suggest that, although the Marxian variable class stratification 
has explanatory importance, overall the non-Marxian materialist variables are 
clearly substantially more powerful in explaining variations in gender status 
in our sample of societies. The average unique variance explained by the non- 
Marxian materialist variables across all dependent variables (excluding control 
of female sexuality) was 13%, whereas the average unique variance explained by 
the Marxian variables was only slightly more than 5%. The mutual explained 
variance across the same dependent variables averaged 16%. This gives us a total 
variance explained of 34%, which is quite robust, especially when considering 
the relatively weak measurement of the dependent variables. 
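
As an arithmetic check, the averages quoted above follow directly from the three sets of percentages reported for women's domestic authority, female solidarity, and the composite measure. The short Python sketch below only restates those published figures; the dictionary keys are labels chosen here for convenience.

```python
# Percentages of variance as reported in the text for each dependent variable.
reported = {
    "domestic authority": {"non_marxian": 11, "marxian": 3, "mutual": 11},
    "female solidarity": {"non_marxian": 11, "marxian": 12, "mutual": 17},
    "composite": {"non_marxian": 17, "marxian": 1, "mutual": 20},
}

components = ("non_marxian", "marxian", "mutual")

# Average each component across the three dependent variables.
averages = {
    name: sum(row[name] for row in reported.values()) / len(reported)
    for name in components
}

# Total explained variance per dependent variable and on average.
totals = {dv: sum(row.values()) for dv, row in reported.items()}
average_total = sum(averages.values())
```

The averages come to 13 for the non-Marxian materialist variables, 16/3 (slightly more than 5) for the Marxian variables, and 16 for the mutual component, for a rounded total of 34.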


Conclusions 


This study empirically tested three major types of theories of gender inequality 
in preindustrial societies. Militarist theories received no support at all. We found 
no evidence that warfare lowers the status of women, as these theories suggest. 
Marxian theories received some support, with class stratification being a consistent 
determinant of the status of women. It is non-Marxian materialist variables, 
however, that consistently received the most support across all the bivariate 
and multivariate analyses. Women tend to do best in societies with modes of 
production that are relatively unintensive. When the mode of production is more 
intensive, women do better if they make an important contribution to agricultural 
production. Women do worse when the residence system is patrilocal and the 
descent system patrilineal. 

Issues of collinearity among the independent variables make it difficult 
to determine exactly which specific non-Marxian materialist variables are the 
strongest determinants of gender inequality. Nevertheless, we can conclude 
that it is non-Marxian materialist theories that provide the best explanations of 
the variations in gender inequality from one society to another. Our research 
has been guided by a philosophy of science that we take to be the norm in the 
natural sciences, if not always in sociology. This philosophy emphasizes the 
idea of parsimonious explanation and comparative theory testing (Laudan 1977). 
Parsimonious explanations are those that explain the most with the least; the fewer 
the concepts and assumptions that are needed to explain a given phenomenon 
or category of phenomena, the better the explanation. Explanations can be made 
parsimonious only by testing competing theories comparatively, that is, against 
each other as well as against a body of data. The objective of such testing is for a 
single explanation, or a single type of explanation, to survive. Such an objective 
does not preclude building synthesized or integrated models or explanations 
when the data suggest that this is what is needed. To some extent that is needed 
in research on variations in gender inequality, since Marxian explanations do 
receive some empirical support in the present research. But such a synthesized 
model can be a goal only for future research; we are not able to accomplish 
that here. This future research might also want to consider additional material 
factors, especially those involving sexual politics and the gender organization of 
reproduction, as suggested by Collins et al. (1993). 

Naturally, the present study has certain limitations, the most significant of 
which is probably the relatively weak level of measurement of the explanandum. 
We used three measures of the status of women in addition to a composite 
measure, but these were not ideal measures of gender inequality. The interitem 
correlations within each measure were relatively low, and the measures correlated 
only weakly among themselves. Nonetheless, the SCCS is the only cross-cultural 
sample that we know of that includes gender status variables of any type, and 
we used the best gender status variables that were available. Moreover, random 
measurement error reduces the size of statistical relationships. With more reliable 
measures, the relationships we observed would very likely be stronger. 
Abstract 


China's class structure is changing dramatically in the wake of post-1978 market- 
oriented economic reforms. The creation of a mixed "market-socialist" economy has 
eroded the institutional bases of a cadre-dominated social hierarchy and created 
conditions for a new pattern of social stratification. Although conditions remain 
dynamic, results of a 1998 urban survey that measured strength and diversity 
of social ties among 400 households in four of China's largest cities documented 
networks of social exchange among 13 occupation-based classes that identify a class 
structure distinct from the cadre-dominated social hierarchy of the Mao era. In 
particular, analysis of visiting during the Lunar New Year celebration suggests an 
urban society simultaneously divided along two axes: one by economic success in 
the more privatized economy and one by distinctions in political authority at the 
workplace. Thus contrary to those who privilege market transactions as the primary 
engine for creating a new class hierarchy, we conclude that to understand processes 
of social stratification one needs theories and methods that work simultaneously 
with multiple dynamics of class differentiation rather than presuming linear 
hierarchy. 
Greater reliance on market transactions and privatization of ownership in China 
since 1978 have created conditions for a process of social class formation that 
differs from the status hierarchy of the state socialist era. Before market-oriented 
reforms, a politicized stratification regime anchored in a state redistributive 
economy divided families and individuals into qualitatively distinct status groups 
with little intragenerational mobility (Davis 1985; Kraus 1981; Walder 1986; 
Whyte 1975; Whyte and Parish 1984). However, the mixed "market socialist" 
economy created in the two decades after 1978 fundamentally eroded the 
institutional bases of the previous status hierarchy (Bian 2002) and supported 
dynamics of social stratification that gravitate around variation in market position 
(Davis 1995). Although this emerging post-socialist system continues to evolve, 
it is possible to identify persistent behaviors and patterns of asset accumulation 
across occupational groups that document a post-socialist social order. 

In the earlier state redistributive economy (Szelényi 1978), political power 
and position conferred material privileges (Walder 1985) and distinguished cadre 
officials not only from industrial workers (L. Li 1995) but also from managers 
and professionals (Davis 2000). Thus researchers initially hypothesized that 
greater reliance on market exchange and privatization of property would weaken 
cadre privilege and advantage (Nee 1989). To date, the empirical support for this 
position has been mixed. While some have found that the economic advantage 
of cadre status has declined in the new era of market socialism, others have 
documented consistent financial returns to political power and position (see 
Bian 2002; Nee and Cao 2000; Nee and Mathews 1996). In this article, however, 
we do not address decline in the relative position of cadre officials but rather 
focus on how marketization and privatization are shaping networks of exchange 
across occupational classes and how a study of social networks speaks to broader 
sociological debates about class formation and networks. 


Theoretical Considerations 


Focusing on class and networks offers two distinct but complementary approaches 
to the study of social structure. Social class, however defined, is about the 
distribution of people across positions of distinction. This positional approach 
depicts society as a structure consisting of disconnected groupings of people that 
have distinct interests and possess different resources. Positional class analysis has 
identified property, authority, and skill as distinctive resources that define social 
class boundaries in contemporary capitalist societies (Wright 1997). By contrast, 
a relational approach that focuses on networks between groups and individuals 
builds on positional class categories but concentrates on identifying patterns 
of social relationships that assess the strength of social ties within and among 
different social positions. Social network analysis addresses the dynamic process 
of resource mobilization that is a key mechanism of social stratification. 
In the pre-reform era, the class structure of urban China was relatively 
straightforward. At the top stood a small group of "state cadres" (guojia ganbu) 
who occupied prestigious administrative, managerial, and professional jobs 
and represented about 5% of the total workforce and about 20% of the urban 
labor force. These individuals were provided with above-average compensation 
packages (Walder 1995) and were kept in reserve for training and promotion into 
leadership positions (about 2%) in party and government offices (Zhou 2001). In 
contrast, those who began their careers as blue-collar workers (gong ren) rarely 
were promoted to positions of authority as a state cadre (Bian 1994:140-41). 

After twenty years of increased marketization, the bureaucratic command 
economy no longer monopolized the key resources of the urban economy or 
dictated the reward structure. As a result, by the late 1990s the class structure 
of urban China had become more fragmented and less clearly hierarchical than 
in the late 1970s and early 1980s (Bian 2002; So 2003). Significant debate has 
arisen about the degree to which market reforms have created a fundamentally 
new order, and in particular about whether significant erosion has occurred in 
the privilege and superior social status of the cadre official class in comparison to 
entrepreneurs and intellectuals and whether inequality in the rewards of manual 
and nonmanual employment has significantly increased. 

For example, many scholars find that former and current cadre officials 
and state managers have continued as a distinct class via a process of 
embourgeoisement. Nee and Lian’s (1994) opportunism model captures some 
features of an embourgeoisement process through which administrative and 
managerial cadres give up political commitments in order to catch opportunities 
in a growing marketplace. So (2003) argues that the statist society is the trademark 
of China’s reforms and that only the cadres were in a historically strategic position 
to develop a capitalist economy. Oi (1992) was impressed by the rise of “local state 
corporatism” during the 1980s, under which local governments became industrial 
firms while local officials either made capitalism “from within” (Walder 1994) 
or created “network capitalism” (Boisot and Child 1996) by taking advantage of 
their political and social capitals (D. Goodman 1996). During the second decade 
of reform, “informal privatization,” organizational proliferation, consortium 
building, and a system of “one manager, two businesses” massively diverted assets 
and profits of state enterprises into the private hands of cadres (Ding 2000a, 
2000b; Duckett 2001; Nee 1992; Nee and Su 1998). Most recently a state-imposed 
property rights reform has permitted administrative and managerial cadres to 
become shareholders in the transformed state enterprises (Zhang 2000). 

Clientelist relations between the new capitalist entrepreneurs and state officials 
have created a second process that allows the persistence of cadre advantage 
but marginalizes the new business class (Wank 1999). Nationally in 1997 there 
were more than 2 million registered private entrepreneurs, who hired 12 million 
workers (SSB 1998:49). But these “business elites” appear to be weak politically, 
having no incentive, no autonomy, and no class capacity to work for the cause of 
a democratic state and politics (Pearson 1997). Rather, despite Jiang Zemin's call 
to recruit party members from all social strata, including private entrepreneurs, 
wealthy entrepreneurs seem satisfied with the “symbiotic clientelisms” that Wank 
(1999) observed in the first decade of reform and remain unlikely to become 
part of either the bourgeois cadre class (So 2003) or a dominant class themselves 
(L. Li 1995). Over time, entrepreneurs may become incorporated politically, but 
because of the political marginality of the business class, we suspect that they 
have thus far remained a class distinct from cadres. 

Positional analysis similarly documents an ambiguous class status for 
professionals, cultural elites, and technocrats, who had comprised the socialist 
intellectual class (Kraus 1981). Intellectuals lost their slight autonomy in the 
early 1950s when their occupations required them to be state cadres (Davis 
2000). During the Cultural Revolution (1966-76) intellectuals were actually 
degraded as Mao’s “stinky old ninths” (chou lao jiu), ranking last among all 
nine “black” categories. They were flattered when given a “working class” status 
by Deng Xiaoping in 1979; with that labeling, intellectuals finally had become 
a “revolutionary” class in the reform era (Huang 1993). But while educational 
credentials granted prestige, only a successful political screening granted them 
the best material conditions and workplace authority (Walder 1995; Walder 
et al. 2000). Huang (1993) describes post-reform Chinese intellectuals as 
divided between “in-institution” and “out-institution” groups, depending on 
whether they work primarily within the state sector or outside it. However, this 
institutional boundary implies no anticipation that “out-institution” intellectuals 
are “autonomous humanists” (zi you wen hua ren) with an independent class 
identity. 

In the Mao era, the urban proletariat was officially and politically recognized 
as a “leading class” (ling dao jie ji), and industrial state workers were also seen as 
“quasi-middle class” (Q. Li 2001). Post-1978 reforms have eroded this high status 
recognition and exposed manual workers to fierce economic competition in the 
newly commodified urban labor markets. By the late 1990s the Maoist working 
class was differentiated into four groups: employees in state-owned enterprises with 
multiyear contracts (70 million in 1998), migrant peasant labor in both state and 
private enterprises (60 million), laid-off labor from former state jobs (30 million), 
and daily wage labor in the private sector (12 million) (Zhang 2000:30). While 
state properties became productive assets for officials’ and managers’ private gains 
(Y. Lin and Zhang 1999; Y. Lin 2000), unprotected state manual labor began to 
experience truly proletarian (wu chan zhe) conditions. A new urban poverty 
stratum emerged among laid-off workers, retired workers, and migrant “peasant 
workers” (Zhang 2000), and labor opposition became a sensitive and serious issue 
in the changing structure of state and society (Chan 1996). 
Migrant peasant labor is both institutionally distinctive and diverse in class 
position. Having no official permanent resident status in the cities, members 
of this class are called “peasant workers” (nong min gong) by urbanites but are 
confined to a secondary labor market of heavy-duty, poorly paid, and temporary 
jobs, living in conditions of direst hardship (Pun 1999). Yet some migrant workers 
become small-time entrepreneurs who run family businesses in the service sector 
or manage construction subcontracts through which to exploit fellow migrants 
from their hometown area. After gaining both business skills and financial capital, 
they may return home to start a new business or become assimilated into urban 
life as long-term residents (Ma 2001). On the whole, a positional analysis should 
forfeit the assumption that migrant labor constitutes a single class and instead 
consider specific occupations and employment conditions when categorizing 
class positions of individuals and households in this group. 


Relational Analysis 


The relative distribution of material rewards and political power in a society can 
be analyzed to discern a rank order of class positions within it. But some theorists 
and researchers have urged a broader perspective on class analysis (e.g., Erickson 
1996, 2001; Giddens 1980; Portes 2000; Wright and Cho 1992). The hallmark of 
this relational analysis is to look beyond position alone to social networks in order 
to explore relationships and social distance between positions. As Portes argues, 
“Classes are defined by their relationships to each other and not simply by a set 
of ‘gradational’ positions along some hierarchy. In this sense, status rankings are 
a manifestation, not a defining feature of class” (259). 

Any number of relationships could be explored in such an analysis. Social 
network analysts have drawn attention to “friendship networks” (Fischer 1974; 
Laumann 1973), “discussion networks” (Burt 1984; Marsden 1987), and, more 
recently, “social support networks” (van der Poel 1993). This article focuses on 
“Chinese New Year visitor networks.” By long tradition, the Chinese make visits 
to the homes of close social contacts during a weeklong celebration of the New 
Year, the most important festival of the entire year. This deeply entrenched 
custom provides sociologists with an ideal opportunity to evaluate the scope and 
intensity of social ties across relational types and occupations of the visitors and 
their hosts (Bian and Li 2001). Capitalizing on our data about occupations of 
heads of interviewed households and those of their New Year visitors in 1998, 
we now venture to assess the strength and diversity of social ties within and 
between different occupational classes and to gauge the social distance among 
these classes. 

While social relationships are not as dense between classes as within them, 
the extent of between-class relationships may vary from one pair of classes to 
another. From a social network perspective this variation speaks directly to the 
underlying character of social structure. Not only does network analysis estimate 
social distances between class positions, but it also identifies which classes are 
socially central and which are marginalized or isolated (Wright 1997). In short, 
analysis of social ties helps establish an image of association among social classes 
across the entire social structure. 

From a social network perspective, one could posit three theoretically 
distinctive patterns of association. The first is one of class segregation, in which 
members of the same class associate exclusively with other members of their class, 
resulting in high levels of social segregation. One possible cause of this pattern 
might be that political, economic, and social resources are so highly segmented 
along class boundaries that between-class association either produces no gains 
or even creates disadvantages. Such a scenario can be seen in the Cultural 
Revolution: people with “red class labels” (workers, poor peasants, “revolutionary” 
cadres, military officers) would not associate with people with “black class labels” 
(landlords, rich peasants, capitalists, antirevolutionaries, “revisionist” intellectuals) 
for fear of negative consequences (Whyte and Parish 1984). In the uncertain 
environment of more recent radical socioeconomic and political transformation, 
that politicized segregation appears to have been replaced by a new dynamic of 
segregation whereby occupational groups — with similar economic interests 
— form coalitions to ensure that others do not capture their resources and 
privileges (Sun 2002). 

The second and third patterns of association are class diffusion and class 
hierarchy. Both relate more to cross-class ties than to class segregation, but with 
different dynamics of exchange. In the class diffusion model, cross-class ties are as 
likely between class positions as within social classes; in the class hierarchy model, 
ties are as likely with members of "adjacent" classes as within class positions, but 
less likely with those significantly “higher” or “lower” in the status hierarchy. 
The class diffusion model suggests high degrees of social inclusiveness even as 
income and wealth become more unequally distributed. The class hierarchy model 
suggests either a “class-for-network” process, by which variation in assets drives 
social networking, or a “network-for-class” process, by which networking for 
access to resources dominates — as some have argued it did in pre-Communist 
Chinese society (Fei [1949] 1992; Fried [1953] 1969). 
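
The three patterns can be contrasted as idealized matrices of relative tie likelihood among ordered classes. The Python sketch below is a hypothetical illustration only: the function name, the scaling of within-class ties to 1, and the value assigned to ties between distant classes are assumptions introduced here, not quantities estimated in this article.

```python
def association_matrix(pattern, n_classes, distant=0.2):
    """Relative likelihood of a tie between ordered classes i and j.

    Within-class ties are scaled to 1. `distant` is an arbitrary,
    illustrative value for non-adjacent classes under class hierarchy.
    """
    matrix = []
    for i in range(n_classes):
        row = []
        for j in range(n_classes):
            gap = abs(i - j)
            if pattern == "segregation":
                # Ties occur only within the same class.
                value = 1.0 if gap == 0 else 0.0
            elif pattern == "diffusion":
                # Cross-class ties are as likely as within-class ties.
                value = 1.0
            elif pattern == "hierarchy":
                # Adjacent classes match within-class levels; others are rarer.
                value = 1.0 if gap <= 1 else distant
            else:
                raise ValueError(f"unknown pattern: {pattern}")
            row.append(value)
        matrix.append(row)
    return matrix

segregation = association_matrix("segregation", 3)
diffusion = association_matrix("diffusion", 3)
hierarchy = association_matrix("hierarchy", 4)
```

Under segregation the matrix is diagonal, under diffusion it is uniform, and under hierarchy it is banded around the diagonal, which is the contrast an observed visiting matrix would be compared against.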

Recently the study of social class has been enriched by a new version of an 
old debate, centering around realist (often micro-level) versus nominalist (often 
macro-level) approaches to class. Arguing from a realist perspective that "the 
simple fact" is that “detailed occupations are deeply institutionalized,” Grusky, 
Weeden, and Sorensen (2000:303) assert that the analyst who ignores fine-grained 
occupational categories “may be compared to the feudal scholar who eschews 
the ‘surface categories’ of serf, lord, and clergy in favor of uncovering latent or 
subterranean structures that are concealed from ordinary view.” From the other 
side of this debate, Portes charges that “calling occupations ‘classes’ leaves no baby 
after the bath water” (2000:257). In his view “classes are theoretical constructs 
devised for the structural interpretation of social phenomena and the prediction 
of major long-term trends” (257, original emphasis). He advocates identifying 
“the ‘real’ class structure” prior to empirical analysis in order to apply the class 
concept subsequently to the analysis of various phenomena (257). 

It is beyond the scope of this article to resolve such long-standing debates as 
these. Nonetheless we wish to demonstrate one research procedure that could 
integrate elements of both nominalist and realist class analysis. First, in the next 
section, we derive a number of (nominalist) hypotheses, each containing a claim 
about the "real" class structure of urban China. Each hypothesis is set forth as 
a distinctive claim about groupings of occupations. Then we present a model 
that allows us to test how well each of these claims captures the social-network 
structure by comparing frequency of New Year visits among households headed 
by 13 different occupations, representing 80% of all urban jobs in the national 
census of 2000. 


Research Hypotheses 


Because there is little research on how social ties form among Chinese urbanites, 
our hypotheses necessarily build on established scholarship rooted in Western 
societies and are provisional. They also take account of varied explanations 
about how expressive ties (such as friendship) formed in one arena continue as 
participants enter into another. For example, friendships formed on the job or 
within a family persist after one changes jobs or marries. Alternatively, control 
over resources and shared interests may also produce ties. You become friends 
with people like yourself, or you keep close ties with a relative whose similar 
social position wins your favor. Our expectation is that both sets of influences 
are important but produce different patterns of within-class and across-class 
associations. Shared resources and interests produce intraclass ties; expressive 
ties can generate interclass interaction. 

On the basis of prior research and our observation of the emerging social 
class order, we propose four hypotheses about patterns of social relationships 
among different occupational classes to guide our data analysis. 


HYPOTHESIS 1: A CLASSLESS SOCIETY 


One possible starting point in empirical analysis about China is to assume that 
Maoism created a state of equality, such that occupations are merely job duties 
that do not imply social class distinctions. In other words, the destratification 
process of the Cultural Revolution (Parish 1984) might have persisted through 
time, and the underlying distinctions (on the basis of property, authority, and 
skill) that emerged in the reform era have not yet created distinctive patterns 
of social relationships among cadre officials, managers, professionals, manual 
workers, new entrepreneurs, and any other occupational groups. If this hypothesis 
is true, the social patterning of New Year visits will be random with respect to 
the occupation of the host (head of household). 


HYPOTHESIS 2: MANUAL VERSUS NONMANUAL DICHOTOMY 


The classless assumption is probably highly unrealistic in reform-era China. 
As many researchers have documented (Bian 2002), variations in ownership 
of property, authority on the job, and occupational skill have created unequal 
distribution of positional power, social prestige, and economic rewards in 
contemporary China. Manual, blue-collar workers increasingly have substantially 
fewer of these resources than those in nonmanual, white-collar jobs, who have 
greatly benefited from the reforms. Over time those engaged in manual labor 
have experienced increased levels of anxiety, deprivation, isolation, and separation 
from the society, causing social unrest and generating movements of resistance 
to the regime (Chan 1996; Whyte 1999). Plausibly, such differences in resources 
and concomitant feelings of alienation might translate into greater social distance 
between blue-collar and white-collar workers. If this hypothesis is true, we will see 
a manual/nonmanual dichotomy in visitation patterns among urban households 
as well as propensities of segregation within each broad class category. 


HYPOTHESIS 3: DUAL ELITES 


Nonmanual, white-collar occupations may not form a class category that is 
uniform with respect to New Year visitation behavior. Research on contemporary 
capitalist societies indicates that although acquired skill presents people with 
labor market opportunities and grants them enhanced social prestige, it does 
not necessarily extend to imply property control or managerial authority 
(Wright 1997). In China, skill and authority are very different resources: skilled 
and prestigious professionals are a different elite group from cadre officials, 
administrators, and managers. These latter groups not only have social prestige; 
more importantly, their political, administrative, or managerial authority offers 
material rewards that professionals do not have (Walder 1995; Walder et al. 
2000). This dual elite model implies that skilled professionals (e.g., scientists 
and researchers, college professors, elementary and high school teachers, doctors 
and nurses) may form a distinctive class. If this hypothesis is true, our analysis 
will show skilled professionals visiting more among themselves than with either 
manual laborers or other nonmanual workers. 
HYPOTHESIS 4: THE MANAGERIAL CLASS 


Because of their primary interests in the growing market economy, managers have 
emerged as an increasingly powerful class. Initially they benefited from managerial 
reforms within the state sector (Naughton 1995), and "informal privatization" 
gradually gave managers the power of exclusive control over enterprise operation 
(Ding 2000a, 2000b). Since 1998 new legislation has routinized the status 
quo of managerial control and confirmed income rights over properties in 
state, collective, and hybrid enterprises (Y. Lin 2001). By the late 1990s these 
transformations had created an expanded managerial class consisting of three 
strategic players in any business enterprise: (1) general managers, who assume 
the overall managerial authority and decision-making power of an enterprise; 
(2) accountants, who manage enterprise budgets and exercise the power of 
manipulating accounting books in order to safeguard against any inside or outside 
uncertainties of a transitional economy; and (3) sales and marketing managers, 
who explore and secure business opportunities for the enterprises. Inside and 
outside an enterprise setting, these three business players are highly dependent 
upon each other not only for career advancements but, more importantly, for 
minimizing risks that arise from the high uncertainty and transitional nature of 
China's emerging market economy (Y. Lin 2001). Sociologically, therefore, these 
three business players both share a managerial class interest and tend to form a 
clique of trust and social protection to safeguard their positions. If this hypothesis 
is true, we may expect that households headed by members of the managerial 
class will have significantly more New Year visitations among themselves than 
with households headed by other white-collar or blue-collar classes. 

Hypothesis 4 predicts that different levels of New Year visitation will emerge 
among four classes: managers (general managers, accountants, sales and marketing 
managers), administrator-officials (cadre officials, legal administrators, the police, 
administrative and clerical workers), skilled professionals (all other white-collar 
jobs), and workers (all manual, blue-collar jobs).¹ 


Data and Methods 


The data used in this analysis were collected during a yearlong interview project 
designed to explore living standards and social life among several occupational 
groups for whom the market reforms had brought differential income rewards. 
Specifically, we used a panel of 400 households that were headed by cadre 
officials, state enterprise managers, professionals, industrial and service workers, 
and migrant workers. All households were located in four of China's largest 
metropolitan areas (Shanghai, Shenzhen, Tianjin, and Wuhan), evenly distributed 
into samples of 100 households per city. To ensure having a sufficient number of 
households for analysis in each occupational category, we used a quota sampling 
Table 1. Occupational Samples as Percentages of China's Fifth Population Census in 2000 

Occupation                 Urban total      4 cities      Shanghai      Shenzhen       Tianjin         Wuhan 
 1. Cadre                           .8   [illegible]   [illegible]   [illegible]            .6            .6 
 2. Manager                        4.3           4.2   [illegible]           3.6           5.4           5.6 
 3. Scientist/professor             .8            .8           1.0            .4           1.2           1.6 
 4. Teacher                        2.4           1.6           1.7            .5           2.5           2.6 
 5. Professional                   3.0           3.4           3.8           2.3           4.1           3.5 
 6. Doctor/nurse                   2.4           1.8           1.8            .6           2.8           2.7 
 7. Accountant/sales               6.3           5.6           6.3           3.2           7.6           6.3 
 8. Legal/police                   1.0   [illegible]            .9            .3           1.6           1.0 
 9. Admin/clerk                    8.2           9.1          11.8           5.2          10.8           8.4 
10. Industrial                    38.1          42.1          36.2          60.2          35.1          30.0 
11. Chauffeur                      5.3           4.6           5.0           2.6           6.3           5.6 
12. Service/commercial            27.2          25.3          26.9          21.0          21.9   [illegible] 
13. Domestic worker                 .3   [illegible]            .6   [illegible]            .2            .3 
All occupations                  100.0         100.0         100.0         100.0         100.0         100.0 

Total labor force (N)      147,298,220    17,663,160     6,883,742     5,187,079     2,919,739     2,672,600 

All occupations as % 
of total labor force              79.6          87.9          89.8          93.1          89.4          76.1 

procedure that oversampled cadre officials, state managers, and professionals and 
undersampled industrial and service workers and migrant workers. To avoid 
subjectivity, we adopted random procedures for sampling four neighborhoods in 
each city and for sampling households within each occupational cluster. (These 
clusters were developed from household registries of the sampled neighborhoods 
that listed occupation for head of household.) In terms of specific occupations 
itemized in the China 2000 Census, our samples captured household heads in 
88% of all occupations in these four cities, and 80% of all jobs in urban China, 
as shown in Table 1. This comprehensive coverage of all urban occupations gives 
us confidence that our samples can legitimately serve to test hypotheses that may 
apply more generally to trends across urban China. 

Interviewers visited each household four times at approximately three-month 
intervals between January 1998 and January 1999. In addition to these home 
interviews, in which spouses were interviewed separately for between 60 and 90 
minutes, each spouse completed two daily logs, the first during the Spring Festival 
of 1998 for Chinese New Year social interactions and the second in May 1998 for 
social eating. Our analysis here uses the material from the logs kept during the 
six days of the New Year celebration (the last day of the old year and the first five 
days of the new). One could, of course, rely on comparisons of many different 
social interactions to assess scope and strength of relational ties, but because of 
the centrality of socializing and exchange of greetings during the New Year for 
households of all income levels across the country, we find that the New Year 
celebration offers a particularly good focus for an initial study of class variation 
in associational dynamics. 

In Chinese cities, as well as rural villages, social gatherings and visitations 
are important occasions for individuals and families to cultivate, maintain, 
develop, or terminate social ties (Bian 1997; Yan 1996; Yang 1994). For example, 
receiving an invitation to a wedding, funeral, birthday party, or feast signals 
the recipient's membership within the host's social networks (known locally as 
guanxi) (Bian 2001). Of the many significant social occasions that pattern social 
exchanges, the Spring Festival (Lunar New Year) is most widely observed across 
all regions in China. Traditionally the celebration starts on New Year's Eve and 
lasts for fifteen days, each day reserved for a culturally significant event or meal. 
Since 1949 the Communist government has granted urban workers a weeklong 
official holiday. In recent years telephone greetings have become popular, but 
exchanges of visits continue to be among the core social activities of the season. 
Although not every visitor is necessarily a core social tie, our broad observation 
indicates that one's core ties will be among the visitors: there is no good excuse for 
close relatives or friends not to visit each other during the New Year celebration 
(Bian and Li 2001; Bian, Davis, and Wang 2002). Thus, New Year visitation data 
are a strong event-based measure of membership, strength, and composition of 
core social networks among the Chinese, far more so than a survey of casual 
interactions among neighbors or a study of participation in associations at the 
neighborhood level such as Gould (1993) conducted in the Paris Commune. 
In our sample, households received an average of 17 visitors and 20 telephone 
calls, and of these greeters 44% were relatives, 25% friends, 20% colleagues, 8% 
neighbors, and 3% other close contacts. 

To record the occupational class of visitors to our sample households, we 
utilized an occupation-based position generator, a social network device that 
was invented for measuring the structure of an egocentric network (N. Lin and 
Dumin 1986). As with survey ratings of occupational prestige, this network 
device can include only a few job titles to cover a range of occupations in terms 
of prestige, socioeconomic status, or embedded resources. For the present study 
we initially selected 20 occupations to represent the range of prestige standings of 
urban occupations in China; for comparison purposes the selection was confined 
largely to job titles for which prestige ratings had already been collected in 
previous studies (Bian 1996; N. Lin and Xie 1988). We requested our responding 
households to record whether or not their New Year visitors had a job in any of 
the 20 job categories on the list. Seven job titles each received an insignificant 
total in frequency counts, and these were merged with others, resulting in the 13 
occupational categories shown in Table 1.⁴ Although these selected occupations 
do not represent the entire array of urban jobs, they do provide 80% coverage 
of urban jobs of the country and 88% coverage in the four cities in our sample 
(Table 1) and also capture underlying class positions along the dimensions of 
authority and skill. 

The occupations of the heads of sample households were organized into 
the same 13 occupational categories, and a 13x13 occupation matrix was 
generated to indicate how host households in each occupation exchanged New 
Year greetings with visitors (households or individuals) from each occupation. 
That matrix (Table 2) presents the data we analyze and report here.⁵ Table 2 is 
thus a contingency table using occupations as the units of analysis and counting 
the number of contacts between hosts and New Year visitors in terms of their 
respective occupations. 


MODELS AND ANALYSIS 


To assess the degree to which any one of our research hypotheses fits the data of 
Table 2, we employ Leo Goodman's (1981; Breiger 1981) modeling framework 
for assessing the homogeneity among categories in a contingency table. Within 
this framework, the most general model we will use is 


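\[
\begin{aligned}
\ln F_{ij} &= u + u_i^{R} + u_j^{C} + u_i^{D} + u_{ij}^{RC}, \quad \text{subject to} \\
u_i^{D} &= u_k^{D} \quad \text{for } i \in \text{class } k, \text{ and} \\
u_{ij}^{RC} &= u_s^{RC} \quad \text{for } (i, j) \in \text{subtable } s(k, l)
\end{aligned}
\tag{1}
\]
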
where the $F_{ij}$ are the cell counts estimated by means of the model,⁶ with subscripts 
i and j indexing occupational origins (rows) and destinations (columns) of New 
Year visitors, subscripts k and l indexing class origins and destinations of New 
Year visitors, and the u terms representing parameters to be estimated. If all the 
$u^{D}$ and $u^{RC}$ terms were set to zero, then equation 1 would simplify to the model 
of simple independence for Table 2 (with superscripts R and C indexing the table's 
rows and columns, respectively). If the $u^{RC}$ terms were set to zero and a separate 
$u^{D}$ were hypothesized for each occupation (superscript D indexing cells on the 
table's diagonal, reflecting New Year visits within occupations), then equation 1 
would simplify to independence for cells off the diagonal of Table 2, the so-called 
quasi-independence model. The final terms (those with superscript RC) pertain 
to off-diagonal interactions above and beyond the effects of the marginals; a 
separate interaction parameter is estimated for each of the subtables formed at the 
intersection of each pair of classes that are defined in one of our hypotheses. We 
provide an example of these $u^{RC}$ parameters, as well as the $u^{D}$, in Table 3, with 
respect to our hypothesis 4, which relates to owner-manager coalition.⁷ 

Notice (Table 3) that any two occupations placed by hypothesis within the 
same class are constrained, by application of Goodman’s model as given in our 
equation 1, to have identical intensities of interaction ($u^{RC}$ parameters) with all 
other occupations. (Compare, for example, the first two rows in Table 3, as well 
as the first two columns.) Breiger and Mohr (2004) build on this point to assert 
that Goodman's (1981) homogeneity model provides an operationalization of 
the social-network concept of structural equivalence (reviewed in Wasserman and 
Faust 1994) as applied to cross-classification tables such as the counts of New 
Year greetings given in Table 2. Thus, the model of equation 1 provides us with a 
means for furthering the relational analysis of class and networks in China that we 
reviewed above. Each of our research hypotheses can be assessed by measuring (by 
means of standard chi-square tests) how well the expected frequencies produced 
by the model (the $F_{ij}$) correspond to the observed data in Table 2. 

Conventional log linear class analyses often distinguish between within- 
occupational relations (for which the data lie on the principal diagonal of the 
contingency table) and relations within and between the class categories (macro- 
categories of occupations) that are postulated (for which the relevant data are 
all the cells in the contingency table, as illustrated above in Table 3). We follow 
this distinction, focusing first on within-occupational relations (in the first 
subsection below and in Table 4) and then on the full class structures that have 
been postulated (in the second subsection below and in Table 5). 


Results 


Hypothesis tests (1): Class-specific Effects on Visiting Others of the Same 
Occupation 


A test of hypothesis 1, the classless society hypothesis, is given in line 1 of Table 
4. Hypothesis 1 postulates no distinctions among positions or sectors in their 
relational propensities. It may thus be operationalized as the model of simple 
independence for Table 2. The independence model clearly does not fit these 
data (p = .02, line 1 of Table 4). 
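
As a minimal sketch of how such a test statistic is obtained, the following Python fragment computes expected counts and the likelihood-ratio statistic G² for the model of simple independence. The function name `independence_fit` and the 3x3 table of counts are hypothetical and serve only as an illustration; they are not the data of Table 2.

```python
import math

def independence_fit(table):
    """Fit simple independence to a two-way table of counts.

    Returns (expected, g2, df): expected counts built from the row and
    column margins, the likelihood-ratio statistic G2, and its degrees
    of freedom.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    expected = [[r * c / n for c in col_totals] for r in row_totals]
    # G2 = 2 * sum(observed * ln(observed / expected)); empty cells add 0.
    g2 = 2.0 * sum(
        o * math.log(o / e)
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
        if o > 0
    )
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return expected, g2, df

# Hypothetical host-by-visitor counts with a heavy diagonal (illustration only).
toy = [[30, 10, 5],
       [10, 25, 10],
       [5, 10, 40]]
expected, g2, df = independence_fit(toy)
print(f"G2 = {g2:.2f} on {df} df")
```

For a 13x13 table the same formula gives (13 - 1) x (13 - 1) = 144 degrees of freedom, which matches the df reported for the independence model in Table 4.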

The model of independence could be improved considerably by a model 
postulating the randomness of between-occupational moves (as the independence 
model does) but a uniformly higher propensity for visitations among households 
within the same occupation. This is the model Marsden (1981) termed "constant 
inbreeding" in his application of log linear contingency-table models to counted 
data on social networks. We may specify this model as 


\[ \text{Eq. (1) with } u_i^{D} = u^{D} \text{ and } u_{ij}^{RC} = 0 \tag{2} \]


postulating a within-occupation propensity for New Year visitation, but one that 
does not differ by occupation ($u^{D}$ is a constant). Table 4 indicates that this so-called 
constant-inbreeding model fits the data (line 2) and provides a significant 
improvement over the independence model (line 7), and the constant $u^{D}$ is 
Table 4. Models Pertaining to Class-Specific Propensities within 
Occupations for New Year Visits 

A. Models 

Line  Model                        Class specification                       df            G²            p 
1     Independence                 none                                      144           180.744       .02 
2     Constant by occupation       [1-13]                                    143           147.260       .37 
3     Separation of working class  [1-9], [10-13]                            142           138.636       .56 
4     Dual elite                   [1,2,7,8,9], [3-6], [10-13]               141           132.804       .68 
5     Managerial class             [1,8,9], [2,7], [3-6], [10-13]            140           131.955       .67 
6     Occupation-specific          [1], [2], [3], [4], [5], [6], [7], 
                                   [8], [9], [10], [11], [12], [13]          [illegible]   [illegible]   [illegible] 

B. Comparisons among models 

Line  Comparison   df   G²       p 
7     2 v. 1       1    33.480   <.001 
8     3 v. 2       1    8.628    .003 
9     4 v. 3       1    5.832    .016 
10    5 v. 4       1    .871     .351 
11    6 v. 5       9    9.281    .412 

estimated empirically as .457, which exponentiates to 1.58. The model of line 2 
(Table 4) thus suggests that New Year visitation is 58% more likely among greeters 
who share an occupation, in comparison to greeters of different occupations. 
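
The conversion from a log-linear parameter to a multiplicative effect is simple arithmetic, sketched here in Python. The helper name `multiplier_from_u` is ours for illustration; the only input taken from the analysis is the estimate of .457.

```python
import math

def multiplier_from_u(u):
    """Convert a log-linear parameter into a multiplier on the expected count."""
    return math.exp(u)

u_d = 0.457  # constant within-occupation parameter (line 2 of Table 4)
multiplier = multiplier_from_u(u_d)
percent_more = (multiplier - 1.0) * 100.0
print(f"{multiplier:.2f} times chance, i.e. {percent_more:.0f}% more likely")
```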

We now bring social class into this picture. The models of lines 3-6 of Table 
4 may be specified as 


\[ \text{Eq. (1) with } u_{ij}^{RC} = 0 \tag{3} \]


Hypothesis 2 postulates a dichotomy between manual and nonmanual 
occupations. Within the context of equation 3, this hypothesis mandates that 
the four manual occupations (those numbered 10-13 in Table 1) will have a 
class-specific propensity for New Year visitation, as will the nine nonmanual 
occupations (those numbered 1-9). The fit of this hypothesis (tested in line 3 of 
Table 4) provides a significant improvement over the constant inbreeding model 
(see line 8 of Table 4); we thus would prefer the class model of hypothesis 2. The 
$u_k^{D}$ parameters estimated for the model of line 3 are (in exponentiated form) 2.46 
for the manual working class and 1.40 for the nonmanual class, thus suggesting 
that manual workers are 2.46 times as likely as by chance to visit within their 
own occupations, whereas those in nonmanual jobs are 1.40 times as likely as 
by chance to behave similarly. These class-specific propensities are significantly 
different from one another (p=.003, line 8 of Table 4). 
Hypothesis 3 posits that the white-collar class is divided into a dual elite: 
skilled professionals (categories 3-6 in Table 2) and occupations based on 
administrative or managerial authority (categories 1, 2, 7, 8, 9). With respect to 
occupation-specific propensities for New Year visits, the model is tested in line 
4 of Table 4; it is seen to provide a significant improvement over the previously 
discussed models (see line 9 of Table 4). The four skilled-professional occupations 
are estimated to have a common propensity for New Year visitation with others of 
the same occupation (exponentiated $u_k^{D}$ = 1.93) that is significantly higher than 
the class propensity associated with other white-collar occupations (exponentiated 
$u_k^{D}$ = 1.20); the common class propensity for the manual working class estimated 
by this model (2.47) is similar to that estimated by the previous model. 

Hypothesis 4 postulates distinctions between managers (categories 2 and 
7) on the one hand and official-administrators (categories 1, 8, and 9), skilled 
professionals (categories 3-6), and workers (categories 10-13) on the other. 
Applied within the context of equation 3, this hypothesis does fit the data (line 
5 of Table 4), but it does not provide a significant improvement in fit over 
hypothesis 3 (p=.35, line 10 of Table 4). 

Just as lines 1 and 2 of Table 4 provided null models for our models (3, 4, 
and 5 of Table 4) of class-specific effects on the propensity for within-occupation 
New Year visits, line 6 provides a different sort of comparison. Suppose, still 
within the framework of equation 3, that each of the 13 occupations was in a 
class by itself. The result would be the model of quasi-independence (discussed 
previously). Although this model does fit the data (line 6 of Table 4), it does not 
provide a significant improvement over hypothesis 4 (p=.412, line 11) or over 
any of the other class models that we have introduced. Thus we have established 
(so far) that there are significant class-specific effects on propensities for Chinese 
householders to visit others within the same occupation. The most parsimonious 
model that we have formulated (so far) that fits the data is based on hypothesis 3 
(line 4 of Table 4), specifying classes of manual workers, skilled professionals, 
and administrative and economic managers and personnel. 
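
The p-values attached to the model comparisons in panel B of Table 4 can be checked from the G² differences and their degrees of freedom. The sketch below is one way to do so with the Python standard library only; the function `chi2_sf` is our own illustrative helper (an upper-tail chi-square probability built from the standard recurrence for the regularized incomplete gamma function), not the software used for the original estimation.

```python
import math

def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variate with integer df >= 1."""
    z = x / 2.0
    if df % 2 == 0:
        a, q = 1.0, math.exp(-z)              # Q(1, z)
    else:
        a, q = 0.5, math.erfc(math.sqrt(z))   # Q(1/2, z)
    # Recurrence: Q(a + 1, z) = Q(a, z) + z**a * exp(-z) / Gamma(a + 1)
    while a < df / 2.0:
        q += z ** a * math.exp(-z) / math.gamma(a + 1.0)
        a += 1.0
    return q

# (line in Table 4, df difference, G2 difference) as printed in panel B
comparisons = [(7, 1, 33.480), (8, 1, 8.628), (9, 1, 5.832),
               (10, 1, 0.871), (11, 9, 9.281)]
for line, df, g2 in comparisons:
    print(f"line {line}: G2 = {g2:.3f}, df = {df}, p = {chi2_sf(g2, df):.3f}")
```

A small p-value indicates that the less restricted model improves significantly on the more restricted one; by that criterion lines 7 through 9 are significant and lines 10 and 11 are not.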


Hypothesis Tests (2): Class-specific Effects on Visitation within and between 
Classes 


We now turn to equation 1 in its most general form, and we are concerned with 
the class patterning of New Year visitations (Table 5) in addition to the class 
differences manifest for within-occupation visiting patterns (Table 4). 
Hypothesis 1 has already been tested (in line 2 of Table 4). The first three lines 
of Table 5 provide tests of our other substantive class hypotheses, in the context 
of equation 1. Each of these models fits the data. Hypothesis 2, the separation 
of manual and nonmanual occupations (line 1 of Table 5), provides a significant 
improvement over hypothesis 1 (line 4 of Table 5). Hypothesis 3, the dual elite 
(line 2 of Table 5), significantly improves over hypothesis 2 (p=.042, line 5). 
Table 5. Models Pertaining to Class-specific Propensities for New Year Visits 
within and between Occupations 

A. Models 

Line  Model                        Class specification               df    G²        p 
1     Separation of working class  [1-9], [10-13]                    141   115.051   .946 
2     Dual elite                   [1,2,7,8,9], [3-6], [10-13]       137   105.120   .900 
3     Managerial class             [1,8,9], [2,7], [3-6], [10-13]    131   92.762    .995 

B. Comparisons among models, part 1 

Line  Comparison                   df   G²       p 
4     1 (Table 5) v. 2 (Table 4)   2    32.215   <.001 
5     2 v. 1 (Table 5)             4    9.931    .042 
6     3 v. 2 (Table 5)             6    12.358   .054 

C. Comparisons among models, part 2 

Line  Comparison                   df   G²       p 
7     1 (Table 5) v. 3 (Table 4)   1    23.585   <.001 
8     2 (Table 5) v. 4 (Table 4)   4    27.684   <.001 
9     3 (Table 5) v. 5 (Table 4)   9    39.171   <.001 



Hypothesis 4 (line 3), the manager class hypothesis, provides improvement over 
hypothesis 3 that is almost as great (p=.054, line 6), though clearly this p-value 
(as well as the one in line 5) is marginal at the .05 level. 

Panel C of Table 5 provides comparisons, within each substantive model 
of social classes, of the extent to which the fit of each hypothesis (2, 3, and 
4) is improved when applied to the class patterning of visitation (equation 1 
and Table 5) as distinct from class differences manifest for within-occupation 
visiting patterns (equation 3 and Table 4). The results on each line of Panel C 
are unambiguous: the class effects on New Year visitation patterns are significant 
above and beyond the class-specific within-occupation (diagonal cell) propensities 
investigated in Table 4. 
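
Because the models are nested, each comparison is the difference in G² and in degrees of freedom between a more restricted and a less restricted model. As a hedged illustration of that arithmetic, the fragment below recomputes several of the comparisons in Table 5 from the (df, G²) pairs printed in Tables 4 and 5. The dictionary layout and the function name `compare` are ours, introduced only for this sketch.

```python
# (df, G2) pairs as printed in Table 4 (lines 3-4) and Table 5 (lines 1-3)
table4 = {3: (142, 138.636), 4: (141, 132.804)}
table5 = {1: (141, 115.051), 2: (137, 105.120), 3: (131, 92.762)}

def compare(restricted, general):
    """Return (df difference, G2 difference) for two nested models."""
    df_diff = restricted[0] - general[0]
    g2_diff = round(restricted[1] - general[1], 3)
    return df_diff, g2_diff

print(compare(table4[3], table5[1]))  # line 7 of Table 5
print(compare(table4[4], table5[2]))  # line 8 of Table 5
print(compare(table5[1], table5[2]))  # line 5 of Table 5
print(compare(table5[2], table5[3]))  # line 6 of Table 5
```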


Summary 


What have we found so far? Occupational classes matter in structuring New Year 
visitation patterns in China. Our four hypotheses are nested and hierarchical, 
and each hypothesis adds to our understanding of these patterned relations 
among occupations. Occupations themselves matter (Table 4, line 2), in that 
households are more likely to visit others within the same occupation. But classes 
add significantly to this picture, both with respect to visits within the same 
occupation (Table 4, lines 3-5) and with respect to visits with households in 
other occupations (Table 5). There is an important separation of manual workers 
from other occupations (our hypothesis 2). There is a significant (p = .042, line 5 
of Table 5) additional separation among white-collar occupations into skilled 
professionals versus administrative and economic managers and personnel (our 
hypothesis 3). And there is a marginally significant additional separation (above 
and beyond hypothesis 3) between bureaucratic elites (cadres, legal workers 
and police, administrative clerks) and economic actors (general managers, 
accountants, sales and marketing managers) (our hypothesis 4, line 6 of Table 5). 
In post-reform China skill and authority are relevant to the structuring of 
occupational classes. 

Let us interpret the $u^{D}$ and $u^{RC}$ parameters for hypothesis 4 (the model 
of line 3, Table 5) that we introduced previously, in Table 3. Manual workers 
(those in occupations 10, 11, 12, 13) are 2.36 (= exponential of .859) times as 
likely as chance to greet others within the same occupation, and they are 1.50 
(= exponential of .408) times as likely as chance to greet households of other 
occupations within the same class. Similarly, those in the other classes postulated 
by hypothesis 4 are also more likely to conduct New Year visitation within the 
same class (though this is least true for managers). With two important exceptions 
(discussed below), the pattern of New Year visitation in Table 3 exhibits the 
“saddle” pattern well known to analysts of mobility tables: on the diagonal, the 
intensities decrease and then increase; off the diagonal, the intensities decrease 
in any direction. The model thus suggests an ordering moving from those with 
administrative authority (occupations 1, 8, 9) to those engaged in business 
management (2, 7) and skilled professionals (3-6), to workers (10-13). 

As noted, there are two exceptions to this “saddle” pattern. There are slight 
tendencies for workers to visit managers (exponential of .041 = 1.04 times chance) 
and for managers to visit skilled professionals (exponential of .133 = 1.14 times 
chance). The first of these is expected by the nature of guanxi connections: 
subordinates pay respect to their superiors by visiting the homes of the latter on 
the important occasion of the Chinese New Year celebration (Bian and Li 2001). 
The second of these reflects the other side of the guanxi connection: bosses extend 
their thankful greetings in person to their professional elites who, though having 
no managerial or administrative authority, are nonetheless the most respected 
socially; the New Year celebration is traditionally a strategic occasion for this 
gesture to occur. Most crucially, however, in all the postulated classes individuals 
and households are shown as more likely to visit others within the same class. 
Classes matter significantly to New Year visitors in China. That is to say, using 
the terminology of our introductory discussion of relational analysis, we find 
extensive tendencies toward occupational class segregation. We also find some 
examples of class diffusion (e.g., the tendencies for workers to visit managers, and 
for managers to visit skilled professionals) and class hierarchy (with reference to 
the “saddle” pattern identified just above). 


Conclusions 


Analysis of New Year visiting patterns in urban China indicates a dual dynamic 
of economic and political stratification. Increased marketization and privatization 
have simultaneously divided urban Chinese society along two axes: one by 
economic success in the more privatized economy, and one by distinctions in 
political authority at the workplace. Thus unlike those who privilege market 
transactions as the primary engine for creating a new class hierarchy, we observe 
processes of social stratification that are simultaneously political and economic. 
Therefore we propose research strategies that employ theories and methods that 
work simultaneously with multiple dynamics of class differentiation rather than 
presuming linear hierarchy. 

Specifically, the results of our data analysis support three empirical conclusions. 
First, households headed by manual workers have a distinctive pattern of social 
ties. In particular, members of these households concentrate their New Year 
visitations upon others in similar manual occupations and are relatively unlikely 
to cross occupational class boundaries to socialize with households headed by 
managers, white-collar professionals, or cadres. Thus our results suggest that 
after two decades of intensifying marketization and privatization not only have 
the economic rewards of manual employment in urban China declined (Bian 
2002; Chan 1996), but blue-collar workers may be increasingly isolated socially. 
Second, households headed by white-collar employees in state bureaucracies 
(such as cadres, legal workers, police officers, administrators, and clerks) also 
have occupationally distinctive patterns of association. During the New Year 
celebration these households tend to visit each other more frequently than to visit 
those in other job categories. Finally, we find a bifurcated segregation between 
households headed by white-collar professionals and those headed by managers 
that indicates the importance of distinguishing between political position and 
authority at the workplace among those with comparable incomes. 

These results are congruent with previous studies that have documented an 
increasingly differentiated reward structure since 1978 (Bian 2002; So 2003; Sun 
2002). However, they also go beyond earlier documentation of rising income 
inequality to demonstrate how the emerging market economy segments social 
networks. In particular they suggest a pattern of social segregation among 
households headed by manual workers who were “quasi—middle class” in the 
pre-reform years (Q. Li 2001). Through the late 1970s state industrial workers 
were provided with lifetime employment and an impressive array of collective 
benefits (Walder 1986); because of lifetime employment and enterprise-owned 
housing systems (Bian 1994) they frequently interacted on a daily basis with 
managers and professionals within their work units (Blau, Ruan, and Ardelt 
1991). However, by 1998 the basic conditions of employment and housing had 
changed. Lifetime employment had disappeared within the state sector, most 
new jobs were in the private sector, and most new housing had been privatized 
(Davis 2003). Our results suggest that these transformations of the socialist 
system may constrain socializing across the manual/nonmanual divide and that 
marketization and privatization may have socially isolated workers from those 
with authority and wealth. 

A second implication of these patterns of association is the emergence of 
bureaucratic elites as a distinct class. Most striking is the preference for intraclass 
socializing among government officials. This finding implies that while the 
cadre class still holds a central position in the distribution of resources and 
opportunities (Bian 2002), in a relational perspective political cadres may not 
have centrality in connecting different occupational classes in urban China. 

Finally, our results indicate a bifurcated professional/managerial class with the 
managerial group in a position of network centrality. That is, although managers 
are more likely than officials to socialize with the intellectuals (engineers, teachers, 
doctors and nurses, college professors, scientists), they also demonstrate a 
tendency to be visited by manual workers and to visit skilled professionals. Thus 
it appears that not only do managers in state and non-state business enterprises 
have superior income and authority, they also experience advantageous social 
and networking power. 

Our small sample and our focus on only one type of socializing necessarily 
render our results preliminary. Only further research can verify the relevance of 
these conclusions to the entire urban population and to occasions other than 
visitations during the Chinese New Year celebration. We also need to apply 
alternative measures of network ties to assess the comparative and theoretical 
significance of the position-generator device. For example, would analysis 
of interclass marriage and intergenerational mobility confirm or reject the 
hypotheses in the same way as this study of New Year visits? Finally, we need 
to extend the relational approach to include analysis that examines property 
assets as well as the authority and skill differentials among occupations. Until 
such extensive materials are available, however, the occupationally segmented 
socializing of the 400 households in our study provides working hypotheses for 
subsequent research. 


Notes 


1. Data limitations prohibit us from explicitly testing any hypothesis that requires information 
about property ownership as an underlying dimension of class boundary. 


2. We fully understand that in New Year social exchanges, younger persons are likely to visit 
older persons among family and kin, subordinates to visit superiors among close colleagues, 
and low-status friends to visit high-status friends, whereas the reverse is less likely to occur. 
Our purpose here, however, is to show the propensities of within- and between-occupation 
linkages, and within- and between-class linkages, as indicated by visits of relatives, friends, 
and other contacts to the interviewed households. Such visits imply the presence of a network 
connection between visitors and members of the interviewed households. 


3. Our object is to explore occupational and class boundaries as implied in the patterns of 
New Year visitation. Because colleagues are only one-fifth of New Year visitors in our sample, 
the proposition that the workplace, rather than occupation or social class, is the mechanism 
for social interaction may be only partially viable at best. Unfortunately, because our survey 
did not record information about the extent to which such ties were with colleagues in 
respondents’ own workplaces, we cannot test this alternative interpretation. 


4. The merged categories and their serial numbers as shown in Table 1 are: (2) enterprise 
and nonprofit organization managers, party and mass organization leaders; (3) scientists and 
researchers, college teachers; (4) elementary school teachers, high school teachers; (6) doctors, 
nurses; (7) sales and marketing managers, accountants; (8) legal workers, police officers; (12) 
cooks, restaurant wait staff. 


5. Using the occupation of the household head as the indicator in this contingency table 
analysis amounts to an assumption of occupational and class homophily for all family 
members in the household. We tested this assumption by using husbands as the unit of 
analysis and found that the estimated results showed no difference at all from the ones 
reported in Tables 2-5. Because a substantial number of wives were homemakers, an 
occupation that is not provided for in visiting parties, we abandoned the possibility of using 
wives as the unit of analysis. 


6. Estimation of this log-linear model is straightforward (e.g., Hout 1983:72-76). 


7. Specifically, these are the parameter estimates for the model at line 3 in Table 5. 


With a Little Help from My Friends (and My 
Financial Planner)* 


MARIKO LIN CHANG, Harvard University 





Abstract 


Despite the tremendous implications that financial decisions have for socioeconomic 
well-being, the study of financial decision-making has been left largely to economists. 
This paper places this topic firmly within sociological terrain and demonstrates 
that the search for financial information is embedded within broader systems of 
social inequality. Analyses of data from the 1998 Survey of Consumer Finances 
reveal that social networks are by far the most frequently used source of saving and 
investment information; however, they are used most often by those with the least 
wealth. Wealthier households are more likely to turn to paid financial professionals 
and to certain forms of media for saving and investment information. Results 
indicate that those at the top of the socioeconomic ladder do gather information 
from multiple sources possibly to minimize the risk of making a poor decision; yet 
as socioeconomic status increases, networks are decreasingly likely to be among the 
sources consulted. 


Whereas the study of financial decision-making has largely been left to economists, 
social structures such as networks have been shown to be conduits of financial 
information (Baker 1984; Uzzi 1999) and networks have been shown to influence 
the nature of market transactions (Abolafia 1996; DiMaggio and Louch 1998; 
Podolny 2001). One key area of economic decision-making that has been largely 
neglected by sociologists is the potential role of social networks in shaping saving 
and investment decisions. Yet, the choice of saving and investment vehicles 
may have a tremendous impact on the eventual rate of return and, hence, the 
accumulation of wealth across households. During the 1989-1998 period, for 
instance, stocks had an average annual rate of return of 10%, bonds 4.7%, and 
savings accounts, money market funds, and certificates of deposit combined only 
1.4% (Wolff 2000). The difference in average rate of return between stocks (a much 
riskier asset) and savings accounts (a much safer asset) is dramatic, but consequential 
differences in rates of return also exist among low-risk assets such as bank 
accounts, money-market deposit accounts and certificates of deposit (CDs). 
Analyses of Federal Reserve data reveal that on an aggregate level, the American 
public is losing $30 to $50 billion every year in interest by relying heavily on bank 
accounts for saving money instead of other vehicles such as CDs or money-market 
deposit accounts that are just as safe (Reuters 2000). 
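
To make the stakes of these return differences concrete, the following sketch compounds a hypothetical $1,000 at the three average annual rates quoted above (10%, 4.7% and 1.4%) over ten years. It assumes a constant annual rate and annual compounding, which the quoted averages do not guarantee; the principal and the function name `future_value` are illustrative assumptions only.

```python
def future_value(principal: float, annual_rate: float, years: int) -> float:
    """Value of `principal` after `years` of annual compounding at `annual_rate`."""
    return principal * (1.0 + annual_rate) ** years


# Average annual rates of return for 1989-1998 as quoted in the text (Wolff 2000).
RATES = {"stocks": 0.10, "bonds": 0.047, "savings, money market, CDs": 0.014}

if __name__ == "__main__":
    for asset, rate in RATES.items():
        # Hypothetical $1,000 principal held for 10 years (an assumption).
        print(f"{asset}: ${future_value(1000.0, rate, 10):,.2f}")
```

Under these simplifying assumptions, the stock holding ends the decade at more than twice the value of the savings holding.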

In a perfect world, information about each asset type and savings option 
would be available to all, regardless of ascribed characteristics. However, the 
flow of financial information is imperfect. Financial information is complex, 
financial knowledge may require a substantial investment of time or resources 
to acquire, and separating the relevant information from the irrelevant (or even 
inaccurate) information is difficult. Under such circumstances, people are likely 
to seek additional information or assistance from others when making financial 
decisions. 

This article explores how socioeconomic status influences whether people use 
social networks to search for savings and investment information, or whether they 
prefer alternative sources of information. It begins with an overview of the reasons 
why people may or may not use networks to obtain financial information generally 
and then addresses how the use of social networks may vary by socioeconomic 
status. After discussing the data and methods, this article presents results from 
bivariate probit analyses that examine how indicators of socioeconomic status 
influence whether people get saving and investment information from social 
networks or from alternative sources, such as bankers, the media or paid financial 
professionals. The final section of the paper discusses the conclusions and how 
the findings contribute to our understanding of wealth inequality. 


Social Networks 


A thriving literature addresses the centrality of social networks in the job- 
matching process (e.g., Fernandez, Castilla and Moore 2000; Granovetter 1995; Lin 
2000; Marsden and Gorman 2001), in shaping consumer transactions (DiMaggio 
and Louch 1998; Engel, Blackwell and Miniard 1995), and in the operation of 
financial markets (Abolafia 1996; Baker 1984; Podolny 2001; Uzzi 1999). The 
importance of networks in shaping both consumer decisions and socioeconomic 
outcomes suggests that social networks would also play an important role in the 
search for financial information. Drawing upon the networks literature, I expect 
that people will prefer to use social networks as sources of financial information 
for two reasons: cost and trust. 

First of all, people may prefer to use networks because they cost less. Unlike 
financial professionals, friends and family do not charge a fee for their advice. 
In addition, obtaining information from friends and family is often easier than 
locating alternative sources of information. Friends and family are accessible 
and opportunities for exchanging information may arise in the context of other 
informal interactions. A second reason people may prefer networks is trust. Since 
most people are not confident of their level of financial knowledge (Lach 1999) 
and because it is often difficult to determine the quality of financial advice, the 
risk of receiving misleading or false information is particularly salient in the 
search for saving and investment information. Networks are embedded in shared 
norms of mutual obligation and trust, minimizing the risk of opportunism. In 
other words, people may prefer to get information from social networks because 
people assume that friends and family members will treat them fairly, will not 
knowingly steer them into the purchase of an inferior product or service, and 
will have no economic incentive for offering their opinions. 


WHY WOULD PEOPLE NOT USE NETWORKS? 


With all else being equal, people would probably prefer to receive saving and 
investment information from friends and relatives. Why conduct an extensive 
search for information or pay for advice if you don’t have to? But what if the 
type of information people seek requires knowledge, information or expertise 
that their networks do not possess? In such situations, people may turn to sources 
outside their networks. 

The primary reason that people turn to financial professionals is because they 
believe that professionals are more knowledgeable (Larson 1993). In matters such 
as personal finance, in which people lack information or expertise, professional 
advice might be especially sought out — particularly as the degree of complication 
or the need for specialized knowledge increases. 

Furthermore, people may prefer not to use their social networks if the 
information is sensitive and they do not feel comfortable discussing it with people 
close to them. In fact, many people feel uncomfortable revealing their financial 
situation to friends and family members (Rubenstein 1981). And perhaps for 
good reason. Money is frequently the medium through which people express 
struggles over love and feelings of anger, resentment and envy (Millman 1991; 
Zelizer 1997, 2002). People may feel that if they share the details of their own 
financial situation with friends and family members, those friends and relatives may 
respond with jealousy, pity or embarrassment. 

And finally, exchange theory (Blau 1964) suggests another reason that people 
may not prefer to use their social networks. The exchange of information within 
personal networks involves the expectation of reciprocity and, therefore, asking 
others for advice or information may make one indebted to the person providing 
the information. 

Therefore, although social networks may be easier, cheaper and more trusted 
sources of information, people may search outside their networks if they cannot 
obtain the information they need from their networks, if they wish to keep 
financial affairs private, or if they wish to avoid feeling obligated to the person 
providing the information. 


Socioeconomic Status and the Use of Networks 


Extrapolating upon the use of networks for job information, I expect that 
the use of networks for obtaining saving and investment information varies 
by socioeconomic status. Socioeconomic status is positively related to access 
to resource-rich networks (Lin 1999). Because networks are economically 
and educationally homogeneous (Lin 1999; Marsden 1987), the higher one's 
socioeconomic status, the more resource-rich one's networks are. For example, 
economic elites have access to important types of financial information through 
their business, political and social ties (Domhoff 1974, 1998; Mills 1956; Useem 
1984). Boards of directors are in powerful positions with respect to knowledge 
that may be translated into financial gain and are often linked to others in similar 
positions through interlocking directorates or indirect interlocks. In fact, Useem 
(1984) found that corporate leaders use interlocking directorates to exchange 
information that could be used for economic gain. Even if one is not a member 
of the corporate elite, members of the upper class tend to belong to the same 
social clubs and tend to marry other members of the upper class (Domhoff 1974). 
Therefore, one would expect that the wealthy not only have more money, but 
also have networks whose members are financially knowledgeable. One might 
argue that this tendency exists, to a lesser degree, in the middle class as well. 
Accountants, lawyers and others in the business and financial industries are also 
likely to have access to financial knowledge or information that may be useful 
when making saving and investment decisions. 

If socioeconomic status is positively related to financial information or 
access to financial expertise more generally, and if social networks are fairly 
homogeneous with respect to socioeconomic status, then we would expect 
people in socioeconomically advantaged positions to have networks that contain 
individuals with more extensive financial knowledge. The literature on the use 
of social capital in job searches suggests the same relationship between social 
standing and access to others in “high” positions. (For a review, see Lin 1999.) 

Nevertheless, one cannot assume that access to social resources implies the 
use of social resources (Lai, Lin and Leung 1998). For example, although those 
of higher socioeconomic status are more likely to have resource-rich networks, 
research on job searches has shown that they make less use of their networks in 
acquiring a job (De Graaf and Flap 1988; Granovetter 1995). An explanation for 
this finding is that people with resource-rich networks are also likely to possess 
the human capital necessary to render other avenues of a job search fruitful. 
In other words, they may have alternative paths or options available to them 
when acquiring jobs. The same processes may operate in the search for financial 
information. Those with more beneficial networks are also likely to have the 
option of seeking information from other sources — such as the ability to pay a 
financial professional for advice. Consequently, the use of networks for financial 
information is hypothesized to decline as socioeconomic status increases because 
socioeconomic status is positively related to one's ability to obtain saving and 
investment information from alternative sources, rendering one less reliant on 
networks as sources of information. 

The same hypothesis is evident when the issue is approached from the opposite 
end of the socioeconomic hierarchy. Some studies have found that the poor rely 
on social networks for both emotional and economic support. Auslander and 
Litwin argue that the poor rely on informal support from family and friends 
because they have no other options: they have “heavy burdens, small circles 
of support already fully tapped, and few additional resources from which to 
seek informal help” (1988:237). Despite the heavy reliance on networks among 
the poor, research suggests that their networks are smaller and more heavily 
comprised of family members (Auslander and Litwin 1988; Marsden 1987). 

We are left with the impression that the poor are less likely to have extensive 
networks to tap for financial information. But once again, we must bear in mind 
the distinction between having and using networks. Despite the fact that the 
poor are less likely to have networks containing people with access to financial 
expertise, the poor may be more likely to use networks because they have few 
alternative sources of financial information. 

To summarize, the literature on the use of networks for job information 
suggests that although socioeconomic status should be positively related to access 
to networks of individuals with extensive financial information or expertise, 
those with the most resource-rich networks may be least likely to use them to 
search for financial information because they have greater access to alternative 
sources of information, such as financial professionals. As socioeconomic status 
decreases, individuals become increasingly reliant upon networks because they 
have less access to alternative sources of information. Therefore, variation in the 
composition and use of social networks by socioeconomic status suggests that 
access to saving and investment information is embedded in broader systems 
of social inequality. By examining how people search for financial information, 
we can better understand the ways in which such individual-level processes are 
related to economic inequality. 


DATA 


I use data from the 1998 Survey of Consumer Finances to examine socioeconomic 
differences in the sources of information used when making saving and investment 
decisions. The Survey of Consumer Finances is a comprehensive triennial national 
survey carried out by the Federal Reserve Board. Households for the main survey 
are selected from a multi-stage area probability design that was intended to 
represent the population with respect to broad demographic characteristics. In 
addition, the Federal Reserve Board oversamples high-income households who 
are less likely to show up in the main sample. (See Kennickel, Starr-McCluer and 
Surette [2000] for an overview of the Survey of Consumer Finances). To correct 
for the differential probability of being included in the sample, sample weights 
are used when appropriate. 

The main question of interest revolves around the respondent's answer to 
the following question: 


“How do you (and your spouse/partner) make decisions about saving 
and investments? Do you call around, read newspapers, material you get 
in the mail, or use information from television, radio, an online service 
or advertisements? Do you get advice from a friend, relative, lawyer, 
accountant, banker, broker or financial planner? Or do you do something 
else?” 


Answers were open-ended and coded, in the order given, to a maximum of 10 
responses. Respondents reported using an average of 1.83 sources of information. 
I created a set of dummy variables to indicate whether a respondent reported 
consulting each particular source for financial information, regardless of the order 
reported by the respondent. However, because the survey recorded responses in 
the order given, I was able to compare analyses of the respondents’ first responses 
with the analyses presented here and did not find any substantive differences. 
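
The coding step described above can be sketched as follows. This is a minimal illustration, not the author's actual procedure: the source labels in `SOURCES` and both function names are hypothetical, since the survey's real response codes are not given in the text.

```python
from typing import Dict, List

# Hypothetical source labels; the survey's actual codes are not given in the text.
SOURCES = [
    "friend_relative", "banker", "financial_planner", "broker",
    "accountant", "lawyer", "magazines_newspapers", "advertisements",
    "mail", "tv_radio", "internet",
]


def source_dummies(responses: List[str], max_responses: int = 10) -> Dict[str, int]:
    """Return a 0/1 indicator per source, ignoring the order of mention."""
    kept = set(responses[:max_responses])  # at most 10 responses were coded
    return {source: int(source in kept) for source in SOURCES}


def first_response_dummies(responses: List[str]) -> Dict[str, int]:
    """Alternative coding that flags only the first source mentioned."""
    return source_dummies(responses[:1])
```

Comparing models fitted on the output of `source_dummies` with models fitted on `first_response_dummies` corresponds to the robustness check mentioned in the paragraph above.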

The primary explanatory variables are indicators of socioeconomic status. 
Three variables are used to examine effects of socioeconomic status: level of 
education, income and liquid assets. Education refers to the highest level of 
educational attainment of the respondent or spouse, whichever is greater. This 
variable is coded into the following ordinal categories: less than high school, 
high school diploma or GED, some college, college degree and graduate school. 
Income is measured as the log of the total household income from all sources 
(i.e., wages, capital gains, etc.). An indicator of wealth, liquid assets is measured 
as the log of the total market value of all household liquid assets (savings and 
checking accounts, certificates of deposit, bonds, mutual funds and stocks). 

To properly specify the model, it is necessary to control for attributes that may 
influence the search for saving and investment information besides socioeconomic 
status. These control variables include the following: marital status, race, age and 
financial risk tolerance. Marital status is represented by two dummy variables: one 
indicating that the household is headed by a single female and 
the other indicating that the household is headed by a single 
male. Married and cohabitating households are treated as the reference category. 
Race is measured using two dummy variables indicating whether the respondent 
is black or whether the respondent is of another race, with white as the reference 
category. Age refers to the respondent (or, if married or cohabitating, the age of 
the older partner is used). Risk tolerance refers to the self-reported preference for 
taking financial risks. Answers range from 1: “not willing to take any financial 
risks” to 4: “take substantial financial risks expecting to earn substantial returns.” 
Means and standard deviations for all variables are presented in Appendix A. 


METHOD 


Since the dependent variable of interest is the source(s) of information when 
saving and investment decisions are made (and therefore only applies to those 
households who save or invest), one must model the possibility of sample 
selection bias. Selection bias occurs when cases that are non-randomly related 
to the dependent variable are excluded from analyses. In the context presented 
here, the dependent variable is the source of information used for saving and 
investment decisions and hence includes only those households that save or invest. 
The potential source of selection bias is that households who do not save or invest 
are excluded from the analyses. Heckman’s (1979) maximum-likelihood probit 
estimation with selection is an appropriate method for addressing this issue. This 
model assumes that the likelihood of consulting each source of information is 
a function not only of the independent variables in the model, but also of 
the likelihood that a household saves or invests in the first place. The Heckman 
model first estimates the selection model (to predict whether a household saves 
or invests), calculates the expected error, and then uses the estimated error from 
the selection equation in the calculation of the analyses predicting whether 
a household uses a particular source of information when making saving or 
investment decisions. A separate model was run for each source of saving and 
investment information. 
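
For reference, a probit model with sample selection is commonly written as two latent-variable equations with correlated errors. The formulation below is the standard textbook version, not a specification reported by the author; the symbols $x_i$, $z_i$, $\beta$, $\gamma$ and $\rho$ are introduced here purely for illustration.

```latex
\begin{align*}
s_i &= \mathbf{1}\!\left[z_i'\gamma + u_i > 0\right]
  && \text{(selection: household saves or invests)} \\
y_i &= \mathbf{1}\!\left[x_i'\beta + \varepsilon_i > 0\right]
  && \text{(outcome: source consulted; observed only if } s_i = 1\text{)} \\
\begin{pmatrix} \varepsilon_i \\ u_i \end{pmatrix}
  &\sim N\!\left(
    \begin{pmatrix} 0 \\ 0 \end{pmatrix},
    \begin{pmatrix} 1 & \rho \\ \rho & 1 \end{pmatrix}
  \right)
\end{align*}
```

In this formulation, $\rho = 0$ means the two equations are independent and an ordinary probit on the selected households would suffice; a nonzero $\rho$ is the dependence that the selection correction is meant to absorb.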

The selection criterion is whether or not a household saves or invests. To 
identify households who do not save or invest, I used respondent answers to the 
question of how they make decisions about saving and investments (see the first 
page of this article’s “Data and Method” section). If respondents answered this 
question by stating that they don’t save or invest (and listed no sources of saving 
or investment information), they were coded as households that do not save or 
invest. Using this definition, only 10% of households reported that they do not 
save or invest. This definition is liberal and most likely includes households that 
save regularly as well as those that rarely save. It may also refer not only to current 
saving and investment decisions, but perhaps to decisions that were made many 
years ago. Nevertheless, analyses that utilized less liberal definitions of whether 
the household saves or invests did not differ from those presented here. 

The primary question of interest refers to both saving and investment 
decisions; but because “saving” and “investing” may involve different processes, 
I also conducted parallel analyses using Heckman’s probit model with different 
“selection” criteria — one in which I selected only households who were likely 
to be “investors” (as evidenced by their ownership of stocks, mutual funds or 
bonds) and another in which I selected only households engaged in “savings” 
(those with checking accounts, savings accounts or CDs, but without stocks, 
mutual funds or bonds). Results were consistent when each of these different 
selection criteria was used, suggesting that socioeconomic status affects “saving” 
and “investing” in similar ways. Consequently, I present the results that do not 
distinguish between saving and investing because doing so fits most closely with 
the wording of the questionnaire. 
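
The two alternative selection rules described above can be expressed as a small classification function. This is only a sketch of the stated definitions; the function name, argument names and return labels are hypothetical, and the actual survey variables used by the author are not given in the text.

```python
from typing import Optional


def selection_group(
    *,
    has_stocks: bool,
    has_mutual_funds: bool,
    has_bonds: bool,
    has_checking: bool,
    has_savings: bool,
    has_cds: bool,
) -> Optional[str]:
    """Classify a household as 'investor', 'saver', or neither (None)."""
    # "Investors": ownership of stocks, mutual funds or bonds.
    if has_stocks or has_mutual_funds or has_bonds:
        return "investor"
    # "Savers": checking accounts, savings accounts or CDs, but none of the above.
    if has_checking or has_savings or has_cds:
        return "saver"
    return None
```

Each parallel analysis would then keep only the households falling into one of the two groups.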


Descriptive Results 


Of the 90% of households reporting that they save or invest, friends and relatives 
are the most frequently utilized sources of information for saving and investing 
decisions, as shown in Figure 1. Approximately 41% of households that save or 
invest consult friends and relatives, revealing the importance of social networks 
in the exchange of financial information. In fact, no other single source of 
information comes close to matching the frequency with which households consult 
friends and relatives when making saving and investment decisions. After friends 
and relatives, the most commonly reported sources of information were bankers 
(27.1%), magazines and newspapers (20.8%), and financial planners (20.5%). 
Other sources reported include brokers, advertisements, material in the mail, 
television and radio, accountants, the Internet and lawyers. 

Collectively, these various sources can be grouped into four general types: (1) 
social networks: friends and relatives; (2) paid financial professionals: financial 
planners, accountants, brokers and lawyers; (3) bankers; and (4) media: magazines 
and newspapers, television and radio, advertisements, material in the mail, and 
the Internet. Figure 2 presents the percentage of respondents who report each 
of these four general types of sources. 
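
The grouping just described amounts to a lookup from detailed source to general type. The sketch below follows the four groups listed in the text; the label strings and the function name `aggregate_types` are hypothetical and chosen only for illustration.

```python
from typing import Dict, Iterable

# Grouping taken from the text; the label strings themselves are hypothetical.
SOURCE_TYPE = {
    "friend_relative": "social_networks",
    "financial_planner": "paid_professional",
    "accountant": "paid_professional",
    "broker": "paid_professional",
    "lawyer": "paid_professional",
    "banker": "banker",
    "magazines_newspapers": "media",
    "tv_radio": "media",
    "advertisements": "media",
    "mail": "media",
    "internet": "media",
}


def aggregate_types(reported: Iterable[str]) -> Dict[str, int]:
    """Return a 0/1 indicator for each general type of source a household used."""
    types = sorted(set(SOURCE_TYPE.values()))
    used = {SOURCE_TYPE[s] for s in reported if s in SOURCE_TYPE}
    return {t: int(t in used) for t in types}
```

A household that mentions both a broker and a lawyer is counted once under paid financial professionals, which is why the aggregated percentages are smaller than the sum of their components.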

Whereas 41.3% of households that save or invest reported consulting friends 
or relatives, 35.5% reported consulting a paid financial professional, and 27.1% 
reported consulting a banker. Here, I am assuming that “banker” refers to 
someone who works at a bank (such as an account manager or specialized 
financial counselor), not the CEO. What separates bank employees from other 
paid financial professionals in this paper is the key distinction that this person 
is not paid a fee directly by the consumer for their advice. People who work at a 
bank are also much more accessible than the other financial professionals listed 
Figure 1. Sources of Information Used When Making Saving and Investment 
Decisions (Only Households that Save or Invest; N = 3975) 

Figure 2. Aggregated Sources of Information Used When Making Saving 
and Investment Decisions (Only Households that Save or Invest; 
N = 3975) 


1478 / Social Forces 83:4, June 2005 


because it is often possible to speak with a bank employee without making an 
appointment. Consequently, the effects of socioeconomic status on consulting 
people who work at a bank are likely to differ from the effects of socioeconomic 
status on consulting the other types of financial professionals. 

About 35% of households use some form of media to gather information 
about saving or investing. Of all five media sources, the category that includes 
magazines and newspapers is the most frequently cited source of information. 
Unfortunately, it is not possible to discern whether the respondent is referring to 
advertisements in local newspapers or specialized financial publications such as 
Fortune or the Wall Street Journal. Therefore, it is important to bear in mind the 
tremendous diversity in the range of content and quality of information likely 
received by respondents who report consulting magazines and newspapers for 
information. 

Approximately 15% of respondents who save or invest report consulting no 
source other than themselves (or a spouse or partner). Of the 85% of households 
that consult other sources, friends and relatives are the most frequently utilized 
sources of information. These findings reveal that social networks are key 
mechanisms for the exchange of financial information, providing a unique 
opportunity to observe how sociological factors intersect with economic 
decisions. This article will next address how socioeconomic status affects whether 
information is obtained from networks or from alternative sources (i.e., paid 
financial professionals, bankers and the media). 


SOCIOECONOMIC DIFFERENCES 


Of all three socioeconomic measures (i.e., education, income and liquid assets), 
liquid assets best signify a household's overall socioeconomic resources. As an 
indicator of wealth, liquid assets is more encompassing than income or education, 
and more closely tied to a household's ability to save and invest. 

Table 1 displays the frequencies of using particular sources of information by 
liquid asset quartile for those households that save or invest. The tendency to 
consult friends and relatives declines as liquid asset quartiles increase. In contrast, 
the percentage of households consulting paid financial professionals increases 
quite dramatically as liquid assets increase. More than half of the households 
in the top quartile report consulting a paid financial professional when making 
saving and investment decisions. The use of media also generally increases as 
liquid assets increase, although to a lesser degree. The use of bankers does not 
follow a linear trajectory, as bankers are consulted most often by those in the 
top and bottom quartiles. 

At this point, one might reasonably conclude that friends and relatives are 
the most frequently used source of information for those in the bottom half 
of the economic distribution and that financial professionals are the favored 

Table 1. Percent of Households Reporting Different Sources of Saving and 
Investment Information by Liquid Asset Quartile. 


                                          Liquid Asset Quartile 
                                 1st        2nd        3rd        4th 
                                 $0-        $601-      $4,691-    $29,101- 
Informational Source             $600       $4,690     $29,100    highest     Chi2 
Friend, Relative                 45.7       44.8       42.4       33.7        p<.001 
Paid Financial Professional      19.4       25.1       41.0       52.3        p<.001 
  Financial Planner              10.9       15.5       24.2       28.9        p<.001 
  Broker                          4.8        7.2       11.5       22          p<.001 
  Accountant                      4.4        5.7       10.4       15.4        p<.001 
  Lawyer                          3.8        2.8        3.7        6.0        p<.001 
Banker                           27.4       25.5       25.8       29.8        p<.1 
Media                            31.4       34.9       37.8       35.9        p<.001 
  Magazines, Newspapers          15.1       17.0       22.3       27.3        p<.001 
  Advertisements                 10.1       13.0       11.6       10.6        p<.001 
  Material in the Mail            9.9       10.2        9.5       10.4        p<.1 
  Television, Radio              10.3       10.6        9.2        9.6        p<.001 
  Internet                        4.1       ZZ         11.5       12.5        p<.001 
Unweighted N (Total = 3975)     632        742        833       1768 


Note: Data are for households that save or invest and are weighted. 


information source for the wealthiest households. Nevertheless, more than 
one-third of the wealthiest households report consulting friends and relatives for 
advice, suggesting that networks are a significant source of information across 
the socioeconomic spectrum. 


Results of Multivariate Analyses 


I utilize Heckman’s (1979) probit model with selection to examine the likelihood 
that a household uses each particular source of information. As mentioned 
previously, the bivariate probit model with selection was chosen in order to 
address possible selection bias since not all households save or invest. The 
particular model tested here assumes that the likelihood of consulting each 
source for information is a function of a household’s level of education, marital 
status, race, age, financial risk tolerance, income and liquid assets, whereas 
the likelihood of saving and investing (the selection criterion) is a function of a 
household's level of education, income, marital status, race, age and financial 
planning period. 

For purposes of simplification, the tables in the main text present only the 
results from the substantive equation that predicts whether or not each source 
was consulted. The coefficients from the first selection equation (which predicts 
whether a household consults friends and relatives) are presented in Appendix B. 
The coefficients for the selection portion of the remaining equations are virtually 
identical and, therefore, it would be redundant to list them independently; 
however, they are available from the author by request. The results from the 
selection portion of the analyses reveal that the most significant predictors 
of whether a household saves or invests in the first place are respondents’ level 
of education, income and the financial planning period that they feel is most 
important when making saving and spending decisions, ranging from (1) “the 
next few months” to (5) “more than 10 years.” 

Table 2 presents the results of the bivariate probit analysis with selection for 
each source of saving and investing information. 


FRIENDS AND RELATIVES 


Friends and relatives are used primarily by those with the least financial resources, 
as represented by both income and liquid assets. Therefore, the hypothesis that 
socioeconomic status is negatively related to the use of personal networks for 
financial information is supported. Although networks have the advantage of 
being easier, cheaper and embedded within norms of trust, as socioeconomic 
status increases these benefits appear either to decline or to be replaced by 
advantages offered by other search methods. 

Besides socioeconomic status, the use of social networks for information is 
greater for households headed by single females and for younger households. 
These findings are consistent with previous findings that network range is greater 
for younger individuals (Marsden 1987). 


OTHER SOURCES OF INFORMATION 


Paid Financial Professionals 


Two indicators of socioeconomic status (education and liquid assets) increase 
the likelihood that a household will consult a paid professional for saving and 
investment information. When households have more money to save and invest 
— and more money to pay for professional advice — it is not surprising that 
they seek information from a professional. 

A look at the detailed categories that comprise the aggregated paid financial 
professional category provides more information on socioeconomic differences 
(see Table 3). Consistent with the results of the aggregated financial professional 

Table 2. Coefficients for Bivariate Probit Models with Selection Predicting 
Sources of Information Consulted When Making Saving and 
Investment Decisions. 


                            Friends/Relatives   Paid Professional   Banker        Media 
Socioeconomic Status: 
  Education                  .023               [illegible]         -.091***      [illegible] 
                            (.024)              (.023)              (.026)        (.025) 
  (log) Income              -.024*               .002               -.026         -.040** 
                            (.015)              (.014)              (.017)        (.015) 
  (log) Liquid Assets       -.029***            [illegible]          .020**       -.005 
                            (.009)              (.009)              (.009)        (.009) 
Controls: 
  Marital Status: 
    Single Female-Headed     .309**              .019                .015         -.015 
                            (.057)              (.059)              (.061)        (.059) 
    Single Male-Headed       .068               -.067               -.098         [illegible] 
                            (.065)              (.065)              (.070)        (.064) 
  Race: 
    Black                   -.031               [illegible]         [illegible]   [illegible] 
                            (.078)              (.080)              (.083)        (.079) 
    Other                    .087               -.253**             [illegible]   [illegible] 
                            (.074)              (.078)              (.082)        (.075) 
  Age                       [illegible]          .003*               .001         -.010*** 
                            (.002)              (.002)              (.002)        (.002) 
  Financial Risk             .001               [illegible]         -.055*        [illegible] 
    Tolerance               (.028)              (.029)              (.028)        (.027) 
Log likelihood              -3470.7             -3411.9             -3223.9       -3469.7 
Wald Chi2                    254.5***           [illegible]         [illegible]   [illegible] 
Rho                         [illegible]         [illegible]         -.349*        [illegible] 
                            (.262)              (.194)              (.345)        (.277) 
N                            4305                4305                4305          4305 


Note: Standard errors are in parentheses and coefficients for the constant term are not 
presented. 


*p<.10 **p<.05 ***p<.001 (two-tailed tests) 

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category, as socioeconomic status increases so does the likelihood that a household 
will consult each type of financial professional. 

In addition to socioeconomic differences, there are statistically significant 
racial differences in the use of paid financial professionals. Black households 
are more likely than economically similar white households to seek the advice 
of paid professionals when making saving and investment decisions (financial 
planners, accountants and lawyers in particular). Since many blacks are newer 
to the investment arena (Conley 1999; Oliver and Shapiro 1997), they may 
feel more of a need to turn to professionals for information. As a financial 
consultant quoted in a 1997 Money magazine article explained, "Since many 
blacks are first- or second-generation investors at best, there is no pattern of 
behavior to follow, so they often feel more secure relying on professional help" 
(Smith 1997). And unlike other minority groups, blacks have been the focus of 
marketing by the financial services industry, such as Ariel Capital Management 
(a black-owned financial services company) and many others, who have targeted 
the black community as potential clients. Furthermore, organizations ranging 
from churches to the Coalition of Black Investors have been actively reaching 
out to current and potential black investors to increase financial literacy and 
forge connections between blacks and financial professionals (Kong 2000; Mabry 
1999; Weems 1998:107). 


Bankers 


People who work at a bank are a less expensive alternative to paid financial
professionals. Interestingly, education is negatively related whereas liquid assets are
positively related to the likelihood that a household will consult a banker for 
saving or investment information. In comparing the effects of socioeconomic 
status across the banker and paid professional categories, results indicate 
that education and risk aversion, rather than financial assets, per se, are the 
determinants of which type of financial industry representative (paid professional 
or banker) a household chooses. A banker is the favored information source of 
less-educated, more risk-averse households; a paid professional is the favored 
information source of higher-educated, less risk-averse households. 


Media 


Interestingly, the use of media increases with education, but decreases as income 
increases. However, as mentioned previously, the media category could potentially 
include things as diverse as the local news, direct-mail advertisements, the Wall 
Street Journal or financial sites on the Internet. Given the diversity of types of 
media, it is prudent to examine the bivariate probit analyses of each individual 
media category, which are presented in Table 3. 
Higher educational attainment increases the likelihood that households will
consult several forms of media: material in the mail, the Internet and magazines 
or newspapers. Those with more wealth are likely to consult the Internet as well 
as magazines and newspapers. One interesting benefit of consulting the media 
(compared to networks or paid professionals) is that the search is private and there 
is no risk of offending anyone if the advice or information is disregarded. 

But it is important to bear in mind that the content and quality of information
vary across media forms and even within the same form of media. Without
knowing more about the media form or content, the implications of any 
socioeconomic differences in media use are unclear at this point and can be 
resolved only in future studies. 


Are Alternative Sources Used at the Expense of Networks?


Results thus far indicate that social networks were used most often by those with 
the least wealth, while wealthier households were more likely to gather savings 
and investment information from paid professionals and from certain types of 
media. However, is the reliance on professionals and the media at the expense of 
social networks? After all, it seems reasonable that one might want to use multiple 
sources of advice, thus combining their strengths and minimizing their inherent 
weaknesses. To elaborate, networks have the advantage of being embedded in 
norms of mutual trust, thereby minimizing the risk of opportunism. However, 
people in one's networks may not be as financially knowledgeable as paid experts. 
In contrast, paid professionals have the advantage of being knowledgeable, but 
there is the risk that the financial professional will behave opportunistically rather 
than provide advice that is in the customer's best interest. Combining information 
from networks with information from other sources (such as paid professionals) 
may help to minimize risk by tapping both sources of information — trustworthy 
(but perhaps not as knowledgeable) and knowledgeable (but perhaps looking to 
further their own interests over yours). 

To further explore how socioeconomic status impacts whether people use 
networks in combination with other sources of information, I conducted a 
multinomial logistic regression for those households that save or invest, in
which there were three possible outcomes: (1) networks used as one's only
source of information, (2) networks used in conjunction with other sources of
information, and (3) networks not used. In the results presented in Table
4, people using networks in conjunction with other sources of information are 
the reference category, which allows me to determine (a) how they compare to 
those who use networks exclusively and (b) how they compare to those who do 
not use networks. 
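
The role of the reference category can be illustrated with a minimal sketch of how a multinomial logit model turns linear predictors into outcome probabilities. The linear-predictor values below are hypothetical (the constant terms are not reported in Table 4, so fitted probabilities cannot be reproduced from it), and the function name is an assumption made for this illustration.

```python
import math

def multinomial_logit_probs(linear_indexes):
    """Return outcome probabilities for a multinomial logit model.

    linear_indexes maps each non-reference outcome to its linear
    predictor x'b. The reference category (networks used with other
    sources) has its linear predictor fixed at zero.
    """
    exp_terms = {k: math.exp(v) for k, v in linear_indexes.items()}
    denom = 1.0 + sum(exp_terms.values())  # the 1.0 is exp(0) for the reference
    probs = {k: v / denom for k, v in exp_terms.items()}
    probs["networks_with_other_sources"] = 1.0 / denom
    return probs

# Hypothetical linear predictors for one household.
probs = multinomial_logit_probs({"networks_only": -1.2, "no_networks": 0.4})
```

Each coefficient therefore describes how a predictor shifts one outcome relative to the reference category, which is why the two columns of Table 4 are read as two separate comparisons.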

The first column of Table 4 compares those who use networks exclusively with 
those who use networks in conjunction with other sources of information (the 
reference category). Results indicate that those who use networks exclusively have
lower socioeconomic status (as measured by education, income and liquid assets) 
than those who use networks in conjunction with other sources. Those with the 
least socioeconomic resources are more likely to rely exclusively on networks. 

The second column of Table 4 compares those who do not use networks with 
those who use networks in conjunction with other sources. Although education is 
positively related to the use of networks in conjunction with other sources, those 
who use networks in conjunction with other sources have lower liquid assets, 
suggesting that as wealth increases, people are more likely to exclude networks 
as a potential source of information. 
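
One way to read the magnitudes behind these comparisons is to exponentiate the reported coefficients, which gives the multiplicative change in the odds of an outcome relative to the reference category for a one-unit increase in the predictor. The sketch below applies this standard transformation to three coefficients reported in Table 4; the variable and function names are assumptions introduced for illustration.

```python
import math

# Coefficients as reported in Table 4
# (reference category: networks used in conjunction with other sources).
education_exclusive = -0.298        # used networks exclusively
education_no_networks = -0.110      # did not use networks
liquid_assets_no_networks = 0.026   # did not use networks

def relative_risk_ratio(coefficient):
    """Multiplicative change in the odds of an outcome versus the
    reference category for a one-unit increase in the predictor."""
    return math.exp(coefficient)

rrr_education_exclusive = relative_risk_ratio(education_exclusive)
rrr_education_no_networks = relative_risk_ratio(education_no_networks)
rrr_liquid_assets_no_networks = relative_risk_ratio(liquid_assets_no_networks)
```

A ratio below one means the outcome becomes less likely relative to the reference category as the predictor rises; a ratio above one means it becomes more likely.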

These results suggest that the use and exclusivity of networks as a source of 
information shifts as socioeconomic status increases. The exclusive use of networks 
is most likely for those at the bottom of the socioeconomic ladder. People who 
use networks in conjunction with other sources have higher economic status than 
those using networks exclusively. But those who do not use networks have the 
highest economic status of all. Apparently, as socioeconomic status increases, 
people are less likely to use their networks at all, even to cross-check the information
acquired from outside sources. 

A second way of assessing how socioeconomic status may be related to 
whether people are possibly gathering information from multiple sources is to 
examine how many different sources of information people use when making 
savings and investment decisions. To empirically assess how socioeconomic 
status is related to the number of sources people utilize for making saving and 
investment decisions, I estimated a Heckman selection model, which fits a
regression model with sample selection. Like the Heckman probit model with
selection (discussed previously), it addresses the possible selection bias that arises
because not all households save or invest; but whereas the Heckman probit model is
appropriate for discrete outcomes, the Heckman selection model is appropriate 
for regression equations. 
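
The logic of the selection correction can be sketched with the inverse Mills ratio, the term that links the outcome equation to the selection equation in a Heckman-type model. This is a minimal illustration of the textbook formula for the expected outcome among selected cases, not the estimation procedure behind Table 5; the function names and all numeric inputs are hypothetical.

```python
import math

def normal_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)

def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def inverse_mills_ratio(z):
    """phi(z) / Phi(z): the expected selection-equation error among
    cases that are selected, given selection index z."""
    return normal_pdf(z) / normal_cdf(z)

def expected_outcome_if_selected(xb, zg, rho, sigma):
    """E[y | selected] = x'b + rho * sigma * lambda(z'g), where rho is the
    error correlation and sigma the outcome-equation error s.d."""
    return xb + rho * sigma * inverse_mills_ratio(zg)

# Hypothetical values: with rho = 0 the correction term vanishes.
no_correction = expected_outcome_if_selected(2.0, 0.5, rho=0.0, sigma=1.5)
with_correction = expected_outcome_if_selected(2.0, 0.5, rho=-0.2, sigma=1.5)
```

When rho is close to zero, as is the case for a model whose estimated Rho is not statistically distinguishable from zero, the corrected and uncorrected expectations nearly coincide.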

The dependent variable for the Heckman selection model is the number 
of sources people report using when making saving and investment decisions. 
The independent variables for the regression equation and the selection model 
are identical to previous models. The results of the Heckman selection model 
predicting number of sources of information when making savings and 
investment decisions are presented in Table 5. (As with the models in Tables 2 and
3, the coefficients for the selection portion of the analyses are virtually identical 
to those presented in Appendix B.) 

Table 4. Coefficients for Multinomial Logistic Regression Model Comparing
Characteristics of Those Using Networks in Conjunction with
Other Sources (Reference Category) with Those Using Networks
Exclusively and Those Not Using Networks.

                            Used Networks Exclusively   Did Not Use Networks
Socioeconomic Status:
  Education                 -.298***                    -.110**
                            (.063)                      (.037)
  (log) Income              -.023                       .034
                            (.034)                      (.023)
  (log) Liquid Assets       -.075 [marker illegible]    .026*
                            (.023)                      (.015)
Controls:
  Marital Status:
    Single Female-Headed    [illegible]                 -.467***
                            (.157)                      (.102)
    Single Male-Headed      .228                        -.028
                            (.187)                      (.116)
  Race:
    Black                   -.096                       .036
                            (.207)                      (.137)
    Other                   .169                        -.079
                            (.196)                      (.133)
  Age                       [illegible]                 [illegible]
                            (.004)                      (.003)
  Financial Risk Tolerance  -.244**                     -.049
                            (.083)                      (.047)
Log likelihood              -3188.2                     -3188.2
Wald Chi2                   [illegible]                 [illegible]
N                           3,975                       3,975

Note: Standard errors are in parentheses and coefficients for the constant term are not presented.
*p<.10 **p<.05 ***p<.001 (two-tailed tests)

As shown in Table 5, socioeconomic status is positively related to the number
of sources consulted when making saving and investment decisions. It seems
logical that those individuals with more wealth would perform more extensive
searches for information since they have more wealth to manage. Taken in
conjunction with the previous analyses, it appears that as socioeconomic status
increases, people are more likely to consult multiple sources of information except
for their social networks. In other words, those of the highest socioeconomic status
are likely to be combining multiple sources of information, but as socioeconomic
status increases, networks are decreasingly likely to be one of the sources.


Discussion and Directions for Further Research 


Social networks are by far the most frequently used source of information for 
making saving and investment decisions among the 11 sources examined in this 
paper. Social networks are clearly key mechanisms by which financial information 
is exchanged in households of all socioeconomic levels. However, networks are 
used most often by households with the fewest economic resources. The negative 
relationship between socioeconomic status and the use of networks for saving 
and investment information may be a result of the types of information sought 
by lower socioeconomic status households. Members of these households are 
not likely to be flush with cash or investing in the stock market. Therefore, they 
are most likely interested in finding out information about account fees, interest 
rates or other types of information that can be acquired successfully by asking 
people in one's network. 

But the greater reliance on networks may also hinder poorer households' 
access to information regarding all possible saving and investment vehicles. 
Because networks are fairly socioeconomically homogeneous, people in the lowest 
socioeconomic groups are turning to others of similar socioeconomic status. 
This reliance on networks may be problematic if financial information does 
not flow equally across all socioeconomic classes or if the information obtained 
from networks is inferior to information obtained from other sources. In that 
case, those most in need of financial planning experts and access to financial 
information are least likely to receive it. 

Households with greater wealth are more likely to turn to paid financial 
professionals and to certain forms of media. Interestingly, as socioeconomic status 
increases, so does the number of sources of information consulted. However, as 
socioeconomic status increases, networks are less likely to be one of the sources 
of information. Results indicate that those at the top of the socioeconomic
ladder do gather information from multiple sources, possibly as a way of
cross-checking or combining information to minimize the risk of making a poor
decision. However, networks are likely to be left out of this process. Apparently,
as socioeconomic status increases, networks are less likely to be used for saving 
and investment information because (a) networks are less useful sources of 
information, (b) people are more resistant to sharing their personal financial 
information with others in their networks, or (c) people are hesitant to accrue 
obligations to others in their networks. 

Table 5. Coefficients for Heckman Selection Model Predicting the Total
Number of Sources of Information Consulted When Making
Saving and Investment Decisions.

                            Number of Sources Consulted
Socioeconomic Status:
  Education                 [illegible]
                            (.023)
  (log) Income              -.042**
                            (.016)
  (log) Liquid Assets       .044***
                            (.011)
Controls:
  Marital Status:
    Single Female-Headed    .075
                            (.072)
    Single Male-Headed      .081
                            (.082)
  Race:
    Black                   .294**
                            (.099)
    Other                   .034
                            (.095)
  Age                       -.012***
                            (.002)
  Financial Risk Tolerance  -.241***
                            (.034)
Log likelihood              -8610.1
Wald Chi2                   [illegible]
Rho                         -.133
                            (.091)
N                           4305

Note: Standard errors are in parentheses and coefficients for the constant term are not presented.
*p<.10 **p<.05 ***p<.001 (two-tailed tests)

An alternative explanation for the finding that socioeconomic status is
negatively related to the use of networks for information is measurement error. If
financial information is exchanged routinely in the course of other conversations
and without being actively sought out, then the recipient of the information may
not recognize that relevant information has been exchanged. As explained by
Lin, "embeddedness in resource-rich social networks increases the likelihood of
receiving useful information in the routine exchanges and without actively seeking
such information. ... Only when such useful information is not available and not
forthcoming would activation of social capital become necessary" (2000:792). It
may well be the case that people in positions of higher socioeconomic status are
more likely to receive information relevant to saving and investment decisions in
routine exchanges with members of their social networks. Because the information
is exchanged within the context of other conversations or in small incremental
pieces throughout multiple conversations, the recipients may not even realize
that they received information. The flow of information may therefore constitute
"the invisible hand of social capital" (Lin 2000:792). Because those without such
resource-rich networks do not receive information via the invisible hand, they
must actively seek information from networks.

In many ways, the findings here parallel the findings regarding the use of 
networks for job searches. In both cases, we would expect that those in higher 
socioeconomic positions would have access to beneficial networks; and yet 
findings suggest that those who have the most resource-rich networks are less 
likely to use social networks for information. There are two main explanations for 
this finding: (1) those with beneficial networks have other advantages (e.g., human 
capital, economic capital, etc.) and are less reliant on networks for information, 
and (2) those with beneficial networks do not need to actively seek information from their
networks because they routinely receive it. Further research is needed to adjudicate
between these two explanations. 

Further research is also needed to address several key questions brought to 
the fore by this paper that could not be examined due to data limitations. First, 
to better understand the implications of different search methods, future research 
should address how the content and quality of information received from these 
different sources vary. If financial professionals generally provide high-quality
information, then it is likely that the rich will become richer, further exacerbating 
wealth inequality. Second, future research should also address how the source 
of information impacts savings and investment decisions. For example, it is 
likely that those seeking information from financial advisors and brokers are 
more likely to invest in stocks and bonds, both because the people who use paid 
financial professionals are likely to have higher wealth to begin with, and also
because these types of financial professionals specialize in (and make money 
from) recommending these types of investments. Third, the collection of data 
over time is necessary to tease out the causal relationships involved in the process 
of financial decision-making and the acquisition of wealth. 

The research reported here is intended to bring financial behavior to the 
attention of sociologists because saving and investment decisions have important 
implications for understanding wealth inequality. As stated recently by several
scholars (Conley 1999; Keister 2000; Oliver and Shapiro 1997; Shapiro 2004; 
Shapiro and Wolff 2001; Sherraden 1991; Spilerman 2000; Tilly 2000), measures of 
economic well-being must move beyond income to include wealth because wealth 
inequality is much more extreme than income inequality, and also because wealth 
confers benefits that income does not: it can be transmitted across generations 
fairly easily, it can provide the ability to weather common financial crises (e.g., 
illness or unemployment), and it can be used to help generate more income and 
wealth. Consequently, in order to understand past, present and future economic 
inequality, it is necessary to understand how wealth is created and maintained 
and the ways that financial decision-making may reinforce wealth inequality. 


Notes 


1. Sample weights are used in all descriptive tables and figures, but not in the multivariate 
analyses. Since the sampling weights are derived in part from the independent variables 
in the analyses, it is not appropriate to use weighted data in the multivariate analyses. 
See Winship and Radbill (1994) for a more detailed discussion of the use of weights in 
multivariate analysis. 


2. Among the independent variables the only correlation coefficient above .5 was for liquid 
assets and education (.577). Omitting education from the analyses did not change the main 
findings. 


3. Results did not differ when I treated education as a series of dummy variables. 


4. An alternative measure of wealth would be net worth. Net worth consists of the total
household assets (i.e., real estate, vehicles, businesses, liquid assets) minus debts (i.e., home 
mortgage, credit card debt, loans). Since net worth can sometimes obscure subtle but 
important differences in wealth that may be relevant for the types of saving and investment 
decisions people make (for example, a person with $15,000 equity in their home but no 
other assets is likely to be in a different saving and investment situation than a person 
with $15,000 in liquid assets), liquid assets are used in the analyses. Nevertheless, results of 
additional analyses using net worth instead of liquid assets did not differ substantially from 
those reported here. 


5. Following the lead of other researchers, I excluded retirement plan assets that the holders 
cannot borrow against. 


6. The survey questions were designed to treat cohabitating couples as married couples and 
consequently most variables in the analyses are based on this assumption. However, the results 
did not differ when cohabitating couples were treated as distinct from married couples. 


7. The data do not contain the racial identity of both spouses in the household. The race 
variable only refers to the respondent and therefore it is not possible to ascertain whether 
the racial category refers to both spouses or partners (in the case of married or cohabitating 
households) or just the respondent. However, in 1998 interracial married couples constituted 
only 2.4% of all married couples (U.S. Bureau of the Census 2000). 


8. The question refers to the amount of risk that the respondent and spouse/partner (if 
applicable) are willing to take when they save or make investments. Because it is possible 
that one's financial risk tolerance has been constructed with a financial advisor, I have run
additional models that exclude this variable from the analyses and the results are consistent 
with those presented here. 


9. For a more detailed description of the Heckman estimator, see Greene (1997) and Winship 
and Mare (2000). 


10. This percentage is much lower than another indicator of savings in the Survey of 
Consumer Finances that asks respondents whether, over the preceding year, their household 
spent less than its income, more than its income, or about as much as its income. Fifty-six 
percent of households reported spending less than their income, an indicator of saving. 
However, when asked for their motivations for saving (even though they may not be saving 
currently), only 4.9% of respondents reported that they do not save (Kennickell, Starr-McCluer
& Surette 2000).


11. In the data, friends and relatives are combined into a single category, rendering it 
impossible for me to separate the two sources of information. 


12. Sources reported by less than 3% of the households are not included. In addition, "calling
around" is not analyzed due to the tremendous ambiguity of this category; it may include 
responses as diverse as calling friends, banks or a stock broker. Nevertheless I conducted a 
bivariate probit analysis for this category (not shown) and found that the variables used in 
the analyses were extremely poor predictors of whether or not someone “called around,” 
further indicating the conceptual ambiguity of this category. 


13. While material in the mail most likely contains items ranging from direct-mail 
advertisements to newsletters from financial institutions, its content is most likely similar 
to information in the other forms of media in this category. 


14. Characteristics of households that consult bankers support this assumption—they tend 
to have lower levels of education (see Table 2). 


15. Results for the other indicators of socioeconomic status suggest bivariate
relationships similar to those presented in Table 1.


16. Liquid asset quartiles were calculated using the entire sample and are weighted to account 
for the oversample of wealthy households. Therefore, quartiles are designed to reflect the 
U.S. population and not the survey sample. 


17. Patterns in the use of lawyers, however, deviate somewhat from the clear pattern 
mentioned here. 


18. The likelihood of having friends and family members who are financial professionals 
most likely increases as socioeconomic status increases. It is impossible to discern whether 
respondents classified such individuals as a financial professional or as a friend/relative. Since 
we have no reason to believe that a classification bias exists (i.e., respondents are neither 
more nor less likely to report these individuals as either friends/relatives or as financial 
professionals), the error is most likely random. Nevertheless, if we assume that financial 
professionals who are friends or relatives are disproportionately classified as financial 
professionals, then the observed effect of socioeconomic status on the use of friends/relatives 
and on paid financial professionals may be weakened. On the other hand, if we assume that 
these same individuals are more likely to be classified as friends or relatives, then the observed 
relationship between socioeconomic status and both the use of networks and the use of paid 
financial professionals is further strengthened. However, we have no reason to believe that 
either of these two scenarios is operating here. 
19. It is possible that there may be a reciprocal relationship between source of information
and liquid assets. For example, households might have greater (or less) wealth because they
followed the advice of a friend or a broker. Because the data are cross-sectional, I cannot
determine the causal order of the variables in the models. However, I do not think this issue 
poses a serious challenge to the findings. When people answer the question of how they 
acquire information when making saving and investment decisions, they probably think back 
to whom or what sources they consulted the last time they made a decision. Since amounts 
of liquid assets change very slowly for most households (most households have very little 
wealth), it is unlikely that the value of a household's liquid assets changed dramatically since 
their last saving or investment decision. 


20. The survey asks the following: “In planning your (your family’s) saving and spending, 
which of the time periods listed on this page is most important to you?” Answers range 
from 1: “the next few months” to 5: “more than 10 years.” The “planning period” variable 
is used in the selection equation as an instrumental variable in order to identify the model. 
Nevertheless, the inclusion of this variable as a predictor of both saving and of the likelihood 
of consulting each particular source did not change the conclusions presented here. 


21. After restricting the sample to respondents who save or invest, logistic regressions were 
also conducted. The results did not differ in any meaningful way from the results obtained 
using the bivariate probit model with selection, which is not surprising given that many of 
the values of Rho in Tables 2 and 3 are not significantly different from zero. 


22. This does not, however, exclude the possibility that the wealthiest households are using 
their networks as sources of information regarding which financial planner, accountant, 
lawyer or broker to use. 


23. Unfortunately, it is not possible to control for the selection of households that save or 
invest using multinomial logistic regression. 
Appendix A. Means and Standard Deviations for Variables in the Analysis

                                              Mean          (S.D.)
Dependent Variables:[a]
  Friends and Relatives                       .37           (.48)
    Exclusively                               [illegible]   (.32)
    In Conjunction with Other Sources         [illegible]   (.44)
  Paid Financial Professional                 [illegible]   (.47)
    Financial Planner                         .18           (.39)
    Broker                                    [illegible]   (.31)
    Accountant                                .08           (.28)
    Lawyer                                    .04           (.19)
  Banker                                      .24           (.43)
  Media                                       [illegible]   (.47)
    Magazines, Newspapers                     .19           (.39)
    Advertisements                            .10           (.30)
    Material in the Mail                      .09           (.29)
    Internet                                  .08           (.28)
    Television, Radio                         .09           (.29)
  Total Number of Sources of Information      1.83          (1.65)
Independent Variables:
  Socioeconomic Status:
    Level of Education                        2.90          (1.25)
    (log) Income                              10.11         (1.98)
    (log) Liquid Assets                       7.92          (3.41)
  Controls:
    Marital Status:
      Single Female-Headed[b]                 .27           (.45)
      Single Male-Headed[b]                   .14           (.35)
    Race:
      Black[c]                                .12           (.32)
      Other Non-White[c]                      .10           (.31)
    Age                                       [illegible]   [illegible]
    Financial Risk Tolerance                  1.89          (.87)
    Planning Period[d]                        3.00          (1.32)

Data are weighted and refer to the entire sample.
[a] Variables indicate whether a household consults or gets advice from each source (no = 0, yes = 1).
[b] Dummy variables; reference category is "married or cohabitating."
[c] Dummy variables; reference category is "white."
[d] This variable is used only in the selection equation of Heckman's probit model with selection as an
instrumental variable to identify the model (see note 20).
Appendix B. Results from Bivariate Probit Analysis, predicting whether
a household saves or invests in the first place (taken from the
full model predicting whether a household consults friends
or relatives)[a]

                              Household Saves or Invests
Socioeconomic Status:
  Education                   [illegible]
                              (.028)
  (log) Income                .104***
                              (.014)
Marital Status:
  Single Female-Headed        -.132*
                              (.074)
  Single Male-Headed          -.093
                              (.089)
Race:
  Black                       -.117
                              (.091)
  Other                       [illegible]
                              (.105)
Age                           -.0002
                              (.002)
Financial Planning Period     [illegible]
                              (.024)
Rho                           -.053
                              (.262)
N                             4305

Note: Standard errors are in parentheses and coefficients for the constant term are not presented.
*p<.10 **p<.05 ***p<.001 (two-tailed tests)

[a] As mentioned in the second paragraph of the Results of Multivariate Analyses section of this article,
the coefficients for the selection portion of the remaining analyses are virtually identical, except
that the value of Rho is statistically significant in the models predicting seeking information from
financial professionals, financial planners, brokers and accountants (as indicated in Tables 2 and 3).
Abstract


The origin and enforcement of criminal law are central to the sociological 
study of crime, yet we know relatively little about how the coercive apparatus of 
criminalization is actualized through prosecutorial and court practices. We use 
Bourdieu's extension of Weber's analysis of law to develop a perspective on fields
of practice, the juridical field and the force of law at The Hague Tribunal for the 
former Yugoslavia. Our research is based on four years of prosecutor interviews, 
courtroom observations and analyses of trials covering four prosecutorial regimes. 
Successive and competitive practices have created an interlocking and cumulative 
force that is a prerequisite to promoting international humanitarian law. 


I. Introduction 


Prime Minister David Ben-Gurion defended Adolf Eichmann’s controversial
Israeli trial for crimes against the Jewish people by insisting that “Israel does
not need the protection of an International Court” (Arendt [1964] 1977:272).
Hannah Arendt argued in response that in addition to the Jewish people, all of
humanity required an international criminal court for their protection, reasoning
that “insofar as the victims were Jews, it was right and proper that a Jewish court
should sit in judgment; but insofar as the crime was a crime against humanity,
it needed an international tribunal to do justice to it” (1977:269). Since at that
time there was neither an international criminal tribunal nor sufficient support
to create one, as a political matter the issue was momentarily moot.
© The University of North Carolina Press Social Forces, June 2005, 83(4):1499-1534


1500 / Social Forces 83:4, June 2005 


We argue below that Arendt's case for international criminal law remains 
persuasive, and that, when combined with Bourdieu's sociology of the juridical
field, it can inform us about the now renewed force of international criminal law.
The renewal of this field of law — as well as its abeyance for the last half of the 
20th century — is a reminder that the creation and enforcement of the criminal 
sanction is central to the sociological understanding of crime (see also, e.g., 
Deflem 1998; Jenness and Grattet 2001; Silbey 2002). 


II. Foundational Moments in International Criminal Law 


International criminal law flourished briefly at the Nuremberg and Tokyo 
Tribunals and then languished before reemerging in the last decade in the 
International Criminal Tribunals for the former Yugoslavia and Rwanda, Special 
Courts in Sierra Leone, East Timor and Cambodia, and the still largely nascent 
International Criminal Court. The most visible of these is the International 
Criminal Tribunal for the Former Yugoslavia (ICTY), which was established by 
the United Nations in 1993. 

The ICTY began with neither a building nor employees, little funding and no 
one in custody. After a troubled start, this tribunal grew in less than a decade to 
more than 1,000 employees, $100 million in annual UN funding, more than 50 
detainees, and the first sitting head of state, Slobodan Milosevic, on trial for crimes
against humanity and genocide. The ICTY is the leading edge of a reconstituted 
transnational juridical field (see Meron 1998; Neier 1998; Robertson 1999). 

There is significant agreement about the necessary links between law, power 
and politics that allowed this resurgence of international criminal law. Testifying 
before Congress in support of the Nuremberg Tribunal, the criminologist 
Sheldon Glueck emphasized the role of politics and power in any new field 
of law, saying that “in the early stages of any system of law the executive and 
the judicial are not sharply differentiated; and international law is not a highly 
developed system" (1944:2; see also Hagan and Greer 2002). Max Weber and 
Hannah Arendt concurred. 

Weber's writings on legal and economic order emphasized that resort to 
coercion by military and police power is essential to the distinction between 
custom and law (1978:313—14), going so far as to characterize war as a “normal” 
form of international law enforcement (1978:319). For Weber, the hallmark of 
state law — and the basis on which it retains its effectiveness — is that its formal 
rationality is further backed by the power of the state to coerce enforcement. 
In this way, Weber anticipated the problem of international criminal law in the 
20th century, namely the absence of a coercive police power to enforce these 
emerging norms.

Arendt made related points in her Origins of Totalitarianism (1973) and 
Eichmann in Jerusalem ([1964] 1977). Her essay on the “Rights of Man” 
(1973:290-302) further considered the extent to which legal mechanisms can
secure human rights. She exposed the disparity “between the efforts of well- 
meaning idealists,” who promote a concept of inalienable human rights, and 
“the situation of the rightless themselves,” who lack institutions to guarantee 
their rights (279). Arendt noted that the 18th century Declaration of the Rights
of Man asserted inalienable rights — and, therefore, provided no protections for 
them. Protection in time was tied to national sovereignty and to a nation's people 
rather than to humanity more generally. But after World War I, it became fully 
apparent that those stateless peoples who most needed human rights protection 
lacked it. 

Arendt identified a resulting human rights debate between “cynics” and 
“idealists”: while cynics (including totalitarian leaders) point to stateless people 
as evidence that inalienable human rights do not exist, well-intentioned idealists 
speak for rights without the political power to defend them. The challenge is to 
find protective space between these polarities. The question is thus posed: if the 
rhetoric of human rights is often dominated by cynics or idealists, if the conflation 
of human and civic rights has led to their unenforceability for the stateless, and if 
legal guarantees are insufficient to ground their enforcement — how can human 
rights gain a relative autonomy and force in Arendt’s framework? (See Arendt 
1973:269, 292-93) 

In Eichmann in Jerusalem ([1964] 1977), Arendt considered the role of an
international criminal tribunal and found the possibility of a more forceful human
rights agenda. While Arendt agreed that it was "right and proper" for Israel to try 
Eichmann, she also endorsed her mentor Karl Jaspers’ proposal of an international 
legal forum (Arendt 1977:269—277). It was Arendt's (275-76) fear of genocide’s 
recurrence that grounded her position: 


If genocide is an actual possibility of the future, then no people on earth 
— least of all, of course, the Jewish people, in Israel or elsewhere — can 
feel reasonably sure of its continued existence without the help and the 
protection of international law. Success or failure in dealing with the 
hitherto unprecedented can lie only in the extent to which this dealing may 
serve as a valid precedent on the road to international penal law. (273) 


Arendt reveals in this passage her appreciation of social and legal precedents and 
practices — more specifically, of both the creative and the repetitive features of 
law — which constitute crucial links to the concepts of habitus, doxa and fields 
of practice in Bourdieu's work that we introduce below. 

Meanwhile, Austin Turk anticipated the shifting power relationships that,
with the end of the Cold War, would revive a more universal jurisdiction briefly
anticipated at Nuremberg:

Specifically, the needs to secure capital investments, open up new
investment and trade opportunities, protect or improve military positions,
and respond to the pressures of internal and external politics have led not
only to international cooperation and conflict but also to “interference”
by some authorities in the “internal affairs” of others. As such pressures 
increase, so will interference... [P]olitical policing will be organized more 
and more on a multinational if not international basis. (1982:207) 


What Turk termed “political policing” is exactly the business of international
criminal tribunals, such as the ICTY, and their “interference in internal affairs,”
which reached a near-term peak with the transfer of Slobodan Milosevic to the
ICTY in The Hague. The transfer of Milosevic to an international court is the
kind of move Arendt argued was needed to confront future crimes against humanity.

The establishment of the ICTY and the arrest and transfer of Milosevic are 
thus explained broadly by existing accounts. Yet this leaves us knowing more 
about the origin than about the operation and institutionalization of this body 
of law. The concept of ‘relative autonomy’ provides little help, because while it 
acknowledges that law sometimes can operate apart from class and power, it tells 
us little or nothing about how, in doing so, law gains its own distinctive force 
(Hunt 1993). Pierre Bourdieu (1987) offers an empirically grounded approach 
to this problem with his reassertion of an influential “force of law” that is more 
than simply relatively autonomous. Bourdieu explains this legal force with his 
“sociology of the juridical field.” (See also Dezalay and Garth 1996.) 


III. Bourdieu and the Force Field of Juridical Practice 


Bourdieu's concept of the field locates actors within a structured set of relations 
and practices. The relations orient “the strategies which the occupants of the 
different positions implement in their struggles to defend or improve their 
positions” (Bourdieu 1993:30). Thus Bourdieu describes the literary or artistic 
field as based on objective relations, “a field of positions and a field of position- 
takings” (Bourdieu 1993:34). The juridical field is similarly a social space in which 
conflicts are translated into recognized forms, and in which agents accept (and 
thereby produce) the authority of legal expertise in resolving disputes (Bourdieu 
1987:830—32). The focus is on the objective relations among the positions and 
actions of those who produce legal texts and outcomes (Bourdieu 1993:33). 

In this way, Bourdieu agrees that legal texts (or other symbolic objects) 
bring specific possibilities to a field — but he does not privilege these texts 
as carrying their own autonomy, apart from the formal positions of actors 
in the field (Bourdieu 1993:32-34). The force of law, instead, derives from its 
enactment and practice. It is the struggle over the interpretation and direction 
of these texts, combined with law's authority to name orthodox principles and 
its rhetorical claim to universality, that constitutes law as a field with internally 
driven practices. 

These internal practices that ground the force of law are rooted in the habitus 
of the juridical field: the generally repetitive and sanctioned — and sometimes 
innovative — ways of knowing and doing law that form a force field for the
production of decisions and other effects. This force field involves struggle and 
competition within and between groups of actors who mostly work within a 
legal sphere that at least partially insulates their operations. The habitus, then, 
functions as “a matrix of perceptions, appreciations, and actions" (Bourdieu 
1977:83) — the "systems of dispositions" — (Bourdieu 1993:71; Bourdieu and 
Wacquant 1992:105) through which individual agents act. When combined with 
a tacit public grant of legitimacy to the legal order (Bourdieu 1987:844), the 
juridical field can then act back upon and thus influence the world beyond its 
boundaries. 

For Bourdieu, the habitus of the juridical field involves practices which 
include elements of orthodoxy and doxa. The orthodoxy consists of substantive 
and procedural norms — in the case of the ICTY framed for the Tribunal by a 
mixture of common and civil law custom, along with treaties, conventions and 
resolutions of the UN Security Council and General Assembly. The doxa — most 
often deriving at the ICTY from Anglo-American common law but increasingly 
from European civil law — are practices that are taken as so self-evident that 
they elicit immediate and tacit consent. (See also Bourdieu 1977:164—70.) When 
legal practices achieve their full effect, then, it is often through their operation 
as doxa — thereby becoming self-evident and uncontested. 

Not surprisingly, Bourdieu here draws closely on Weber's emphasis on the 
formal rationality of state law. For Bourdieu (1987:833), any predictability of state 
law (that Weber sought to understand through his analysis of legal rationality) is 
itself shaped by socialized actors, using formalized texts. In Bourdieu's terms, law's
predictability and rationality stem from the repeated practices derived through 
legal studies and legal practice; and in this way a "legal habitus" is formed that 
shapes decision-making. Bourdieu also builds on Weber in acknowledging that 
law's rationalization itself carries important symbolic effects. Bourdieu (849—50) 
explains that this is precisely because legal actors can count on the norms of 
the legal game, for which they have the requisite knowledge and manners to 
develop successfully. In this way Bourdieu joins with Weber in giving attention 
to the promotion of legal knowledge, which emphasizes the process through 
which social conflicts are translated and formalized on legal terrains while also 
focusing on how legality is produced within the hierarchical constraints of the 
profession. (See also Dezalay 1986.) 

Bourdieu's theory of the juridical field further allows for, and anticipates, 
individual innovation. (See also Bourdieu 1977:95.) In extreme moments, when 
the iterative function of habitus is disrupted, individuals may initiate strategic 
change (Wacquant 1989:44). Yet even on a daily basis, actors will innovate 
in seeking advantage within the field. For instance, in the legal context, the 
courtroom work group is made up of individuals who are well socialized to both 
strategically work within and improvise beyond the previously understood rules 
of the game they are playing. In this way, predisposed and socially located actors
— conceiving of the field through their habitus and structured dispositions 
while innovating in their practice — are what allows the emergence, evolution 
and reproduction of law. In Dezalay's terms (1986:104—5), it is often these very 
innovations that are then drawn into the legal field to maintain its legitimacy, so 
that law maintains a process of “continuous re-creation” through which the "quasi- 
heretical avant-gardes” — those who seek change on legal terrains and thereby 
continue to grant faith and belief in legality — gain legitimacy in the field. 

In their transnational research on international commercial arbitration 
and rule of law reforms in Latin America, Dezalay and Garth (1996, 2002)
elaborate Bourdieu's framework to demonstrate the importance — even in 
these international legal fields — of national predispositions, contacts, forms 
of capital and career paths. As they demonstrate, internationalization allows 
for the development of new legal regimes while also providing a forum in 
which individuals can pursue opportunities that promote their own national 
careers. Both the exportation and importation of law can provide individuals 
with competitive advantages, and the globalization of law thereby provides a 
multiplication of opportunities for reproducing and gaining power and prestige 
(Dezalay and Garth 2001:372n1). 

The accounts that Dezalay and Garth provide explain the substantive 
development of a field of law as an effect of the opportunities that are opened for 
participants in the legal field, and of the objective positions that individuals hold 
and take within it. Combined with attention to how substantive developments in 
a field can attain institutionalized force, this can provide sociological purchase 
on Arendt's concern — namely, how legal and political practices can become 
available in response to future atrocities. Our analysis takes up this question by
drawing on a Bourdieuian approach to explain the emergence and development of 
international criminal law at the ICTY. We focus most closely on the prosecutorial 
teams, which we call prosecutorial regimes, and on the courtroom playing field 
and its judges during the period of the ICTY's rapid growth and development 
over the past decade. 


IV. Applying Bourdieu's Framework to the ICTY 


Our analysis thus emphasizes the strategic competition within and between 
four investigative and prosecutorial regimes that have established the force and 
influence of the ICTY. This institutional analysis includes the regime of the 
ambitious Chair of the Commission of Experts Cherif Bassiouni, who headed a 
UN-mandated investigative body that presaged the ICTY; a second regime whose 
chief prosecutor, Richard Goldstone, launched the Tribunal's first inauspicious 
prosecutions; the successor regime led by the innovative first female Chief 
Prosecutor, Louise Arbour, who indicted Slobodan Milosevic for crimes against 
humanity; and her bombastic successor, Carla Del Ponte, who reorganized the
ICTY prosecutorial office and brought Milosevic to trial for genocide as well as 
for his crimes against humanity. Our argument is that the driving force in the 
struggle that has established the ICTY as a new juridical field is the strategic 
competition within and between these prosecutorial regimes and the judges 
who hear the cases tried at the ICTY. As with Dezalay and Garth's findings 
(1996, 2002), this competition is motivated by the different national resources 
and practices that each prosecutor has brought to bear, so that the competitions 
themselves should be understood as reflections of national reproduction in an 
international setting. 

Each of the above prosecutorial regimes built and advanced its institutional 
capital in unique ways, sometimes in an implicitly retrospective but path 
dependent competition with each other, as well as prospectively in conflict with
the judges who heard their cases, such as the initial Tadic case,¹ the Karadzic-
Mladic Rule 61 hearing,² the Srebrenica genocide trial,³ the Foca rape case⁴ and
the Milosevic trial.⁵ Although formally divided into their separate domains at the
ICTY, the prosecutors and judges are the paid and internally housed employees 
who drive this institution. 

In contrast, the defense counsel and defendants are outsiders to the court 
(cf. Blumberg 1967) and often are unfamiliar with the habitus of this hybrid 
common and civil law venue. Their participation ranges from the arrogance of 
Slobodan Milosevic acting as his own defense counsel to the uncertainty and 
timidity of lawyers from the Balkans who are paid but not housed by the UN to 
act on behalf of defendants. As research similarly suggests in the American courts 
(Flemming et al. 1992), we will see in the following account that it is more often 
the ICTY judges than the defense counsel who rein in and limit the initiatives of 
the prosecutors, often denying and redirecting their efforts. The law gains its force 
and the juridical field advances out of this type of Bourdieuian competition for 
legal control, so that the force of law does not derive simply from case precedent, 
but from an interlocking and cumulative set of practices more generally. 

For example, below we first observe an evolution from the investigatory 
capacity to amass largely anonymous, hearsay evidence of atrocities, to the 
systematic exhumation of mass graves and the collection of documentary evidence 
through search and seizure operations. We then describe a sequence in which the 
Tribunal augmented its path-breaking development of a coercive arrest capacity 
through NATO, with the addition of a cooperative use of financial inducements 
by the World Bank and other international actors to undertake arrests and 
transfers of defendants using more peaceful means on its behalf. We further 
observe an expansion of charging options that builds from a narrow naming of 
crimes against humanity to a broadened application that includes rape and sexual 
enslavement and to a further evolution in applying the charge of genocide. The 
available sources of legal efficacy multiply at each turn, and we argue that the
force of international law available to the prosecution at the end of this process
was geometrically greater than at the beginning, with a proportionate increase in
cumulative force.


V. The Research Setting 


The ICTY has a sweeping UN mandate to prosecute persons responsible for 
serious violations of international humanitarian and criminal law committed on 
the territory of the former Yugoslavia since 1991. As a result, it is a secretive and
security-conscious institution where some of the allegedly most prolific murderers
and torturers of the last half century are brought to trial. The focus of this study
is on the Tribunal's office of the prosecutor (OTP) and the ICTY courtrooms 
where the prosecution's cases are brought under judicial and defense scrutiny. 

This study is based on more than 100 tape-recorded and transcribed interviews 
conducted in the OTP from 1999 to 2003 and on courtroom observations of ICTY 
trials and resulting trial transcripts for this same period. As a result, this study is 
part of two reinvigorated ethnographic traditions, one focusing on courts and the 
other on efforts to promote the rule of law more generally. (See Cicourel 1968; 
Emerson 1969; Feeley 1979; Sudnow 1965 to Latour 2002; Riles 2000; Yngvesson 
1993.) The goal is to explain, from the inside-out, how the ICTY has developed 
a set of practices that allow it to operate with relative autonomy and influence 
that extends beyond its own newly defined institutional sphere. 

The first author was present at the ICTY for periods of one week to a month, 
every second to third month, with courtroom observations filling in the periods 
between scheduled interviews. The interviews are built around an approximate 
one-third sample of OTP employees that is representative in gender, age and 
national origin. The research included repeated interviews with the chair of the 
Commission of Experts that preceded the ICTY, all three ICTY chief prosecutors, 
the deputy prosecutor, four chiefs of investigations and prosecutions, and 
employees at all levels. These include lawyers and investigators in the largest 
numbers, but also data and case managers, crime and military analysts, clerical 
assistants and several demographers and historians. Public figures at the Tribunal 
are quoted by name. 

This study of the ICTY began by focusing on the Commission of Experts that 
first sought to establish an evidentiary foundation for the ICTY as the forum 
from which to build a new juridical field. As Bourdieu would predict, this was a 
period of pitched conflict between powerful individuals and positions. The still 
nascent field of international criminal law confronted the preliminary necessity 
of resisting opposing forces that sought to block its first steps. 
VI. Experts on Atrocity


In the wake of the Balkan crisis, a struggle emerged between law and political 
diplomacy — a struggle over which would become the paradigmatic discourse 
in responding to atrocities — and this became the core contest from which two 
central groups emerged. One group was led by the British Lord David Owen and 
the American Cyrus Vance, and struggled persistently to broker various Balkan 
peace plans in the early 1990s. Meanwhile, women's and human rights groups 
documented the appalling carnage that mounted in the wake of these peace plans, 
and worked with journalists to expose shocking concentration camp conditions 
and accompanying massacres and rapes. The latter groups insisted the Serbs 
were the more deadly aggressors in the Balkans and that legal culpability must 
be imposed by an international tribunal. 

Dezalay and Garth (2002) argue that this Bourdieuian competition between 
political power brokers and advocates of legal humanitarian approaches is the
foundation for a “transnational legal field” that joins the agency of human rights 
activism with international humanitarian law. Groups within this field compete 
for media attention, funding and political influence — and in the process, law 
provides a forum for these groups to define their interests, positions and causes. 
(See Espeland 1998.) 

The efforts of the human rights activists forced the politicians to acknowledge 
their failed policy of “moral equivalency” and its unfounded claims, namely 
that all parties — Bosnian, Croat and Serb — were equally responsible for 
the atrocities (R#165a:02/01/01).⁶ This battle, nonetheless, persisted in the
UN Legal Affairs office, where Owen and his British compatriots continued to 
resist legal interventions they believed would undercut their peace negotiations 
(R#106:09/26/00). Ultimately, Lord Owen was reduced to defending his initiatives 
against charges of appeasement (Binder 1993; Guest 1995:104). This conflict 
took a further turn when, in October 1992, the UN Security Council passed 
Resolution 780, creating a Commission of Experts to gather evidence in support 
of establishing a tribunal for the former Yugoslavia. 

Yet the struggle between law and political diplomacy — over which discursive 
paradigm would gain continuing control of the process — persisted. The legal 
route of a Commission of Experts was placed in the leadership hands of a retired 
Dutch legal academic, Frits Kalshoven. Kalshoven was picked as the first chair of 
the commission over the Muslim-American law professor, Cherif Bassiouni (Guest 
1995:57). Bassiouni had invested much of his career in developing the doctrinal 
bases for an international criminal law, while simultaneously advocating the 
case for international criminal law to prosecute perpetrators of atrocities, rather 
than defer to diplomatic remedies (Sula 1999). Meanwhile, even if Kalshoven 
had wished to pursue evidence of atrocities, he was given no budget by the UN 
with which to work, and the British (who continued to promote a diplomatic 
approach) went on to covertly stall the commission’s efforts (Gutman 1993). This 
included indications to Kalshoven that the commission should not pursue high-
level Serbian officials, including Milosevic. In these ways, a discursive legal form 
— the UN Resolution establishing a Commission of Experts to gather evidence 
— was captured and filled with a competing content (in this case, political 
diplomacy). International criminal law may have been advancing formally, but 
it continued to be informed by a logic of political diplomacy and an emphasis 
on moral equivalency. 

Bassiouni was initially appointed as a voluntary member of the commission 
and believed that the appointment of Kalshoven and the withholding of resources 
were efforts to stall the commission's work. He nonetheless succeeded on his own 
in raising a million dollars in funding from the MacArthur and Soros foundations, 
both of which were heavily invested in the support of NGOs and the human 
rights paradigm. (See Dezalay and Garth 2000; Keck and Sikkink 1998.) Bassiouni 
persuaded Chicago's DePaul University, which was itself interested in developing 
a distinctive human rights law specialization, to donate a half floor of law school 
space for the collection of evidence of Balkan war crimes. Bassiouni described this 
work as a “dragnet” for evidence that could build a prima facie case of war crimes 
(R#106:09/26/00) and for UN establishment of a tribunal (R#108:07/19/00). 
This soon became a resource base for a preliminary force field that could drive 
forward the cause of international criminal law while simultaneously fighting 
off its political opposition.⁷

This struggle between opposing modalities took an unexpected turn when 
evidence of British stalling at the UN became public. A reporter investigating the 
commission discovered — from both the UN’s Office of Legal Affairs and the 
U.S. Ambassador to the UN in Geneva — that Lord Owen was the source of the 
instructions to sidetrack the commission’s efforts (Gutman 1993). 

The story embarrassed the UN into allowing the commission to get 
investigations underway. At this point, Kalshoven, himself frustrated and fatigued, 
resigned and Bassiouni finally was named chair of the commission. Working 
out of DePaul’s law school and moving along the doctrinal academic lines of 
international criminal law that he had advocated throughout his career, Bassiouni 
adopted an explicitly legal framework, arguing that the computerized database he 
had developed contained nearly 6,000 “cases” of putative war crimes violations. 
The focus on case-based evidence was an obvious appeal to the orthodoxy of 
conventional court work — precisely the work that Bourdieu associates with 
the juridical field. The information in the archive had a corroborative weight 
that could be used to establish patterns of violations. For example, it could be 
concluded convincingly that “the data base contains substantially more allegations 
of violations committed by Serbian and Bosnian Serb forces against Bosnian 
Muslim civilians than by or against any other ethnic or religious group” (Bassiouni 
1994:297). This conclusion was a conspicuous and compelling challenge to the 
previously orthodox UN presumption of moral equivalency (R#106:09/26/00). 
Further UN resolutions establishing the ICTY were passed in the spring of 1993,
while the commission was told to continue its work into 1994. 

In March of 1994 Bassiouni made a provocative proposal to the U.S. State 
Department that would have raised the political and legal stakes by helping the 
Tribunal prepare a case for the legal prosecution of three Bosnian Serb generals 
for the siege of Sarajevo: 


I could tell you who the commanding general was over the three generals 
who were in command of the Sarajevo Romanija First Corps who 
surrounded Sarajevo.... I could document that in the three years of the 
siege, on a daily basis, how many rounds of artillery and mortar fell, how 
many sniper shots were fired, how many people were injured or wounded. 
If I can show you also the targeting of civilians, the recurrence of the 
targets, and even the timing of it, then I have made a case for command 
responsibility for that general.... Now that suddenly gives a whole different 
dimension than the individual person who was targeting somebody. That 
puts the commanding general at risk, and if you see that the commanding 
general... was a member of what would be the equivalent to their Joint 
Chiefs of Staff, under Mladic,... and that the supplies and artillery shells 
came directly from Serbia, and so on and so forth, you’ve got a damned
good case of command responsibility. (R#106:09/26/00) 


Bassiouni, however, never received a response to this proposal. 

This struggle between law and political diplomacy — and the continuation of 
a hybrid diplomatic frame within the apparently legal form of the commission 
and forthcoming tribunal — continued when Bassiouni's effort to become the 
first ICTY chief prosecutor was rejected by the UN Security Council. Bassiouni 
speculated that this defeat and his proposals to extend the work of the commission 
by developing a major Sarajevo case were scuttled because he had aimed too high, 
and that this again disturbed the British: 


From the beginning I said I was not interested in going after the little 
soldier who commits the individual crime. I was after building a case 
against the leaders who make the decisions. So I was going to establish 
that there was ethnic cleansing as a policy, that there was systematic rape 
as a policy, that there was destruction of cultural property as a policy, 
that the destruction of Sarajevo was a systematic process. What I didn't 
realize was that this was precisely what the British, and to some extent 
the French and the Russians, did not want.... [T]hat was not the political 
reality (quoted in Sula 1999:26). 


The London Times agreed, noting that, while the official reason given for rejecting 
Bassiouni was lack of actual administrative experience as a prosecutor, “diplomatic 
sources said the real reason is that the European countries are afraid Dr. Bassiouni 
will move too quickly to charge Serb and possibly Croat leaders with war crimes” 
(Bone 1993). Bassiouni was forced to settle for producing a report that could
provide a political springboard for the Tribunal. 

The commission's Final Report is backed by more than 65,000 pages of 
documents and 300 hours of videotaped testimony and news footage, as well as a 
3,300-page, five-volume appendix supporting its conclusions (Final Report of the 
Commission of Experts 1994). All of this material was placed in a cargo container 
and shipped to The Hague Tribunal in the spring of 1994 (R#106:09/26/00). 
Little of this evidence would become part of the formal case law of the Tribunal. 
Yet, as we discuss below, this legal paradigm would eventually win out over 
political diplomacy and moral equivalency even by losing in the short run. By 
accumulating evidence of atrocities, the commission gave a preliminary force of 
law to the work of the Tribunal, and helped to establish this nascent institution 
and the resurgent field of international criminal law that was potentially capable 
of addressing — even if in the short term, unable to address — the still unfolding 
military and political events in the Balkans. 


VII. The Virtual Tribunal 


It took the UN Security Council more than a year to agree on the appointment 
of Justice Richard Goldstone of South Africa as the Tribunal’s first chief prosecutor in 1994. 
Although held in high esteem, Goldstone’s own legal practice was in commercial 
law, and he had no experience as a prosecutor or in criminal law generally 
(R#134a:12/06/00). Yet his symbolic capital was impeccable: he had just finished 
as chair of the South African Commission that exposed high-level, right-wing, 
government-instigated police violence. Goldstone’s appointment could thereby 
accommodate political interests. Unlike others vying for the position, he was 
experienced in diplomacy and came recommended for the position by Nelson 
Mandela — who had himself just been inaugurated in May 1994 as President 
of South Africa. Britain and France swiftly agreed to Goldstone’s appointment 
(Scharf 1997). 

From the beginning, Goldstone recognized that his advantage lay in his 
understanding and dexterity within the diplomatic rather than the legal field. An 
early interviewer at the Tribunal remarked that “upon arriving in The Hague in 
mid-August of 1994, Goldstone recognized that his was a big-picture diplomatic 
role and that the hands-on prosecution work could be pushed down” (Horne 
1995:7). 

Goldstone left most of the office responsibilities to Graham Blewitt, his deputy 
prosecutor (R#111a:12/14/99), while he traveled incessantly and worked the 
media, beginning on his first day in The Hague with an interview by Mike Wallace 
for 60 Minutes. In a signal of trouble to come, Wallace asked how Goldstone was 
going to bring the Balkan war criminals to justice, saying, “[Y]ou’ve got to get 
them here on trial” (CBS News 1994). 
As Bassiouni found before him, Goldstone sensed that in the beginning the 
UN did not seem to want him to do much (R#134b:04/24/02). As Bassiouni 
explained, “the green light had not been given” and “the amber light was on,” by 
which he meant that the UN was continuing to give priority to peace negotiations 
(R#106:09/26/00). As time went on, however, a courtroom was constructed and 
detention facilities were prepared. Now the UN budget committee was suddenly 
impatient: “I was advised ‘you have to have an indictment out, otherwise you're 
not going to get any money’...” (R#134a:12/06/00). Meanwhile, several members 
of Goldstone’s staff were working through the report from the Commission of 
Experts, which featured horrific accounts of the participation of an individual 
named Dusko Tadic in camp killings and rapes (R#177:06/28/99). Yet there was 
no apparent evidence that Tadic was an instigator or planner of these crimes 
(R#165a:02/01/01). 

Where Bassiouni had wanted to start with the siege of Sarajevo, Goldstone 
chose instead to begin with Dusko Tadic. Tadic, who had fled from Bosnia, was 
picked up by the German police and charged under a unique German provision 
for the domestic prosecution of genocide. Goldstone reasoned that it was a case of 
Hobson’s choice: “We had an empty prison. There was a great deal of frustration. 
The judges were frustrated ... I didn’t mull it over. It seemed an obvious thing 
to go get him” (R#134b:04/24/02). Yet Tadic was not a leadership figure and 
Goldstone was taking him from a national jurisdiction with a uniquely suited 
genocide statute (R#174b:07/02/02). The decision to focus on Tadic signaled to 
some that he lacked a sense of the field of international law he was attempting 
to lead (R#114:08/16/02). 

Goldstone made a huge organizational commitment to the Tadic case, 
assigning a team of 20 OTP employees to its pre-trial investigation in the fall of 
1994 (R#196:07/07/00). Beginning with this case, Goldstone formed a policy of 
relying on what became known in the OTP as the Queen’s Counsel model of 
investigations (R#176c:09/16/02). This meant assigning nearly full responsibility 
for the development of cases to autonomous field investigators in a reactive 
mode that kept lawyers and prosecutors of the Tribunal at arm’s length from the 
collection of case evidence (R#144:07/10/00). 

Two investigations that were exceptions to this Queen’s Counsel model 
were the Foca rape case and the inquiry into the Srebrenica massacre. These 
investigations were run by legally trained Europeans in the tradition of the 
continental civil law that would later emerge in a struggle to reorganize the OTP. 
Neither the Foca nor Srebrenica teams had more than a few members, and they 
competed unsuccessfully for investigative resources (R#169:07/20/01). 

Perhaps also because the Srebrenica massacre was so massive in scale, involving 
the killing of some 7,000 Muslim men, the investigation necessitated the formation 
of unique external alliances beyond the Tribunal (R#179:05/07/01). This made 
Goldstone’s diplomatic contacts relevant again. For example, the Srebrenica 
investigation first involved gaining authorized use of Central Intelligence 
Agency (CIA) aerial imagery to identify mass grave sites for exhumation 
(R#173a:02/01/01). The Tribunal invoked its UN statutory authority — articulated 
by the Trial Chamber as “a judicial authority which combines elements of 
common and civil law traditions” and which “will naturally create procedures 
for which there is no analogue”* — to justify entering Bosnian Serb territory. 
Specifically, Article 18 of the Tribunal Statute states that the prosecutor “shall 
have the power . . . to conduct on-site investigations” and in so doing “may, as 
appropriate, seek the assistance of the State authorities concerned.” 

Since the Bosnian Serb authorities were unwilling to facilitate the exhumations, 
Goldstone and Blewitt turned to the U.S. State Department for the state assistance 
they needed (R#156:01/30/01). During negotiations of the Dayton Accords, they 
were able to convince Richard Holbrooke to place pressure on Slobodan Milosevic 
to support trips by an Assistant Secretary of State, John Shattuck, into the area 
around Srebrenica (R#179:05/07/01). As Holbrooke (1999:261) has reported, “A 
few days later, this request produced a strange sight: Milosevic’s special military 
security forces escorting Shattuck into Banja Luka, which no American official 
had visited in several years, as he sought access to mass grave sites of massacres 
committed by Serbs.” This opened the door for the Tribunal’s exhumation work. 
Once many of the victims’ bodies were exhumed, the Tribunal further claimed 
search and seizure powers under the Accords to collect documentary evidence 
from Bosnian Serb military headquarters about the massacre (R#130b:05/07/01). 
Invoking these early investigatory powers and practices was essential in establishing 
the evidentiary base for the Srebrenica case. 

Yet these powers were not developed and exercised easily. For example, just 
as Richard Holbrooke was arriving in Sarajevo to engage in shuttle diplomacy 
associated with the recently signed 1995 Dayton Accords, two senior Bosnian Serb 
officers made a wrong turn into Bosnian controlled territory outside of Sarajevo, 
where they were taken into custody by the Bosnian army. 

Holbrooke reasoned that the two officers, who were traveling in a civilian car, 
had been apprehended in a manner that violated the free movement provisions 
of the Dayton Accords, and he maintained that normally he would have insisted 
that the Muslims release them immediately: “But Justice Goldstone complicated 
matters considerably; from the International War Crimes Tribunal in The Hague, 
he issued a warrant for the two men — even though they had not been indicted” 
(Holbrooke 1999:332). Holbrooke cooperated in transferring the two men to 
the Tribunal, but within two months both were released (R#111c:02/01/01). He 
concluded that “the seizure of the two men, neither of whom was ever indicted, 
had disrupted the implementation process and set a bad precedent for the future” 
(Holbrooke 1999:332). Once more, Goldstone's lack of knowledge of criminal 
law practices had resulted in loss of face with a potential ally. 

The Tadic trial also went poorly when key witnesses developed by errant 
investigators proved unavailable, unreliable or unsuitable. First, just as the case 
went to trial the rape charge against Tadic was abruptly dropped. The victim, 
who was placed in a witness protection program, became too frightened to testify 
(R#165a:02/01/01). The prosecution ultimately was able to prove to the court's 
satisfaction that Tadic was at least present at numerous killings, and he was found 
guilty of persecution as a crime against humanity. Still, the Tadic case could 
contribute little to grounding this emerging juridical field (R#114:08/16/02). 

Finally, as Goldstone’s short two-year term at the OTP was coming to its end, 
he began to find the means to put his political capital back to work outside the 
Tribunal. In the early months of 1996, a potential star witness fell into his hands 
when Drazen Erdemovic confessed to reporters in Serbia that he had personally 
shot as many as 70 Muslim men in the Srebrenica massacre (R#111c:02/01/01). 

When Erdemovic was taken into custody in Belgrade, Goldstone insisted that 
the authorities there serve a summons ordering him to appear before the Tribunal, 
later explaining that “the reason I went public with Erdemovic was to save his life” 
(R#134b:04/24/02). This public demand combined with American government 
pressure had its desired effect, and Erdemovic was transferred to the ICTY within 
several weeks. He was soon providing vital information about the location of 
burial sites around Srebrenica, and Goldstone’s previous success in securing CIA 
and Defense Department assistance with aerial imagery began to pay off in the 
location and planning of exhumations of mass graves (R#111c:02/01/01). 

Goldstone was uncomfortable with striking a plea bargain with Erdemovic 
(R#134b:04/24/02), thinking as Justice Robert Jackson had at Nuremberg that plea 
bargaining was an affront to the dispositions of an international court (Robert 
Houghwout Jackson Oral History:1204). Nonetheless, a plea agreement evolved 
over time and was later openly endorsed by Goldstone’s successor and the judges’ 
chambers as an appropriate way of rewarding cooperation, and as a necessary 
part of the criminal justice doxa of this court. (See Bohlander 2001; Combs 
2002.) The ICTY now had a star witness around which to develop a Srebrenica 
genocide case, although the Tribunal still had no high-ranking defendant in 
custody to prosecute. 

In the summer of 1996, one year after the massacre at Srebrenica, Richard 
Goldstone’s tenure as chief prosecutor was nearly finished. The Tadic trial had 
not sustained the attention of the media, and meanwhile NATO as well as the 
U.S. and Britain were not helping the Tribunal to arrest the indicted Bosnian 
Serb leaders, Karadzic and Mladic, nor many others under indictment. The 
courtroom and prison, not to mention the judges and prosecutors, continued 
to be drastically underutilized. Goldstone decided to move ahead with a Rule 61 
hearing — a hearing designed essentially to get around the inability of the ICTY 
to apprehend defendants (R#114:08/16/02). 

The concept of an in absentia hearing is controversial because it is a variation 
on a trial in absentia, which the UN Security Council Statute creating the Tribunal 
explicitly forbade. Martin Bormann was tried in absentia at Nuremberg, and 
French law permits trials in absentia for persons charged with crimes against 
humanity. Nonetheless, Guest (1995:126) points out that “such trials are 
prohibited by the International Covenant on Civil and Political Rights.” 

Members of the prosecution staff emphasized their reservations about the 
Rule 61 procedure, arguing strongly that it was essentially a violation of the doxa 
and legal habitus of accepted criminal procedure (R#114:08/16/02). In contrast, 
Goldstone believed that a Rule 61 hearing on Karadzic and Mladic could pull 
together much that had been learned through ongoing investigations and put 
pressure on international authorities to arrest these indictees (R#134b:04/24/02). 
The seven-day hearing was convened at the end of June 1996; as in the Tadic 
case, Goldstone did not appear in court. 

Drazen Erdemovic provided dramatic testimony at the Rule 61 hearing, and 
the prosecution was able to make public some of the exhumation and other 
forms of evidence it was collecting in the Srebrenica case (R#173a:02/01/01). The 
downfall of the hearing was that it also made apparent to the public that only a 
handful of the more than 70 individuals the ICTY then had under indictment 
were in custody. The absence of defendants or their legal representatives in the 
Rule 61 proceeding gave the Tribunal a “virtual” quality. Rather than making this 
new institution seem substantive and self-evident, Goldstone risked making it 
seem artificial and contrived: a forced legal abstraction more than a functioning 
legal reality (R#103b:02/20/02). 

Yet even at this low point in its formal substantive development, the Tribunal 
was gaining some of the force of law in the world outside. The indictment and 
vilification of Radovan Karadzic at the Rule 61 hearing helped to drive him 
from elected office, leading Richard Holbrooke (1999:190) to concede that “the 
tribunal emerged as a valuable instrument of policy that allowed us, for example, 
to bar Karadzic and all other indicted war criminals from public office.” Once 
Karadzic was displaced, it became easier to enforce provisions in the Dayton 
Accords and to implement the exhumations and search and seizure operations 
described above (R#168:02/01/01). 

Goldstone further accomplished his goal of getting the institution a sustaining 
budget of about 30 million dollars a year from the UN. He developed the Tribunal 
with an emphasis on the political and the diplomatic. His own self-assessment 
confirms this: “I think I spent more of my time on diplomacy and pushing 
and talking and screaming and shouting for the Tribunal than on the simply 
prosecutorial work” (R#134b:04/24/02). 


VIII. The Real-Time Tribunal 


Goldstone recommended his successor, the Canadian Louise Arbour, and the 
UN approved her appointment in 1996. Arbour was experienced in criminal 
law and inherited an OTP staff that understood its basic problem: of 76 persons 
Goldstone had indicted, only six were in custody (R#128:02/01/01). Louise 
Arbour’s position as the Tribunal’s second prosecutor allowed her to build on 
Goldstone’s tenure; yet in doing so, she brought with her training in criminal 
law rather than international diplomacy, and legal views and habits formed in 
relation to a Canadian paradigm of human security. (See Canadian Department 
of Foreign Affairs and International Trade 1999.) This provided precisely the sort 
of internal change that international criminal law required in order to retain 
(and, in this case, develop) its legitimacy. In a Bourdieuan framework, the game 
of law is contingent on the gradual recognition and acknowledgment of new 
legally-based frameworks — and the advocates who wield them — as a means 
of “replying to the challenges” that are presented by entrenched approaches. (See 
Dezalay 1986:104.) 

A faction of idle and frustrated judges was now openly arguing for trials 
in absentia, which Arbour immediately labeled as an affront to the orthodox 
expectations of criminal justice by saying, “I didn’t go there for theater purposes.” 
She argued instead that orthodox tools of criminal law could be joined with the 
UN mandate to get arrests underway (R#103a:11/19/99). Arbour’s plan involved 
a policy based on sealed indictments — or more to the point secret indictments 
(R#103a:11/19/99). Secrecy was a crucial means of introducing surprise and 
vulnerability so that arresting authorities would be at less risk of reprisals and 
injuries (R#171:07/05/00). 

The use of secret indictments, which are common in national jurisdictions, 
was presented to the diplomatic world by Arbour as self-assured doxa. In 
contrast, Goldstone, with his orientation to the diplomatic field, had resisted 
sealed indictments as an affront to deference and transparency in international 
relations. Arbour drew on her background in criminal law to convince an 
international audience that this non-transparent technique of law enforcement 
was fully appropriate, insisting this practice was simply self-evident to those who 
know criminal law. 

Arbour and her staff sensed that the judges would support the secret 
indictment policy because it also promised to generate arrests and therefore 
trials. Arbour explained, “I was absolutely determined, arrest was so clearly the 
key, because I thought if we start getting arrests, then everybody will be busy.” 
(R#103a:11/19/99) One member of Arbour’s legal team, Clint Williamson, was 
already laying the groundwork for an early secret indictment arrest when Arbour 
arrived at the Tribunal (R#200:05/07/01). 

The arrest plan involved trapping the former mayor of Vukovar (Croatia) 
who was secretly indicted for taking part in a massacre of 200 hospital patients. 
Not knowing he was indicted, the mayor agreed to return to Vukovar to discuss 
compensation for property he owned. The Polish soldiers who met him instead 
handcuffed the shocked indictee and took him to a military base; there, a Belgian 
plane then flew him to the ICTY detention center (R#200:05/07/01). 

If the secret indictment was orthodox criminal procedure, albeit perhaps not 
in the international context, the ensuing arrest was clearly unorthodox. Although 
the ICTY has no arrest authority, The Observer (Doyle 1997) reported that “a 
snatch squad sent by the International War Crimes Tribunal has captured the 
former Mayor of Vukovar,” and the International Herald Tribune (Reuters 1997) 
observed that “it was the Tribunal, rather than the UN peace-keepers that carried 
out the arrest.” President Bill Clinton (1997) issued a statement through his 
press secretary saying, “I congratulate the ICTY and UNTAES on their successful 
apprehension.” 

An ICTY investigator offered the following explanation: “Yeah, well arrest is 
a technical term... We were present at the time that he was arrested and then 
we took over custody of him because, of course, we had no powers of arrest.” 
(R#122:07/20/00) Arbour later added in an interview that “the plot, if I can put 
it this way, wasn’t dead easy to execute, but it was very clean, and we knew that 
legally we had a very good chance of making it all stand.” (R#103b:12/06/00) In 
this way, her unorthodox sealed indictment strategy became doxa. 

Having shown how easily an arrest could be accomplished, Arbour could 
now push the well-armed and authorized NATO troops to begin using the secret 
indictments to make arrests (R#200:05/07/01). British-led Stabilization Force 
(SFOR) troops were the first to make a subsequent arrest (ICTY Press Release 
CC/PIO/225-E 1997), followed by Dutch-, U.S.-, and later French-led troops. 

Arbour and her advisers used the evolving experience in the field as a further 
strategic opportunity to change the formal procedural rules of the Tribunal. 
This simultaneously disempowered local Serbian authorities (by removing their 
protective shield of sovereign immunity) and empowered NATO forces. For 
example, “the rules that made it look as if you could only send a warrant to a 
state, so that Republika Srpska eventually had to do the arrest, were changed 
so that warrants no longer have to be addressed to states, so that they could be 
addressed to the person on the envelope.” (R#174b:07/02/02) NATO SFOR troops 
could now more directly take on the arrest role that the local Serbian authorities 
in this region would not. “We tried to take out all the rules that were limiting our 
flexibility in arrests, and we pushed these proposals through plenary sessions of 
the ICTY ... to put the legal framework in place so the secret indictment process 
would run through consistently and... keep the local authorities out of it.” 
(R#174b:07/02/02) These practices were now official ICTY orthodoxy. 

The arrestees soon began arriving at a more stable rate of about one or two a 
month, including the Croatian political leader Dario Kordic (ICTY Press Release 
CC/PIO/246-E 1997). The Croatian government was showing new signs of 
cooperation in exchange for foreign-aid money (R#103b:02/20/02). Arbour began 
insisting in her contacts with diplomatic representatives of the more affluent 
Western nations that the financial assistance they were providing in the Balkans 
should be made contingent on cooperation with the ICTY (R#103b:02/20/02). 
These efforts set a precedent for using financial leverage by getting Croatia to 
transfer indictees in conjunction with its acceptance into the Council of Europe 
and receipt of monetary aid. This quid pro quo became a new kind of foreign 
relations doxa. 

By June 1998 about 30 defendants were under detention in the Tribunal's 24-unit 
facility that just a year before was nearly empty. The result was that the ICTY 
was now under-budgeted, encouraging Arbour to initiate a campaign for increased 
funding. This required that Arbour shift her position within the legal field. Rather 
than maintain the posture of a discreet and presumably impartial judge, Arbour 
soon took up the role of a publicly available and outspoken prosecutor. 

“It took me a while,” Arbour observed, “but eventually I started appreciating 
that the fundamental premises of criminal law, traditionally, were that it’s public 
and local... When I applied that internationally it became apparent that the 
only way we could even remotely be public and local was through the use of the 
international media. We had to be out there.” (R#103a:11/19/99) Arbour (1999) 
made her case to the public by emphasizing three points: (1) that the extraterritorial 
authority of the Tribunal trumped claims of sovereign immunity, (2) 
that the Tribunal’s demeanor was appropriately coercive and intrusive rather than 
deferential and diplomatic, and (3) that the Tribunal was an agency of real-time 
law enforcement rather than an after-the-fact institution of historical record. 

Having developed a field-based and real-time approach to investigations 
meant that, by late 1998, ICTY staff members were obliged to investigate the 
highly visible crime scene that Kosovo was in the process of becoming. Thus 
when 45 people were found dead near the Macedonian border in the Kosovo 
town of Racak in January of 1999, Arbour was obliged to react. (See also ICTY 
Press Release CC/PIU/379-E 1999.) She was accompanied by a team of three 
investigators, two bodyguards and one legal adviser as she left to enter a war in 
progress (R#103a:11/19/99). This was the beginning of the Tribunal’s role as a 
real-time institution acting on unfolding events that it was trying to influence 
— if not control. 

The team Arbour led was turned back at the border. This was followed by three 
futile days of negotiations involving NATO General Wesley Clark (2001:161) with 
Milosevic in Belgrade. Arbour returned to The Hague ready to resign in defeat 
until she encountered a completely different reality focused around a picture of 
her in the New York Times “facing down” the border guards: 


When I came back to The Hague I realized that what I had lived, and 
that what in reality was a failure, had been lived differently in The Hague, 
because they weren't there, they were seeing it only in the press, as the 
greatest success story of the Tribunal... What became the conventional 
wisdom of the reality of those three days was not what I lived but what 
the Tribunal, the bad guys, NATO, the politicians, the world, my mom, 
everybody in the world perceived — which was that I had single-handedly 
put the Tribunal on the map.... I went to the border and I didn't get in. 
But the spin, the images — as my assistant put it, “we made the New York 
Times,” she said, “above the fold!” (R#103b:02/20/02) 
Her assistant was referring to the picture of Arbour in a yellow flak jacket, 
“above the fold” on page one of the Times (Perlez 1999). Her three themes had 
been captured in one image: that of a criminal prosecutor from the ICTY being 
turned away from a real-time crime scene by a sitting head of state using claims 
of national sovereignty to conceal evidence in a transparent concession of guilt 
(R#171:07/03/00). 

It still took until the end of the NATO bombing for the ICTY to get its 
investigators into Kosovo, but then they were able to go in side-by-side with 
the entering troops. They were armed with leads from interviews in the refugee 
camps to begin building an indictment of Slobodan Milosevic for crimes against 
humanity. Once the indictment was begun in late March, it took 52 days to write. 
It was presented for judicial confirmation on May 22, 1999 (R#165a:02/01/01). 

Arbour next reapplied the doxa of secrecy to have this indictment sealed 
for four days, saying she wanted to protect a humanitarian mission that was 
underway in Belgrade. The actual reason was to make it more difficult for others 
to interfere with the indictment decision. Her public relations spokesman later 
reiterated that “she didn’t want to give them the chance to co-opt or slow down 
the indictment, because that would really get at the heart of her independence 
as a prosecutor. So she was very careful to have Judge Hunt confirm it and then 
hold it under wraps, under seal — as we say — for four days, and in that period 
she made phone calls to each of the foreign ministers.” (R#171:07/03/00) 

The Clinton administration was ambivalent about the indictment, with 
voices for the politics of diplomacy yet again arguing that Milosevic could be 
manipulated to strategic advantage in the peace-making process, perhaps through 
the promise of amnesty from prosecution. An official in the administration in an 
interview for this study quoted even Madeleine Albright, the Tribunal’s strongest 
advocate in the cabinet, asking rhetorically, “Is this exactly the right time to indict 
this fellow?” (R#179:05/07/01) Wesley Clark (2001:325) also reported that right 
up to the May 27, 1999, announcement of the Milosevic indictment, “[S]ome in 
the Pentagon and the White House were unhappy about this.” Albright's question 
was rhetorical, of course, because the indictment was already done. By initiating 
the indictment, Arbour in effect “called the bluff” of the American commitment 
to international humanitarian law and the Tribunal. 

Arbour maintains that in the end it was getting the arrests started that was the 
strategic turning point. She put it this way: “Arrest was the issue and it was clear 
that if we got people arrested then everybody inside the Tribunal would be busy.” 
(R#103b:02/20/02) She is adamant about the independence she believes the ICTY 
achieved, saying with regard to the Milosevic indictment, “Many political men 
believed that I would not be so presumptuous, that I would consult them. But 
I gave myself one bit of advice: never ask for something that can be refused. So, 
I said nothing.” (Trean 2002) This explicit claim of independence, articulated 
here as a tension between law and diplomacy (“political men” in Arbour’s terms), 
is evidence of what, in Bourdieu's model, provides the legal field with its 
privilege and autonomy, ensuring that the game is played on "law's" terrain. 
Appointed to the Supreme Court of Canada, Arbour left the ICTY soon after 
the Milosevic indictment in the summer of 1999, one year short of the end of 
her term. She had tripled the Tribunal budget in three years, multiple trials 
were now simultaneously underway, and important arrests also had been made 
in the Foca and Srebrenica cases. 


IX. “The Problem from Hell” 


Indicting Milosevic was one thing, getting him to The Hague was another, 
and once there, there was the issue of whether to charge him with genocide, 
otherwise known as “the problem from hell.” (Power 2002) The former Swiss 
attorney general, Carla Del Ponte, was appointed as Arbour's successor in 1999, 
becoming the first former prosecutor to serve in this role at the ICTY (ICTY Press 
Release JL/P.I.S./429-E 1999). With high profile experience in the prosecution of 
organized crime, Del Ponte's own background — in contrast to her predecessors 
at the Tribunal — lay in her experience overseeing complex legal prosecutions 
with large scale financial dimensions, and often organized through international 
organizations such as the World Trade Organization and World Bank. As 
demonstrated below, her tenure has involved high-profile trials, an internal 
reorganization of the OTP, and the coordination of transgovernmental financial 
inducements with political pressure to accomplish Tribunal goals. 

Del Ponte's first efforts were aimed at getting the Bush administration to 
exert financial pressure on the Serbs to turn over the indicted Milosevic. These 
efforts built on the earlier success of Arbour with the Croatians. Colin Powell was 
the principal transnational player who linked the funds to rebuild Yugoslavia's 
shattered economy to its cooperation with the Tribunal. He led with a U.S. pledge 
of $181 million to be committed at a donors' conference in Brussels after it was 
made certain that Milosevic would be transferred to the ICTY (Black 2001). The 
Daily Telegraph (2001) reported that "the west bribed Belgrade to hand over 
the tyrant, and the Serbian nation, weary of its pariah status, acquiesced in the 
bargain." (See also Crossette 2001.) 

John Meyer (1987) notes that when multinational corporations use monetary 
bribes and threats to manipulate states and regimes (for example, to seal sales 
and contracts), this often is seen as the height of illegitimacy, but when states and 
regimes use such tactics they usually are treated as predictable, as self-evidently 
acceptable, or as what Bourdieu calls doxa. States effectively have a strategic 
monopoly control over such inducements, with the result (Meyer 1987:65) that 
they “bribe each other routinely.” In this case, the transgovernmental sponsorship 
of the donors’ conference was used to make these monetary manipulations less 
direct and more legitimate (Ikenberry 2001). Colin Powell was following well 
established but seldom discussed political doxa that Louise Arbour had already 
endorsed and applied. Milosevic was unwillingly delivered from Belgrade to the 
Tribunal in the summer of 2001 (ICTY Press Release C.C./P.I.S./597-E 2001, 
E.H/P.I.S./598-E 2001). 

The Foca and Srebrenica trials unfolded while Milosevic was still at bay. 
The Foca case documented the role rape and sexual enslavement can play as 
instruments of terror in crimes against humanity.¹⁵ The Srebrenica case established 
that the massacre which took the lives of 7,000 Muslims constituted genocide 
for which command responsibility could be assigned.¹⁶ The legal struggles over 
these issues¹⁷ help make Bourdieu's point that it is often competition between 
contesting parties within the courtroom that produces the force of law. 

For example, as the Foca case concluded in June of 2000, the prosecution 
was confident it had established a widespread or systematic pattern of sexual 
assaults (R#162:07/25/00). The OTP nonetheless remained concerned about just 
how widespread or systematic this pattern must be to constitute a crime against 
humanity.¹⁸ The prosecution therefore asked to present an expert witness who 
had helped to prepare a report on the occurrence of rape in Bosnia. Presiding 
Judge Hunt indicated such testimony would be a form of hearsay: “What you are 
asking us to do is to accept that there was a pattern of allegations of widespread 
rape, and to jump from that into accepting that it was much more widespread..." 
(Prosecutor v. Kunarac and Kovac, Case No. IT-96-23, Trial Chamber Transcripts, 
May 29, 2000:4155-4156.) 

While Judge Hunt eventually ruled that this expert testimony could be 
introduced to provide historical context, the prosecution the following day 
declined “out of an abundance of caution” to do so (Prosecutor v. Kunarac and 
Kovac, Case No. IT-96-23, Trial Chamber Transcripts, May 30, 2000:4177). The 
prosecution decided overnight that the judges were in effect saying, “They had 
heard enough.” (R#176c:09/16/02) Judge Hunt responded to the prosecution 
decision with the advice that the statute “does not require you to prove that 
the rapes were widespread, it only requires you to prove that the armed conflict 
against the civilian population was widespread.” (Prosecutor v. Kunarac and Kovac, 
Case No. IT-96-23, Trial Chamber Transcripts, May 30, 2000:4179-80, emphasis 
added) The prosecution immediately concurred, saying, “Absolutely. Rape is 
one of the constituent ingredients in the widespread or systematic attack.” Judge 
Hunt then reiterated, “We do have to be satisfied beyond a reasonable doubt that 
the attacks were widespread.” (4180, emphasis added) Innovative jurisprudence 
with the force of legal precedent was formed out of this clash on the courtroom 
playing field. It sprang from the struggle and then synthesis of the judges and 
prosecutors interacting to make sense and then precedent of the evidence and 
events before them. The result was new law with new force in new circumstances. 
This was the kind of precedent Arendt realized only an international tribunal 
could provide. 

In the Srebrenica case, the struggle was over the introduction of an intercepted 
telephone communication in which the defendant, General Krstic, was allegedly 
recorded ordering a subordinate to “kill them all. God damn it.... [Not a] single 
one must be left alive.” (Prosecutor v. Krstic, Case No. IT-98-33, Trial Chamber 
Transcripts, November 1, 2000:6806-7) This evidence was introduced by the 
prosecution during rebuttal, leading one of the judges, rather than defense 
counsel, to interject, “Just one second. I don’t quite understand. You've had this 
for a while or you just got hold of it yourself recently?” (6801) This question 
was then put aside over the subsequent closing phases of the trial. The intercept 
was dramatic and attracted considerable publicity. Yet when the presiding judge 
read the verdict that found General Krstic guilty of genocide, he emphasized that 
“the Trial Chamber did not admit a recording in which a voice alleged to be that 
of General Krstic is heard saying, ‘Kill them all.’ I emphasize this point because 
it might have appeared that this exhibit was part of the case-file whereas that 
radio tap has not been admitted and is not an exhibit.” (Prosecutor v. Krstic, Case 
No. IT-98-33, Trial Chamber Transcripts, August 2, 2001:10185) Even though the 
defense was helped by the judge’s intervention to form the objection that excluded 
this prosecution evidence from the case-file, the intercept formed a persuasive 
part of a larger body of evidence that established the occurrence of genocide at 
Srebrenica. This intercept was important both in assessing the specific role of 
the defendant and the utility and validity of the intercepted communications 
more generally. 

The Foca and Srebrenica cases were thus hard-fought trials in which the judges 
regularly challenged the prosecution's strategies.¹⁹ From Bourdieu's perspective, 
these struggles often lend force to strategic courtroom interactions. In significant 
part as a result of the convictions in these cases, rape became a constituent 
element of the charge of crimes against humanity in the Milosevic case, while 
the Srebrenica massacre formed a crucial part of the charge of genocide. Still, the 
task of adding indictments to the Milosevic case for crimes against humanity in 
Croatia and genocide in Bosnia was challenging for Del Ponte. 

First, the charge of genocide itself was demanding, involving the intended 
destruction of a people in whole or in part, while a crime against humanity more 
modestly required widespread or systematic attacks on civilians (Robertson 1999). 
Second, the strongest case for genocide was in Bosnia, where Sarajevo, Srebrenica 
and ethnic cleansing more generally had claimed the largest number of lives; 
but the chain of command leading to and from Milosevic was clearer in Kosovo 
because it was still part of Yugoslavia. Third, the events in Bosnia had occurred 
in the first half of the 1990s, while the events in Kosovo had peaked in the latter 
half of that decade, leaving fresher evidence (R#165a:02/01/01). 

Nearly four months after Milosevic appeared at the Tribunal, in late September 
of 2001, Del Ponte announced she had signed an indictment for Croatia (ICTY 
Press Release JL/P.I.S./627e 2001) and that a Bosnian indictment would follow 
shortly (Simons 2001). The Croatian indictment listed nearly 700 deaths and 
more than 170,000 deportations.²⁰ There was speculation that Del Ponte was 
strategically using her public voice as prosecutor to place internal pressure on 
her own investigators to complete the Bosnian indictment, which she indicated 
would now take another month and include the charge of genocide (Klarin 2001). 
Adding the genocide charge was the subject of considerable debate inside the 
Tribunal²¹ as well as outside (R#124:07/05/02). Arbour had earlier adopted the 
view that the Kosovo case of crimes against humanity would result in a swifter 
and more predictable conviction, while a combined genocide trial with Milosevic 
at its center risked being more drawn out and less certain in its outcome. 

When the Bosnian indictment came in late November, Milosevic was identified 
as having planned, ordered, or at least inspired genocide and the full range of 
other crimes as the head of a “joint criminal enterprise.”²² The challenge behind 
this comprehensive indictment, and the certain cause of its long development, 
lay in the accumulation of evidence that could persuasively delineate the chain 
of decision-making that gave Milosevic de facto command responsibility for the 
events already demonstrated to have occurred in Srebrenica and Foca as well as 
other places (R#187:01/29/01). 

Getting this new kind of evidence focused attention on how the Tribunal was 
doing its investigative work and linking its results to subsequent prosecutions 
and trials (R#146:07/03/02). The problem was no longer simply establishing a 
crime base and securing an indictment that could lead to an arrest, but instead 
now revolved around the end-game challenge of gaining a conviction. This also 
coincided with building pressures from the Bush administration for the ICTY to 
end its investigations and to complete its trials by the end of the decade. 

Although the print media followed Del Ponte’s role in developing the 
Croatian and Bosnian indictments of Milosevic, little or no attention was given 
to the major internal reorganization of the OTP Del Ponte initiated in March 
of 2001.²³ Del Ponte and Blewitt noted that problems often were resulting from 
a sharp separation of the investigation and prosecution functions of the OTP 
(R#124:07/05/02; R#111c:02/01/01). In essence, the problem was that this division 
was impeding strategic planning and the collection of crucial forms of “inside” 
evidence. 

An experienced trial lawyer described the functional separation in the OTP as 
what we earlier called a Queen’s Counsel model in which the senior trial lawyers 
were “detached from the investigative component and essentially the recipient of 
a brief... as opposed to a federal prosecutor’s model or public prosecutor’s model 
that you would find in parts say of the British Isles or Canada or the U.S. or in 
the civil law countries that have investigative magistrates, or the German system 
with a public prosecutor.” (R#176c:09/16/02) This loosely coupled system also 
involved considerable variation in practices across cases and in general meant 
that prosecutors had little advance preparation (R#190:07/02/02). 

This was the problem the prosecutor Mark Harmon confronted as lead counsel 
in the Blaskic prosecution,²⁴ a case portrayed by a colleague as like a complicated 
clock in a thousand disassembled pieces (R#118:01/29/01); or the situation 
another senior trial attorney similarly described as like “getting inundated with 
three filing cabinets full of statements and documents” that were incoherently 
organized, so that in the prosecution phase “you’re playing catch up” and left 
asking the question, “Why do we have X number of our witness statements that 
have to be redone or rewritten?” (R#176c:09/16/02) 

The first step was to appoint a new chief of investigations. Patrick Lopez Terres 
emerged as a strong internal candidate (R#111c:02/01/01). Lopez Terres had no 
doubts about why he was chosen: “The real reason I was put in this function, 
... was as part of the reorganization the prosecutor wanted — to try to put an 
end to a very strong separation that was in the house between the lawyers and 
the investigators. The idea was that because of my background, a kind of hybrid 
person, having been a lawyer dealing with investigative matters most of my career, 
I could be the right person” (R#190:07/02/02). Initially there was resistance from 
the investigators to giving lawyers a larger role in overseeing their work.²⁵ 

Blewitt and Del Ponte went on to search for a new chief of prosecutions. After 
considering internal and external candidates, Blewitt and Del Ponte decided to 
proactively pursue Michael Johnson, a prosecutor from New Hampshire who 
had worked periodically on loan to the OTP on procedural matters and in 
fund-raising efforts. 

Decisions about sending investigators on missions now came under closer 
scrutiny. This involved Lopez Terres “scrutinizing what the teams are doing [on 
mission]... saying ‘no’ more often than was ever said before.” (R#111c:02/01/01) 
Significantly, he saw this as a move toward the civil law approach, saying 
“obviously the background of the prosecutor is very similar to my background... 
[and] inclines more naturally to have the lawyers in charge of decisions.” 
(R#190:07/02/02) Yet a North American lawyer also commented, “We’re more 
perhaps of an American federal prosecutorial style now.” (R#168:02/01/01) Both 
the Europeans and the North Americans could see elements of their national 
legal practices in the new approach. By the time the Milosevic trial began early 
in the new year, the OTP had undergone a major internal transformation that 
placed the prosecuting lawyers much more firmly and proactively in charge of 
investigations (R#111c:02/01/01). 

The results were variable, with the prosecutors in this case among the first to 
concede that their success in "turning" high-level witnesses was mixed.²⁶ Three 
years after the delivery of Milosevic to The Hague and 200 witnesses into the 
trial, observers began to comment that no insider, smoking gun witnesses were 
forthcoming. “At this juncture,” the Dutch international law expert, Gerard 
Strijards, concluded, “[I]t is Milosevic who has the upper hand.” (Franks 2002) 

In the middle of July 2002, Del Ponte met in Belgrade with Yugoslav President 
Kostunica. A result was that Yugoslavia freed some former Milosevic associates 
to reveal confidential information without threat of prosecution at home for 
revealing state secrets (Kratovac 2002). The stage was set for the delivery of 
compelling testimony during the final weeks of the Kosovo phase of the trial. 
The build-up to this testimony described a Milosevic-led cover-up of ethnic 
cleansing operations in Kosovo and the removal of bodies to Serbia. Two Serb 
policemen who were characterized as “unwilling insiders” confirmed that the 
reburial of bodies from Kosovo was ordered by Milosevic and his interior 
minister (Klarin 2002). Later, as the Milosevic trial shifted to the joint criminal 
enterprise in Croatia, witnesses from the former Yugoslavian army testified that 
all significant military decisions were taken in Belgrade. One witness explained 
that Belgrade was “a synonym for Milosevic,” turning at one dramatic point in 
the cross-examination of Milosevic to say, “You were Belgrade!” (Klarin 2002; 
Prosecutor v. Milosevic, Case No. IT-02-54, Trial Chamber Transcripts, October 
30, 2002:12561) 

As this testimony unfolded in an ICTY courtroom, Del Ponte simultaneously 
addressed the UN Security Council. Speaking in general terms that seemed 
aimed at the U.S. in particular, Del Ponte insisted, “If the Tribunal is to meet the 
completion strategy targets and the deadlines that are expected of us,... other 
problems have to be tackled by the international community.” (Klarin 2002) 
These other problems involved gaining more cooperation from the Belgrade 
government in accessing archives and making arrests. 

The significance of Del Ponte’s remarks was that they revealed a chief 
prosecutor strategically speaking back to the pressure placed on the Tribunal 
by the Bush administration. The point is frequently made by Milosevic, among 
other critics of the ICTY (e.g., Parenti 2000), that the Tribunal is a pawn of the 
U.S., Britain and other Western nations. Yet there is evidence that the ICTY acts 
without direction from the U.S., the UN and other nations, as shown by 
Arbour’s initial Milosevic indictment and by Del Ponte’s demands for American 
and other sources of external assistance. Del Ponte made this point clear in 
response to the Bush administration’s criticism of the ICTY before Congress and 
in the aftermath of 9/11. She remarked at a joint news conference held with the 
U.S. Ambassador Prosper in The Hague, “There may be people who are saying 
[after 9/11] the world has moved on and the issue of the day is now terrorism, 
[but] we cannot take that view of international justice.” (Buncombe 2002) 

International criminal law — as developed across four chief prosecutors at the 
ICTY — is emerging on contested terrain that now is visibly characterized in a 
competition for authority and legitimacy by blunting political interruption, most 
evidently in response to the hostility of the Bush administration to the renewal 
and growth of this juridical field. By studying the Tribunal from the inside-out, 
we have demonstrated how the force of international criminal law — and the 
world view that underwrites it — is enacted and promoted through official 
representations, daily practices and a development of coercive and cooperative 
mechanisms (see Bourdieu 1987:847-48) that divert contrary political efforts to 
limit the supply of cases and connected initiatives. 

Pierre Bourdieu describes the competition between artistic generations in the 
field of cultural production as a struggle between "established figures" and 
“newcomers.” (Bourdieu 1993:60) We describe a homologous sequence in the 
field of international criminal law in which new legal practices are added by 
successive prosecutorial regimes to the previously available repertoire. Decisions 
to initiate exhumations of mass graves on foreign territory, search and seize 
archival evidence from sovereign nations, seal indictments against citizens of 
recognized countries, and manipulate financial inducements otherwise known 
as international assistance — these all involve invocations of normative practices 
by successive prosecutorial regimes to advance the ICTY's extra-territorial 
jurisdiction and the moral authority of international law. Although the process 
is characteristically incremental, the addition of such practices into the field 
often reflects a process of internal competition as well as cooperation in which 
different actors bring knowledge and experience of previous national practices to 
address unresolved international problems. In Bourdieu's terms, this is a process 
in which “new modes of thought and expression" (1993:60) are asserted, enacted, 
concretized and thereby converted into legal doxa. 

Although there is no doubt the ICTY is subject to external pressure and 
political influence, a central point of this analysis is that as the juridical field 
of international criminal law is developing, it is strategically acting back on the 
world outside its courtrooms, and it does this as much through the force field of 
its socially organized practices as through the doctrinal detail of the case law it 
produces. The competition of prosecutors and judges in the social organization 
of court-connected encounters and scenarios is an essential vehicle through 
which the force of law is developed and exerted. The social history of the ICTY 
offers unique contemporary evidence of how this happens in international legal 
practice, and this study is an effort to document and explain this process from 
the inside-out and through a succession of prosecutorial regimes. 

Cherif Bassiouni, as the eventual head of the Commission of Experts, drew 
on his cross-national knowledge of domestic and international criminal law 
and the entrepreneurial possibilities provided by foundation funding to amass 
a body of evidence to make the prima facie legal case that the Serbs were the 
primary aggressors in the Balkan conflicts of the early 1990s. Although much 
of this evidence was hearsay and marked no more than an early stage in the 
exercise of investigatory powers, the collection of this evidence nonetheless was 
shaped within the orthodox corroborative habitus of Western common law. That 
is, the collection of evidence used multiple, corroborative sources to establish a 
foundation for the initiation of individual indictments. The amassed evidence 
undercut claims of moral equivalency among the warring parties that were 
previously used by elite politician-diplomats to privilege peace negotiations over 
the criminal prosecutions favored by the new human rights and international 
law groups. 

The latter groups worked alongside Bassiouni in successfully getting the UN 
to establish the ICTY, although with no sustaining budget and a first prosecutor, 
Richard Goldstone, who knew little about criminal law and who was hesitant to 
initially pursue major war criminals. The Tribunal was also initially stalled by its 
lack of investigation and arrest powers. Neither Bassiouni nor Goldstone had a 
plausible plan to achieve arrests and transfers of indicted suspects to the ICTY for 
trial. Nonetheless, the Srebrenica investigation was initiated during Goldstone's 
tenure and established a power of legal entry to begin mass grave exhumations 
in areas under Bosnian Serb control, which led in turn to the development of a 
power to search and seize military evidence about the Srebrenica massacre. 

Goldstone's legal policies and procedures threatened the future of the 
Tribunal when he was unable to convert indictments into arrests or conduct 
viable trials. He sufficiently vilified the indicted Bosnian Serb leader Radovan 
Karadzic so that he was removed from office, establishing some force to the 
ICTY’s institutional position, but Goldstone was not able to bring Karadzic or 
any other major indicted figure to trial, and in the process the ICTY began to 
take on the appearance of a “virtual tribunal.” 

In contrast, Louise Arbour drew on her experience as a criminal lawyer 
in Canada to better establish the prosecutorial powers of the Tribunal. She 
recognized that her UN mandate could be viewed as granting a criminal law 
power to use the domestic criminal practice of sealed or secret indictments 
to facilitate surprise arrests of indicted Balkan war criminals by international 
authorities, such as NATO military forces. The challenge was to claim the 
traditional criminal law prerogative of secret indictments as international criminal 
doxa, notwithstanding expectations of deference and transparency that Goldstone 
previously emphasized were required by norms of international diplomacy. 
Arbour used sealed indictments to co-opt U.S., British and other NATO-led forces 
as agents for arrests, and later as a means of controlling the staging and timing 
of the Milosevic indictment that the U.S. did not consistently support. 

Arbour also encouraged international political and financial authorities, 
starting with the Council of Europe, to make monetary aid contingent on transfers 
of indicted individuals, thus encouraging Croatia to send indicted war criminals 
to The Hague. Arbour often assumed rather than requested the authority to 
initiate and pursue her policies and practices, and with her success in doing so 
she established a new force to the ICTY and by implication for international 
criminal law. 

With Milosevic removed from office but still in a Belgrade jail awaiting transfer 
to The Hague, Carla Del Ponte worked to elaborate Arbour's monetary strategy. 
This strategy drew on her Swiss financial experience and involved convincing 
the United States to join with the World Bank and European Union in using 
international aid as an inducement with the new Federal Republic of Yugoslavia. 
International financial assistance was made contingent on the transfer of Milosevic 
and broader Serbian cooperation in providing documentary evidence and fellow 
indictees to the ICTY. The force of ICTY law was now moving along both the 
coercive dimension of a forcible arrest power and the cooperative dimension 
of financial inducement. Del Ponte also expanded the naming power of the 
Tribunal's criminal law mandate by upping the charges against Milosevic from 
crimes against humanity in Kosovo to include genocide in Bosnia. Arbour had 
worried the genocide charge would result in a protracted trial with a less certain 
outcome. Meanwhile, Del Ponte also introduced a reform in the organization of 
field operations that placed prosecuting lawyers directly in charge of investigators, 
from the beginning to the end phases of evidence collection and preparation 
for trial. 

The advances of the ICTY prosecutorial regimes then were based on elements 
of competition and reproduction. Each regime featured a set of competitive 
strategies that advanced the work of the Tribunal and, through its actions, the 
institutionalization of international criminal law; in this way, the practiced 
strategies of each regime reveal how new force and autonomy can be given to 
law. We have argued that the current Tribunal has accumulated a combination of 
coercive and cooperative powers that have an interlocking and cumulative effect in 
allowing the prosecution a range of options to achieve its enforcement goals. 

These powers — to collect onsite forensic evidence; to search and seize archival 
evidence; to secretly indict and financially, as well as forcibly, induce arrests 
and transfers of defendants; and to choose among serious charges in a tightly 
coordinated investigation and prosecutorial regime — were not operational or 
even clearly comprehended when the Tribunal began. Today they are established 
elements of the field. The competitive edge in advancing the strategic aims of the 
Tribunal crucially depends on the practiced application of these multiple powers. 
The alternation and coordination of these powers is a foundation that gives real 
world significance to the force field of international law. Of course, there is no 
claim here that the force of this law is wholly independent of sponsoring parties, 
such as NATO and especially the United States. Nonetheless, this account and 
its focus on the competitive strategies of successive prosecutorial regimes tells 
us something about a process by which a relative autonomy from such powerful 
parties can be achieved, particularly by those who participate in the process and 
legitimize it at each turn. 


Notes 


1. Prosecutor v. Tadic, Case No. IT-94-1-T, Trial Chamber, May 7, 1997; for an overview, see 
ICTY Case Information Sheet (Tadic Case), April 18, 2002; and on early proceedings, see 
generally Prosecutor v. Tadic and Borovnica, Case No. IT-94-1-T, Indictment, February 13, 
1995; ICTY Press Release CC/PIO/004-E 1995, 008-E 1995, 009-E 1995, 011-E 1995, 012-E 
1995, 015-E 1995, 016-E 1995. 


2. See Prosecutor v. Karadzic and Mladic, Case No. IT-95-18-R61 and IT-95-5-R61, Trial 
Chamber Transcripts, June 27-28 1996, July 1-11, 1996; ICTY Press Release CC/PIO/092-E 
1996. 


3. See Prosecutor v. Krstic, Case No. IT-98-33, Trial Chamber, August 2, 2001; ICTY Press 
Release OF/P.I.S./609e; and see Prosecutor v. Krstic, Case No. IT-98-33, Indictment, October 
30, 1998. 


4. Prosecutor v. Kunarac, Kovac and Vukovic, Case No. IT-96-23 and IT-96-23/1, Trial Chamber, 
February 22, 2001; and see ICTY Press Release JL/P.I.S./566-e 2001. 


5. For background, see the indictments regarding Kosovo (Prosecutor v. Milosevic, Milutinovic, 
Sainovic, Ojdanic and Stojiljkovic, Indictment, May 24, 1999, Amended Indictment, June 
29, 2001, Second Amended Indictment, October 29, 2001), Croatia (Prosecutor v. Milosevic, 
Indictment, October 8, 2001, First Amended Indictment, October 23, 2002), and Bosnia 
(Prosecutor v. Milosevic, Indictment, November 22, 2001). 


6. Citations in this format denote information gathered from interviews. For additional 
details, please direct correspondence to John Hagan. 


7. Bassiouni wanted to lead the commission himself, but settled temporarily for being its 
rapporteur, along with a Canadian military lawyer, Bill Fenrick. Fenrick was not himself in 
tension with Kalshoven's approach, since he more narrowly regarded the commission as an 
experiment in preparing court-ready cases (R#128:06/28/99). 


8. Prosecutor v. Blaskic, Case No. IT-95-14, Decision on the Objection of the Republic of Croatia 
to the Issuance of Subpoenae Duces Tecum, July 18, 1997:45. 


9. See also Prosecutor v. Kordic and Cerkez, Case No. IT-95-14/2, Decision Stating Reasons for 
Trial Chamber's Ruling of 1 June 1999 Rejecting Defence Motion to Suppress Evidence, June 
25, 1999. 


10. Prosecutor v. Dukic, Case No. IT-96-20-I, Indictment, February 29, 1996; ICTY Press 
Release CC/PIO/031-E 1996, 032-E 1996, 033-E 1996, 034-E 1996. 


11. See Prosecutor v. Tadic, Case No. IT-94-1-T, Trial Chamber, May 7, 1997. 


12. See Prosecutor v. Karadzic and Mladic, Case No. IT-95-18-R61 and IT-95-5-R61, Trial 
Chamber Transcripts, 27-28 June 1996, 1-11 July 1996; ICTY Press Release CC/PIO/092-E 
1996. 


13. As such, the Judges of the International Criminal Tribunal amended and adopted a 
significant number of rules at the 14th plenary session of the Tribunal, ending on November 
12, 1997. For instance, this included the adoption of Rule 53 bis, which addresses the ability 
to serve an indictment on an accused once they are taken into custody, so that “[s]ervice of 
the indictment shall be effected personally on the accused at the time the accused is taken 
into custody or as soon as reasonably practicable thereafter.” ICTY Annual Report 1998:105; 
ICTY Rules of Procedure and Evidence: Rule 53 bis. 


14. The Canadian Federal Cabinet Order in Council confirms that the appointment was 
approved but undisclosed on May 26, 1999, the day before Arbour’s announcement of the 
Milosevic indictment (Privy Council Office 1999). 


15. Prosecutor v. Kunarac, Kovac and Vukovic, Case No. IT-96-23 and IT-96-23/1, Trial 
Chamber, 22 February 2001:436ff. 


19. Field notes, July 2000. 


20. Prosecutor v. Milosevic, Indictment, October 8, 2001, First Amended Indictment, October 
23, 2002. 


21. Field notes, July 2001. 


23. Field notes, July 2001. 


24. For background, see Prosecutor v. Kordic, Blaskic, Cerkez, Santic, Skopljak, and Aleksovski, 
Case No. IT-95-14, Indictment, November 10, 1995; Prosecutor v. Blaskic, Case No. IT-95-14, 
Amended Indictment, November 22, 1996, Second Amended Indictment, April 25, 1997. 


Consent, Informal Organization and Job 
Rewards: a Mixed Methods Analysis* 


Abstract 


This study uses a mixed methods approach to workplace dynamics. Ethnographic 
observations show that the consent deal underlies an informal stratification that 
divides the workplace into an “informal periphery,” a “conventional core” and an 
“administrative clan.” The “consent deal” is defined as an exchange of autonomy, 
voice and schedule flexibility for intensified commitment, and is modeled as a 
single factor underlying these elements. When constructed as an additive scale, 
consent allows informal organization to be included in workplace models. Despite 
its derivation from subjective and informal processes, informal structure exerts an 
independent effect on objective job rewards such as wages. 


Introduction 


The workplace is one of the most intensively studied areas in sociology, yet despite 
a century of studying organizations, we still have only a vague understanding of 
the dynamics between formal and informal organization, between structure and 
culture, and between objective and subjective elements. We can enter a workplace 
and adjust to the culture, identify and work with key informal power brokers, 
and manipulate that intersubjective dynamic which we know affects workplace 
outcomes. However, we have yet to fulfill Gouldner’s (1959) call for reconciliation 
between the “rational” and “natural system” models and develop a theoretical 
framework that would move us from an understanding of the dynamics of a 
workplace to a generalized model of the workplace. Without this general model, 
we can assert but not demonstrate claims that: the formal, structural side of the 
workplace has a minor role in determining workplace attitudes, or the informal 
dynamics has a measurable impact on objective outcomes. 

This study is about measuring the effects of one important intersubjective 
dynamic that is observable in the generalized workplace. I take a mixed methods 
approach: first using ethnographic data to identify the elements of that dynamic, 
and then using survey data to model it between the structural aspects and job 
rewards. This key dynamic creates an informal stratification of workers into (a) an 
administrative clan: an elite group that works under normative control and enjoys 
upper-tier, primary labor market working relations; (b) a conventional core: the 
majority of primary market workers who work under bureaucratic conditions; 
and (c) an informal periphery: whose members work under the harshest “market” 
relations with the strictest technical or personal control. I identify this dynamic 
as the “consent deal,” an informal relationship between managers and workers
that reflects the intensity by which managers enforce formal work rules and 
by which workers extend effort on workplace tasks (see Littler and Salaman 
1984). The intensity of this dynamic is measured as an exchange of autonomy, 
schedule flexibility and voice by managers for organizational commitment by 
workers. For each worker, a high level of all elements indicates membership 
in the administrative clan and a low level of all indicates membership in the 
extended periphery. 

Using these elements, I model the consent deal as a second-order latent 
factor, then construct an additive scale and measure its distribution between 
formal workplace levels and occupation types. Finally, I incorporate consent as 
a measure of the informal intersubjective dynamic within a broader workplace 
model and test its effects relative to the effects of objective structural factors on 
job satisfaction, worker identities and wages. 


CONSENT 


Burawoy’s (1979) conception of workplace consent has been important to 
analyzing power relations within the workplace. A number of ethnographies 
since have elaborated on how management generates consent and limits class 
consciousness (e.g., Hodson et al. 1994; McCabe 1999; Smith 1996). However, 
these have failed to offer a conception of consent that can be used to broaden 
the research using statistical modeling. Indeed, studies that have used statistical 
modeling have deviated from a relational definition in favor of worker-centered 
measures — e.g., Vallas (1991) used worker perceptions indicative of class 
consciousness, and Hodson (1999) used organizational citizenship and resistance 
behaviors.

A more useful approach was offered in Littler and Salaman’s (1984) discussion 
of control and consent. They begin with the observation that the key factor in 
determining organizational structure is the need to convert raw labor power to
productive labor (Braverman 1974; Hache 1988), and that this is done through 
various labor control mechanisms (Edwards 1979). They then echo Burawoy's 
observation that some process to generate worker compliance must also be 
designed into the organization of the work process. They recognize that while 
control is often established through management's work rules and procedures, these 
rules cannot realistically account for all circumstances that arise in production. 
A normalized production flow therefore requires some amount of give and take, 
such as bending rules for extra effort. In fact, workers sometimes engage in a 
form of resistance called “working to rule” in which they refuse to participate
in this give and take, thereby slowing production. This requirement leads Littler 
and Salaman to characterize real work behaviors and relationships as the result 
of continual negotiations between workers and their immediate supervisors over 
interpretations of formal work rules, a flexibility which is offered in exchange for 
a working commitment to the overall objectives of management. Because this 
involves an effective suspension of the rules, procedures and regulations around 
which the formal organization is based, they conclude that consent is developed 
“outside formal organizational procedures for establishing legitimacy, in what is 
described as the ‘informal’ structure of the organization.” (1984:68) 


INFORMAL ORGANIZATION 


Organizational theorists since Roethlisberger and Dickson ([1939] 1967) have
recognized that informal organization rivals formal organization in its effect on 
the day-to-day functioning of an enterprise. Subsequent theorists have discussed 
its importance for organizational dynamics (e.g., Burns and Stalker 1961; 
Pfeffer and Salancik 1978; Selznick 1949; Thompson 1967), and its effect on 
enhancing or restricting productivity (e.g., Burawoy 1979; Graham 1971; Mayo 
1933; Reif et al. 1973; Roy 1959; Sayles 1963). However, despite this attention, 
there has been little consensus on how informal organization manifests across 
organizations. While Roethlisberger and Dickson describe informal organizations 
as the “actually existing patterns of human interaction” by which the work of 
the organization is performed ([1939] 1967:559), others characterize it in such 
terms as the “natural” v. “rational” system (e.g., Selznick 1949; Thompson 1967), 
“organic” v. “mechanic” model (Burns and Stalker 1961), “culture” (see Ouchi 
and Wilkins 1985), “negotiated order” (Fine 1984), and “discourse” (Stinchcombe 
1990), none of which easily lend themselves to modeling. Lawrence and Seiler 
(1965:187) approach a usable construction with their discussion of workers 
as having a “status,” which is determined by “position in the informal social 
organization” but never specify how that position was determined. Without a
clear specification, quantitative work on informal organization has not advanced 
much beyond network analysis, which reduces it to the number and type of 
communication links between workers (e.g., Mizruchi 1994; Podolny 1990; Vogel 
1968; Wellman and Berkowitz 1988). This specification has obvious limitations
for cross-organizational studies. 

Some characteristics of informal groups can be found in the literature. 
Gouldner (1959:410) acknowledges the ambiguity regarding informal organization 
by describing it as “a residual or cafeteria concept of diverse and sprawling
contents.” He relates the “natural-systems model” (which focuses on informal
organization) with his earlier discussion (1954) of “representative bureaucracy,”
which he says has its basis in consent. He notes that consent springs from a 
“consensus of ends and values” (1954:223), and that an important component 
is worker perception of having “some measure of control over the initiation and 
administration of the rules.” He also notes that “formal rules gave supervisors 
something with which they could ‘bargain’ in order to secure informal cooperation 
from workers” (1954:173) and identifies schedule flexibility as an important 
bargaining “chip” for informal cooperation. 

Other hints can be found in discussions of the informal coalitions that are 
found within organizations. Thompson describes coalitions as workers who want 
to “maintain or enhance their positions regardless of the official, authorized 
positions they hold” (1967:125), and who tend to have high levels of discretion 
and some voice in enterprise decision-making processes. Similarly, Pfeffer and 
Salancik’s coalitions consist of workers who are (1) involved in the “enactment 
of the organization’s environment” — an essential part of decision-making, and 
(2) driven in part by “the quest for discretion and autonomy.” (1978:261) 


Observations of Informal Workplace Stratification 


I observed informal stratification and the elements of consent that underlie it 
while conducting ethnographic observations of Family Finance Corporation 
(FFC), a family-owned financial enterprise that went public, overextended, and 
was absorbed by a larger corporation. These observations were made while I
worked over four years in several roles: a temporary employee assigned to the 
company, a part-time computer programmer, a full-time administrator and 
an outside consultant. Relevant observations are presented in summary form 
because the agreement under which management allowed these observations 
excluded interviews or quotations attributed to members of the workplace. Much 
of the data therefore consists of natural conversations and incidents observed 
in the course of my duties. To underscore that people did recognize my dual 
role as worker and observer, I was given the nickname of "the professor" by my 
supervisor, and I encouraged its use among coworkers. 

FFC had two periods with distinct cultures during the study, before and after 
its move to a building constructed during its intensive growth phase. When I 
started, the culture was very informal, with an "organic" management style (per 
Burns and Stalker 1961), and had working relationships such that all levels of 
workers intermingled regularly and most people could be expected to do any task.
FFC's main office was located in the converted storefronts of a strip mall, and
there were no partitions between desks and few between functional areas. There 
were scores of temporary workers throughout the offices helping with what turned 
out to be a disastrous manual conversion of account records between computer 
systems, and these workers were often treated as regular staff. 

Just prior to the move to the new building, a new management team was 
recruited for finance, information systems and personnel functions, and the 
formation of this team coincided with a change to a more formal, "mechanic"
style. Working relationships became much more formal and professionalized. 
Functional units were separated, workers were given cubicles, managers were 
given offices, and security procedures were implemented to restrict access to 
the building by non-employees. Hierarchical distance quickly appeared between 
management, professionals, clerical staff and temporary workers. The corporation 
appeared to objectify its new organizational stratification in the distribution of 
functional units by floor, which I have depicted in Figure 1. 

The move to the new building with offices, cubicles and new furniture 
and equipment signaled a change in culture away from the business's earlier 
incarnation as an entrepreneurial free-for-all to an established corporation. 
Workers seemed to take themselves more seriously by dressing more formally 
and decorating their cubicles and offices in a more “professional” style. The 
corporation took an active role in developing the culture by implementing a 
corporate newsletter and staging occasional picnics and holiday parties. Functional 
units began developing subcultures — traditions, languages and social ties that 
build unit solidarity. 


INFORMAL STRATIFICATION 


My duties included developing, installing and troubleshooting computer systems, 
and these responsibilities allowed me to travel to all social and physical levels of 
the new building where I was able to observe the transformation in workplace 
relationships and attitudes. I could see that the informal networks that developed 
prior to the move didn’t completely disappear, but congealed into an informal 
administrative structure that appeared to shadow the formal structure. I also noted 
the emergence of an informal polarization that had either been camouflaged or 
minimized by the earlier culture. This polarization created three groups that were 
distinguishable, but whose boundaries were continually in flux. I came to call 
these groups the informal periphery, conventional core and administrative clan. 
Their distribution within the building (and among functional units) is depicted 
in the shaded areas in Figure 1. 


Informal Periphery


The informal periphery was the "bottom" of the informal structure. Workers in
this group appeared to visibly manifest the dissolution of the boundary between 
temporary and permanent workers described by Smith (1997). Many were either 
traditional contingent workers (temporary or part-time workers) or permanent 
workers who were treated as contingent. Their work tasks were well-defined and 
managerial control was intensive and often antagonistic, whether conducted 
through direct attention from a supervisor or automated into the technology of 
their work. These workers were generally given such highly routinized tasks that 
their contribution came more from attendance than application of a skill set, a 
stark reality that was reflected in the minimal schedule flexibility they received. 

For many workers, this outsider status was temporary, until their general 
proficiency was recognized to be sufficient to warrant more relaxed supervision or 
until they were able to demonstrate some level of commitment or skill. However, 
there appeared to be some workers who were consistently relegated to out-group 
status. Some were relegated simply because of deficient individual performance.
Others were relegated because they belonged to a racial or ethnic minority or to a
subordinate work group being assimilated through a merger. Still others simply worked in
positions which had experienced such high levels of turnover that incumbents 
needed to show extraordinary patience or proficiency before any opportunity to 
improve their situation. 

Workers in the informal periphery were rarely acknowledged by management, 
but when they were, they were usually referred to with terms that indicated
unreliability and expendability. On one occasion when I was walking on the
first floor with one of the higher level managers, he told me that these workers 
were “clock watchers.” Many workers in the informal periphery responded in 
kind to this treatment, showing low levels of dedication and sometimes evincing 
minor forms of resistance. For them, the enterprise represented an onerous work 
environment with few redeeming features beyond a paycheck. 


Conventional Core 


The majority of workers were in what I called the “conventional core.” These 
are the workers for whom the formal organizational rules apply. Occupations 
in this group include everything from clerical to management, but the worker- 
organization relationships tend to be dominated by bureaucratic concerns. For 
example, these might be clerical or technical workers who are highly committed, 
but their autonomy and other work conditions were no different from their less 
committed colleagues as dictated by their formal position. Similarly, the core also 
included professionals or managers whose positions gave them very high levels 
of autonomy, schedule flexibility and input into the organization, but whose 
level of commitment (the alignment of interests with the corporation) was
relatively low in comparison to their peers. 


Administrative Clan 


Soon after starting work, and despite my status as a temporary worker, my 
technical skills and managerial background placed me in an informal technical- 
administrative group that cut across the organization's functional units and 
formal hierarchical structures. This group, which I titled the "administrative 
clan" after Ouchi's (1980) organization type, appeared to operate in the manner 
of the "coalitions" described by Thompson (1967) and Pfeffer and Salancik
(1978). It centered on management and professional staff, but extended deep 
into the organization, drawing in workers with critical institutional knowledge 
or idiosyncratic skills. The group often appeared as a clique or “in-group” of 
workers at various levels through which the most critical administrative activity 
seemed to flow. Membership was not necessarily commensurate with structural 
factors such as formal hierarchical position, formal skill set or tenure. 

There were numerous examples of offices in which workers would hold 
identical job descriptions with the same formal authority, but only one worker 
would be recognized and treated as part of the clan. Some clan members would 
have minor differentials in title from workers who were otherwise their peers; 
however, these differentials apparently served only to legitimize a greater reliance 
on the “other duties as assigned” clause of their formal job description. One work 
unit that was given the title of “special projects” was staffed predominantly with 
clerical clan members who would conduct high priority data entry (or data clean- 
up) projects to assist any functional unit of the organization. Even this unit was 
stratified among clerical clan who worked autonomously and clerical non-clan 
who were more closely supervised. 

As noted, identifying members of the clan was generally easy, but specifying 
characteristics of membership was not. There were no set boundaries between 
members and non-members nor were there designated rituals defining 
membership; people moved in and out of the group as their proverbial “star” 
rose or fell. Membership appeared to take the form of an aura of reliability, as 
if it had been confirmed by some ordeal. In many cases it had been — some 
members were known for their willingness to put forth heroic efforts for critical 
projects, others for their ability to help define or represent some important aspect 
of organizational culture. All were trusted workers who knew how to manage 
critical tasks under minimal direction. 

One characteristic that seemed important was engagement with corporate 
cultural activities — regardless of the actual feelings the clan member had toward 
the activity. Clan members would help plan and even bake special dishes for 
parties, read and even contribute to the newsletter, and participate in a “vision 
committee” that appeared to be FFC's equivalent to quality circles.

This reliability and trustworthiness appeared to reflect a sense that the heroic
efforts of these workers were motivated by a sincere concern for advancing FFC's 
interest. Members had aligned their interests so completely with the corporation 
that, for them, the typical control processes were not necessary. This alignment 
closely resembled Ouchi's (1979, 1980) clan organization form, but unlike for 
Ouchi, the alignment applied to this informal group instead of to the organization 
as a whole. Interestingly enough, my label for this group as the "administrative 
clan" resonated well for coworkers with whom I discussed the validity of the 
concept. 


WORKPLACE POLARITY 


The administrative clan and informal periphery embody a number of workplace 
polarities. Members of the administrative clan tend to be on the "fast track" for 
promotions, get the highest raises, and have better ties with other supervisors 
and coworkers, while members of the informal periphery are generally ignored. 
The correspondence between informal structure and flexibility theory's "core" and 
"periphery" (e.g., Berger and Piore 1980; McLoughlin and Clark 1988; Osterman
1975; Piore 1971) became apparent in worker responses to staff cuts that were 
forced by FFC's financial troubles. The business had been closing branch offices for 
months with little reaction from workers at headquarters because the enterprise 
was thought to be consolidating work there. However, when almost all of the 
external branches were cut, headquarters experienced two waves of layoffs. The 
first wave hit workers at all levels of the formal organization, but was focused on 
members of the informal periphery. Although this wave included some managers 
and professionals, survivors were clearly not concerned for their own jobs. When 
asked about the layoffs of managers and professionals, they replied with statements 
suggesting that management was merely “cutting dead weight.” However, when 
the second wave of layoffs included administrative clan members (including 
myself), survivors displayed considerably more concern that the enterprise was 
in serious trouble and that their own jobs were at risk. An organization that 
would cut clan members was now seen as being in deep trouble. When I was 
brought back as a consultant, I was told that each unit was required to give up 
a staff member, and that I was cut because of my dual commitment to FFC and 
to my academic research. 

In an interesting addendum to the layoffs, as if to underscore the contingent 
nature of the extended periphery, FFC hired a new cohort of these workers within 
weeks after the layoffs. When I asked why FFC was simultaneously hiring and 
laying off workers, Human Resources staff indicated that turnover within these 
positions was so high that these positions were not considered when making 
strategic staffing decisions for the enterprise. 


CHARACTERISTICS OF INFORMAL ORGANIZATION: THE CONSENT DEAL


Despite the three distinct informal strata I observed, there appeared to be an 
underlying continuum that created "degrees" of clan and periphery as people 
moved into and out of those groups. This continuum reflected varying levels of 
consent — a characteristic of the corporation-employee relationship indicating 
the level of active cooperation in the process of production. However, within this 
context consent means much more — implying a level of engagement between 
the worker and the corporation, or at least one of the organizational coalitions 
that administer the corporation. At its high end this engagement means Ouchi's 
clan relations, where control is normative (e.g., Kunda 1992) and maximum effort 
is assumed, and at the low end the lack of engagement means a relationship of 
suspicion, where control is technical or direct (Edwards 1979), and supervision 
is close, leading to the "less reliable" worker performance described by Gouldner 
(1954:161). 

This continuum of engagement is observable in what I refer to as the "consent 
deal," based on Littler and Salaman's (1984) conceptualization of consent as an
exchange of relaxed enforcement of rules for alignment of interest. This takes 
the form of an exchange of perceived autonomy, voice and schedule flexibility for 
organizational commitment, and is as clearly visible in the administrative clan 
as it is clearly absent in the extended periphery. Perhaps the most important 
of these offerings from management is the perception of autonomy, which is 
regularly brought up in the literature as a characteristic of desirable positions. 
In the administrative clan, even members at non-professional, clerical support 
levels perceive themselves to have high levels of autonomy more characteristic of 
professional positions. In the informal periphery, even members of management 
are closely scrutinized. 

The next offering from management is the perceived ability to participate in 
organizational decisions. Administrative clan members, even from lower formal 
positions, perceive themselves as influencing the corporation. This influence is 
not always direct, although they are sometimes asked and are always listened 
to when offering an opinion. Administrative clan members are aware that they 
exercise disproportionate influence indirectly by preparing formal reports and 
participating in information channels which shape management's conception of 
organizational issues (i.e., Pfeffer and Salancik’s “enactment process”). On the 
other hand, periphery members generally learn about decisions and sometimes 
even the problems they address as the decisions are being implemented. In a 
more direct application of Hirschman’s (1970) terms, when problems develop,
clan members are more comfortable exercising “voice” while peripheral workers 
are more likely to “exit.” 

The third job characteristic in the consent deal is perceived schedule flexibility. 
For administrative clan members, this often starts with their working long hours 
when they are called to help complete time-sensitive projects in exchange for
“compensatory time off.” In many cases this flexibility takes the form of clan 
members not observing strict time rules, as in arriving late or leaving early. 
Often this flexibility is more perceptual than real, because their commitments 
as administrative clan require face time at the workplace. This flexibility is best 
exemplified by a clerical worker in my unit who often had problems finding 
daycare for her children, especially when she was called in on her days off to 
work on critical problems. She would occasionally bring her children, and we 
would set up games on a computer to occupy them as she worked. We would 
not even have considered this for someone who was not a clan member. My own 
schedule is another example — I often came in late, but while I was occasionally 
teased, coworkers recognized that I regularly worked at home. Members of the 
periphery, either because their contribution comes more from their presence 
than their skills or because they are under intense pressure from managers, have 
no such flexibility. If they were “clock watchers” as described by managers, their
tardiness and early departures were no less closely scrutinized by those same
managers.

The fourth characteristic, organizational commitment, is the employee’s 
contribution to the consent deal. It is critical for clan membership and absent 
in the informal periphery. Administrative clan members act as if their interests 
are fully aligned with the enterprise, and give much of themselves to it, often 
to the detriment of their families and social life. This commitment is regularly 
tested in extra responsibilities not associated with their formal job duties. For 
instance, the Information Systems unit at FFC was also responsible for snow 
removal and administering the building’s cleaning contracts. One clan member 
in that unit was severely tested when he found that someone on the night shift 
had defecated on the floor of the executive suites’ restroom. He accepted his 
responsibility to clean the restroom because the cleaning company could not be 
called before the executives came in for the day. The incident can also be seen as
an expression of contempt from peripheral workers, who held the same low regard
for the corporation as they felt the business had for them. 

The relative level of each of these components reflects the extent of
the consent deal being made at the individual level. All characteristics are present 
at high levels for members of the administrative clan, and all are low or absent 
for the informal periphery. Workers with mismatched or moderate levels are in 
the undifferentiated mass of the conventional core. A professional or manager 
who has high levels of autonomy, voice and flexibility but does not return high 
levels of commitment is not regarded as clan. At the other end, a temporary or
entry-level worker who works in a very restrictive job but demonstrates a high 
level of commitment would not be regarded in the same way as members of the 
informal periphery, and would be more likely to be advanced in formal position 
before others with more tenure. 
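
The sorting rule in the preceding paragraph can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the 1-to-5 scoring of each element and the `high` and `low` cutoffs are hypothetical and do not come from the ethnography or the survey; only the four elements and the three strata are taken from the text.

```python
# Illustrative sketch only: the 1-5 scores and the cutoffs are hypothetical.
ELEMENTS = ("autonomy", "voice", "flexibility", "commitment")


def consent_score(worker):
    """Additive consent scale: the sum of the four element scores."""
    return sum(worker[name] for name in ELEMENTS)


def stratum(worker, high=4, low=2):
    """Assign an informal stratum from the pattern of element scores."""
    scores = [worker[name] for name in ELEMENTS]
    if all(score >= high for score in scores):
        return "administrative clan"   # every element present at a high level
    if all(score <= low for score in scores):
        return "informal periphery"    # every element low or absent
    return "conventional core"         # mismatched or moderate levels


clerk = {"autonomy": 5, "voice": 4, "flexibility": 5, "commitment": 5}
manager = {"autonomy": 5, "voice": 5, "flexibility": 5, "commitment": 2}
temp = {"autonomy": 1, "voice": 2, "flexibility": 1, "commitment": 1}
committed_temp = {"autonomy": 1, "voice": 1, "flexibility": 2, "commitment": 5}
```

Under these assumed cutoffs the clerk falls in the clan despite a low formal position, while the manager and the committed temporary worker both land in the conventional core because their commitment and job characteristics do not match.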

This mismatch between the worker's commitment and job characteristics
offered informally by management can be seen as an indication of a boundary
through which workers are moving into and out of the clan or periphery. 
It also suggests that the direction of effects between job characteristics and 
commitment actually goes back and forth over time. Workers demonstrating 
higher commitment than their position warrants are sometimes informally 
extended greater levels of autonomy, voice and flexibility within their jobs, or 
might even be promoted to positions with the commensurate characteristics. 
Perhaps the best example of this was the Special Projects Unit, which offered 
clerical workers an “elite” status and very flexible work rules for working on 
critical problems that required initiative and creativity. 

On the other hand, workers in professional and managerial positions who 
do not demonstrate commitment commensurate with the levels of autonomy, 
voice and flexibility that come with their positions often have those characteristics 
restricted. Three examples illustrate this point. In the first case, a vice president 
who had been a college friend of the president began making greater demands 
for money and perks from the enterprise because of a successful project he had 
initiated. When this did not come quickly, he was found to be secretly negotiating 
for a position at a rival company. This act of disloyalty overcame his value to the 
organization so his autonomy, flexibility and authority were severely restricted 
until he left. In the second case, a new set of managers from a conglomerate 
that took over the enterprise imposed the same restrictions on the founder and 
president, and drove him out. Perhaps the best example was the third case of an 
administrator who was part of the clan for much of the observation period, but 
who lost status through his declining commitment. His personality was abrasive, 
and the company moved away from the technology which was his specialty, but 
his early demonstrations of commitment were sufficient to warrant clan status for 
most of his tenure. The action which precipitated his fall from the clan and his 
subsequent dismissal was telling an assistant not to perform some work requested 
of his unit. The words least compatible with clan status are the following: “I 
won't do that — it is not my job.” 


The Consent Scale 


The ethnography demonstrated that an informal structure that is important to 
organizational dynamics is observable through the four elements of the consent 
deal: autonomy, voice, schedule flexibility and commitment. I next wondered if 
this informal workplace dynamic could be observed more generally in statistical 
models constructed from survey data. This formulation would allow me to test 
propositions about the relative effects on job rewards of this subjective element 
and the more traditional structural elements used in workplace models. In the 
following, I refer to this subjective element as either “consent” or “informal
organization,” depending on the context, but it must be understood that since
the consent deal underlies informal organization, I regard them as analytically 
synonymous: two sides of the same coin.

The procedure for modeling the consent scale is simple enough, but it first 
must face a paradigmatic objection that the three elements of the consent deal
which are generally regarded as structural characteristics (autonomy, voice and
schedule flexibility) are well established as causally prior to subjective states such
as the fourth element, organizational commitment (e.g., Lincoln and Kalleberg
1990; Mathieu and Zajac 1990; Mowday, Porter and Steers 1982). This objection 
is met first by noting that some of the early proponents of structural analysis 
recognized the subjective nature of these characteristics as outcomes of power 
struggles (Kalleberg, Wallace and Althauser 1981). Secondly, when obtained from 
surveys, these characteristics are in reality self-reported subjective perceptions of 
an individual worker's situation. Hackman and Lawler (1971) demonstrated that 
these make good approximations for objective structural characteristics, but they
do not have the consistency or objectivity implied by the paradigm. For example, 
a worker's autonomy and schedule flexibility are vulnerable to changes brought 
on by a change of managers. A job can be completely revamped and incumbents 
"reined in" by a new manager without making any formal changes to job
descriptions or organizational charts. Self-reported job characteristics are also not 
objective because, when asked to rate their level of autonomy, an administrative 
clan member in a clerical position might offer the same responses as a manager, 
creating a perceptual equivalence that belies the very real differences in autonomy 
between clerks and managers based on the differences in their tasks. 

Ultimately, however, this paradigmatic concern about levels of causality is 
resolved by modeling the consent deal as a latent factor operating at a level that is 
causally prior to and measured by all four subjective or perceptual characteristics 
— a relationship measured by confirmatory factor analysis. 


METHODS AND DATA 


The data set used in the model was collected as the Indiana Quality of 
Employment Survey (Wallace, Jamison and Shin 1996), which was conducted 
in the summer of 1996 using the facilities of the Center for Survey Research at 
the Indiana University Institute for Social Research. The IQES resulted in 705 
completed cases (64 percent response rate) from across Indiana selected randomly 
from working adults (defined as people over age 18 working more than 20 
hours per week) employed in non-agriculture jobs. A randomizing procedure 
for selecting respondents from households ensured against bias on the basis of 
who answered the telephone. The questions used in constructing measures in 
this study are presented in the Appendix, arranged by factors which they were 
initially designed to measure. 

One note regarding sample size: because the dynamics of informal organization 
are different for small organizations (< 10 workers), the model is restricted to 
organizations with 10 or more workers. This reduced the sample size to 582. 


CONFIRMATORY FACTOR ANALYSIS OF CONSENT 


Given that autonomy, schedule flexibility and organizational commitment are 
themselves measured as latent factors, consent is modeled as a second-order 
latent factor underlying these first-order factors. Figure 2 depicts the model 
with parameter estimates. The number of cases for this model is 557 due to 
list-wise deletion of missing values. Estimates are computed by AMOS 3.62 
using asymptotically distribution-free estimators to compensate for distribution 
problems caused by categorical variables. (See Bollen 1989; Kline 1998.) The 
original model included freedom as a third measure of autonomy (see Appendix), 
but that produced fit statistics which indicated that the model did not fit the data 
(χ² = 51.2, df = 32, p = .016). By dropping freedom, the fit statistics supported 
the assertion that the revised model fits the data (χ² = 21.2, df = 24, p = .63). 
Even though this left two indicators for autonomy, the model is still identified 
per Kline (1998:235). Cronbach’s coefficient (α = .74) indicates that this is a 
reliable measure for consent. 
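
The reported probabilities can be checked from the chi-square values and degrees of freedom alone. The following is a minimal Python sketch, not the AMOS computation; the function name `chi2_sf_even_df` is my own, and the closed form it relies on applies only when the degrees of freedom are even, as both are here.

```python
import math


def chi2_sf_even_df(x, df):
    """Upper-tail probability of a chi-square statistic when df is even.

    For df = 2k the tail probability equals the Poisson probability of
    observing fewer than k events with mean x / 2.
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form requires a positive even df")
    half = x / 2.0
    term = 1.0   # (x/2)^0 / 0!
    total = 1.0
    for i in range(1, df // 2):
        term *= half / i   # builds (x/2)^i / i! incrementally
        total += term
    return math.exp(-half) * total


# Fit statistics as reported in the text.
p_original = chi2_sf_even_df(51.2, 32)   # model that includes "freedom"
p_revised = chi2_sf_even_df(21.2, 24)    # model after dropping "freedom"

print(f"original model: p = {p_original:.3f}")
print(f"revised model:  p = {p_revised:.3f}")
```

A probability above .05 means that the hypothesis of model fit cannot be rejected, which is the sense in which the revised model is said to fit the data. Small differences from the printed values would reflect rounding of the chi-square statistics.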


DISTRIBUTION OF INFORMAL ORGANIZATION 


A more useful form for consent is an additive scale in which the items are weighted 
by the paths from the latent factor (see Figure 2). Heise and Bohrnstedt (1970) 
offer several statistics to test the suitability of composite scales. With these data, 
their invalidity statistic for consent (ψ < .01) verifies that there is only one factor, 
their validity statistic (ρ = .88) shows a high correlation between the scale and 
the underlying factor, and along with their reliability statistic (Ω = .80), the use 
of the composite scale is supported. 
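
As an illustration of what a path-weighted additive scale involves, the sketch below multiplies each component by a weight and sums the products. It is a sketch only: the weights and the respondent's values are hypothetical placeholders and are not the estimates shown in Figure 2, and the names `weighted_scale`, `weights` and `respondent` are mine.

```python
def weighted_scale(item_scores, path_weights):
    """Sum each item score multiplied by its path weight."""
    missing = set(path_weights) - set(item_scores)
    if missing:
        raise ValueError(f"missing item scores: {sorted(missing)}")
    return sum(path_weights[name] * item_scores[name] for name in path_weights)


# Hypothetical weights standing in for the paths from the latent factor.
weights = {
    "autonomy": 0.7,
    "voice": 0.6,
    "schedule_flexibility": 0.5,
    "commitment": 0.6,
}

# Hypothetical component scores for one respondent.
respondent = {
    "autonomy": 4,
    "voice": 3,
    "schedule_flexibility": 2,
    "commitment": 5,
}

consent_score = weighted_scale(respondent, weights)
print(round(consent_score, 2))
```

Weighting by the paths gives more influence to the components that are most closely tied to the underlying factor, which is the rationale for preferring this form to a simple unweighted sum.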

The actual boundaries between informal periphery, conventional core and 
administrative clan are ill-defined and fluid, so consent really can be viewed as a 
continuous variable rather than a categorical variable. However, for analyses in 
which the categories are important — such as their relative proportions within 
workplace categories — a reasonable split can be made in which the informal 
periphery consists of workers whose consent scores are more than a standard 
deviation below the mean, and the administrative clan consists of workers whose 
consent score is more than a standard deviation above the mean. Table 1 reports 
these distributions across formal organizational positions, occupations and profit 
status, and includes the F statistic which tests for differences in means of the 
underlying consent score for each workplace category. 
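
The one-standard-deviation split can be sketched in Python as follows. This is an illustration under stated assumptions: the scores are hypothetical, the function name `classify_by_consent` is mine, and I assume the sample standard deviation, since the text does not say which form was used.

```python
import statistics


def classify_by_consent(scores):
    """Label each score by its distance from the mean of all scores."""
    mean = statistics.mean(scores)
    sd = statistics.stdev(scores)  # sample SD; an assumption, see lead-in
    lower, upper = mean - sd, mean + sd
    labels = []
    for score in scores:
        if score < lower:
            labels.append("informal periphery")
        elif score > upper:
            labels.append("administrative clan")
        else:
            labels.append("conventional core")
    return labels


# Hypothetical consent scores, not drawn from the IQES data.
scores = [18.0, 22.5, 26.0, 28.0, 28.5, 29.0, 31.0, 33.5, 39.0]
labels = classify_by_consent(scores)

for score, label in zip(scores, labels):
    print(f"{score:5.1f}  {label}")
```

Because the cut points depend on the sample mean and standard deviation, the same raw score could fall in a different category in a different sample, which is consistent with treating the boundaries as fluid.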

Figure 2. Measurement Model for Consent (Standardized Parameters) 

Table 1. Cross Tabulation of Combined Data for Informal Organization 
Groups by Formal Structural Classifications, Occupations and 
Organization Type 

                              informal   conventional  administrative          consent 
Panel A. Class (per Wright)   periphery  core          clan            cases   mean 
Non-Autonomous Worker         23.7%      70.5%          5.8%           224     25.98 
Semi-Autonomous Worker        11.9%      71.9%         16.3%           135     26.78 
Front Line Supervisor         16.2%      71.6%         12.2%            74     29.39 
Lower Manager                  8.0%      64.8%         27.3%            88     31.24 
Upper/Middle Manager           4.7%      58.1%         37.2%            43     33.38 
Large Employer-Capitalist      0%        38.9%         61.1%            18     37.05 
Total                         15.5%      68.2%         16.3%           582     28.56 
F statistic for means                                                          23.53** 
df = 5 

                              informal   conventional  administrative          consent 
Panel B. Occupation           periphery  core          clan            cases   mean 
Managers                       6.3%      61.1%         32.6%            95     32.69 
Professionals                 14.3%      64.3%         21.4%            84     29.18 
Technicians                   15.4%      80.8%          3.8%            26     25.88 
Sales Workers                  6.5%      71.7%         21.7%            46     30.49 
Office Workers                20.2%      71.4%          8.3%            84     26.91 
Service Worker                18.7%      65.3%         16.0%            75     27.68 
Production Worker              9.9%      81.7%          8.5%            71     28.39 
Laborer                       26.7%      63.4%          9.9%           101     26.05 
Total                         15.5%      68.2%         16.3%           582     28.56 
F statistic for means                                                           9.85** 

                              informal   conventional  administrative          consent 
Panel C. Organization Type    periphery  core          clan            cases   mean 
Government (F,S,L)            21.2%      68.7%         10.1%            99     26.38 
Private Company               13.4%      70.3%         16.4%           434     28.93 
Not-for-profit                22.4%      49.0%         28.6%            49     29.77 
Total                         15.5%      68.2%         16.3%           582     28.56 
F statistic for means                                                           6.47** 

Panel A equates formal organization to Wright’s (1978) formulation of social 
classes, effectively returning Wright’s formulation to its origin as stratification 
in the workplace. This panel demonstrates a relationship between formal and 
informal organization roughly depicted in Figure 1, where even workers at high 
formal positions have informal periphery-level consent scores, and workers at 
the lowest formal levels can have administrative clan-level scores. Managers and 
executives whose consent scores place them in the informal periphery might be 
considered examples of Hogan, Curphy and Hogan's assertion that the failure 
rate among executives is 50 percent and that the “base rate for managerial 
incompetence is between 60 and 70 percent” (1994:494). However, the basic trend 
in Panel A is the expected increase of the proportion of clan at increasing levels 
of formal organization and the increasing proportion of informal periphery at 
lower formal levels. 

Panel B reports that the three occupations with the largest percentage of clan 
members are managers, sales and professionals, a result which demonstrates the 
importance of sales workers to the informal administrative networking of the 
organization. The occupation types with the lowest percentage of clan and highest 
percent periphery are laborers, office workers, service workers and technicians, a 
result which demonstrates that nonprofessional office workers (e.g., secretaries, 
receptionists and account clerks) and technicians (e.g., legal assistants and licensed 
practical nurses) now have lower standing than production workers. 

Panel C reports that government and non-profit corporations have higher 
percentages of workers in the informal periphery than for-profit corporations do. 
In addition, the government has a lower percentage of people in the administrative 
clan while non-profit enterprises have the highest percentages of workers in the 
clan. This can be interpreted as supporting the assertion that profit-producing 
organizations are more concerned with generating consent than government 
corporations, and that workers in not-for-profit corporations are highly polarized, 
but these ideas need to be explored further. 


MODELS OF INFORMAL ORGANIZATION 


The final step in this analysis is to construct a workplace model that includes 
the effects of informal organization on job rewards. This model (Figure 3) draws 
conceptually on typical models used for attitudinal studies (e.g., Leicht and 
Wallace 1994; Lincoln and Kalleberg 1990; Mathieu and Zajac 1990), but differs 
in using the consent deal as a measure for position in the informal structure 
of the workplace, placed causally between structural job characteristics and 
outcomes. 

Figure 3. Workplace Model that Includes the Effects of Informal 
Organization on Job Rewards 

The model starts with individual characteristics such as age, sex, race, education 
and marital status, and then adds the respondent's workplace characteristics 
such as corporation size, scope (local to international), organization type (i.e., 
government and not-for-profit corporation) and the industry concentration 
by sales, one of Pfeffer and Salancik’s (1978) measures of the effect of the 
corporation’s environment. These individual and organizational characteristics 
are seen as influencing positional characteristics such as the level of technical 
change in the job, whether it is part time, the number of hours worked at home, 
the number of hours worked at other jobs, the substantive complexity, objective 
skill requirement (from the Dictionary of Occupational Titles), organizational 
tenure, union membership, and formal position (using Wright's categories in 
Appendix 1). All of these are modeled as determinants of position in the informal 
organization as measured by the consent deal, and the results are reported in 
Table 2. The effects of formal position in the organization (Wright's scale) are 
reported both for the individual strata and for all levels as a whole using a sheaf 
coefficient (Heise 1972). 
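
As I read Heise (1972), a sheaf coefficient summarizes a block of variables by the standard deviation of their weighted composite, scaled by the standard deviation of the outcome. The Python sketch below illustrates that arithmetic only. The dummy variables, coefficients and outcome values are hypothetical, the function name `sheaf_coefficient` is mine, and the coefficients are assumed to be unstandardized regression estimates obtained elsewhere.

```python
import statistics


def sheaf_coefficient(block_columns, coefficients, outcome):
    """Standardized effect of a block of variables taken as a whole.

    Computed as the standard deviation of the composite sum(b_k * x_k)
    divided by the standard deviation of the outcome.
    """
    if len(block_columns) != len(coefficients):
        raise ValueError("one coefficient is required per block variable")
    n = len(outcome)
    composite = [
        sum(b * column[i] for b, column in zip(coefficients, block_columns))
        for i in range(n)
    ]
    return statistics.pstdev(composite) / statistics.pstdev(outcome)


# Hypothetical class dummies (reference category coded 0 on both).
large_manager = [1, 0, 0, 0, 1, 0, 0, 0]
supervisor = [0, 1, 0, 0, 0, 1, 0, 0]

# Hypothetical unstandardized coefficients for the two dummies.
coefficients = [4.0, 2.0]

# Hypothetical consent scores for the same eight respondents.
consent = [33.0, 30.0, 26.0, 27.0, 34.0, 29.0, 25.0, 28.0]

sheaf = sheaf_coefficient([large_manager, supervisor], coefficients, consent)
print(round(sheaf, 3))
```

The coefficient is non-negative by construction, so it conveys the magnitude of the block's effect but not its direction; direction has to be read from the individual category coefficients.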

Position in the informal organization appears to be primarily determined 
by job characteristics, though with an R² of only .29, the largest portion of the 
variance remains unexplained by structural factors in the model. The strongest 
effect comes from position in the formal organizational structure, followed by 
substantive complexity, job skill, and union membership. Other determinants 
such as being male, education, not working for government, and working in 
a competitive industry appear to be mediated through formal position as they 
drop out when formal structure is entered in the model, suggesting a somewhat 
meritocratic approach to informal stratification once formal stratification is 
controlled. 

Table 2. OLS Regressions of Informal Position on Individual, 
Organizational, Job Characteristics and Formal Structural Position 

Full Models                          Individual    Organization  Job           Formal Position 

Individual characteristics 
Gender (female=1)                    -.09*         -.09          -.09*         -.06 
Race (white=1)                        .03           .02           .00           .00 
Age (by category)                     .04           .06           .00           .01 
Education (years)                    [illegible]   [illegible]   -.02           .00 
Marital status (married=1)            .03           .04          -.02          -.02 

Organizational characteristics 
Employer size (by category)                        -.02           .00           .00 
Organization Scope (local-global)                  -.04          -.02          -.02 
Not-for-profit (=1)                                 .04           .02           .01 
Government (=1)                                    [illegible]   -.11*         -.07 
Industry concentration (by sales)                  [illegible]   -.10*         -.07 

Position and job characteristics 
Technological change                                              .03           .02 
Part-time work                                                    .03           .04 
Hours at other jobs                                              -.03          -.02 
Hours worked at home                                              .07           .03 
Tenure with organization                                          .05           .02 
Substantive complexity                                            .26**        [illegible] 
Occupational skill (a)                                           [illegible]   [illegible] 
Union Membership                                                 [illegible]   -.10* 

Formal position: Wright’s class 
Class: Large Employer (= 1)                                                    [illegible] 
Class: Large Manager (= 1)                                                      .14** 
Class: Small Manager (= 1)                                                     [illegible] 
Class: First Line Supervisor (= 1)                                              .10* 
Class: Autonomous Worker (= 1)                                                 -.10 
Class Sheaf Coefficient                                                        [illegible] 

R²                                    .04           .08           .29           .29 
N = 537 

Note: Coefficients are standardized. 
(a) From Dictionary of Occupational Titles (U.S. Department of Labor). 
(b) The reference category for class is non-autonomous workers. 
*p<.05 **p<.01 (two-tailed test) 


1554 / Social Forces 83:4, June 2005 


Table 3. OLS Regressions of Work Identity, Job Satisfaction and Wages on 
Individual, Organizational and Position Characteristics 

Full Models                          Work Identity  Job Satisfaction  Wages 

Individual characteristics 
Gender (female=1)                     .01            .07              [illegible] 
Race (white=1)                        .07           [illegible]        .02 
Age (by category)                    -.03            .05               .07* 
Education (years)                    -.06           -.01              [illegible] 
Marital status (married=1)           -.01           -.03               .05 

Organizational characteristics 
Employer size (by category)           .02            .01               .02 
Organization Scope (local-global)     .01           -.01              [illegible] 
Not-for-profit (=1)                  [illegible]    -.03              -.02 
Government (=1)                       .07           [illegible]       [illegible] 
Industry concentration (by sales)    -.07            .02               .06* 

Position and job characteristics 
Technological change                  .07            .01               .01 
Part-time work                        .02           -.04              [illegible] 
Hours at other jobs                   .07            .02              -.03 
Hours worked at home                  .00           -.07               .07* 
Tenure with organization              .03           -.04               .09* 
Substantive complexity               [illegible]    [illegible]        .08* 
Occupational skill (a)                .02           -.03              [illegible] 
Union Membership                      .02            .01              [illegible] 

Formal position: Wright's class 
Class: Large Employer (= 1)          -.02           [illegible]       [illegible] 
Class: Large Manager (= 1)            .01           -.10*              .06 
Class: Small Manager (= 1)            .01           [illegible]        .04 
Class: First Line Supervisor (= 1)   -.02           -.15**             .01 
Class: Autonomous Worker (= 1)        .08           -.02              -.06 
Class Sheaf Coefficient               .08           [illegible]       [illegible] 

Informal position 
Consent                              [illegible]    [illegible]        .09** 

R²                                   [illegible]     .40               .60 
N                                    [illegible]     537               512 

Note: Coefficients are standardized. 
(a) From Dictionary of Occupational Titles (U.S. Department of Labor). 
(b) The reference category for class is non-autonomous workers. 
*p<.05 **p<.01 (two-tailed test) 

In the extended model, informal position is included among the determinants 
for three common work outcomes that are prominent in the literature: wages, job 
satisfaction and worker identity (often referred to as work commitment). These 
results are reported in Table 3, and demonstrate that informal position exerts a 
strong effect on the job rewards tested in the model. 

Position in the informal organization is the strongest factor influencing both 
the respondent's satisfaction with the job and subjective sense of identity as a 
worker. The results regarding job satisfaction are interesting in that informal 
position dwarfs the other significant factors — substantive complexity, formal 
position, being white, and working for government — a finding that suggests 
that in an economy transitioning from production to service and office work, 
job satisfaction in large organizations is derived more from informal workplace 
relationships than from anything intrinsic to the type of work performed. Another 
interesting result is that substantive complexity, a subjective measure of 
the emphasis placed on skill, is significant while the objective measure of skill 
based on the Dictionary of Occupational Titles is not. Finally, while the sheaf 
coefficient measures the magnitude of the effect of formal position, the direction 
is obvious based on the negative results of supervisors and managers relative 
to nonautonomous workers, indicating that less autonomy (or perhaps the 
responsibility that accompanies it) leads to satisfaction. The results for worker 
identity demonstrate that there are relatively few factors beyond informal position 
in this post-industrial era that make identity as a worker salient, only substantive 
complexity and working for a non-profit corporation. 

The model for wages is particularly interesting. Many factors in the model are 
significant predictors of wages, with the strongest effects coming from being male, 
objective skill requirements, formal position, union membership, and education. 
However, the results show that the net effect of informal organization is almost 
half the effect of formal organization. It has the same effect as organizational 
tenure, and has more effect than substantive complexity, the differential for age, 
working in an organization that is broader in scope, working in a position which 
requires hours at home, and working in a more highly concentrated industry. 
Again, even with the controls for objective position and job characteristics, 
this effect on one of the most visible and objective work outcomes can only be 
interpreted as a very real manifestation of the informal and intersubjective side 
of the organization. 


Conclusion 


Consent, measured by the exchange of autonomy, voice and schedule flexibility 
by employers for organizational commitment by workers, creates an informal 
workplace stratification that mirrors traditional stratifications based on structural 
factors. As a product of the intersubjective world of the workplace, consent offers 
a finer resolution on the effects of the types of experiences that affect workplace 
attitudes and behaviors than traditional objective structural measures. Created 
as a scale from survey data, it offers quantitative analysts an ability to study this 
intersubjective world across workplaces in ways that complement the findings of 
ethnographers and network analysts. This measure offers a large step in fulfilling 
Gouldner's call for reconciling the rational and natural systems models into a 
more powerful synthesis. 

While the ethnographic data support the content validity of the scale and the 
Heise-Bohrnstedt validity statistic demonstrates that the additive scale adequately 
measures the underlying factor, determinations of construct validity can only be 
established through a process described by Bollen as testing whether the measure 
“relates to other observed variables in a way consistent with theoretically derived 
predictions" (1989:189). This study begins that process by demonstrating that the 
distribution of informal periphery and clan among occupations and by formal 
position complements, but does not duplicate, formal structures. The workplace 
models continue it by demonstrating that informal organization exerts a dominant 
influence on subjective outcomes such as worker identity and job satisfaction, but 
also has an important influence on objective outcomes such as wages. 

The proposed measure of consent and the connection between consent and 
informal organization are latent but not obvious in the workplace literature. 
Perhaps one of the virtues of this analysis is that it adds no new measures to the 
already long list of variables available for workplace studies. While it is no great 
news that some function of autonomy and commitment is highly correlated with 
much of what sociologists find interesting in the workplace, this repackaging 
of variables is an important reconceptualization of the workplace. It offers a 
reduction and clarification of existing models, and an opportunity to revisit 
existing data sets as well as develop additional new data to expand the study of 
consent and informal organization. 

Subsequent analysis using consent should focus on other attitudes, perceptions, 
behaviors and outcomes. Models that include informal organization could 
shed light on how the intermix of subjective and structural factors influences 
perceptions of workplace relations, discrimination, promotion and pay equity, 
as well as broader perceptions such as the meaningfulness of the job and the 
rights of workers and management. Ultimately, it is hoped that this scale will 
round out workplace models, and facilitate the study of how workplace relations 
transcend organizational boundaries and affect worker attitudes on social factors 
not directly related to the workplace. 

This study demonstrates the power of mixed methods research. Additional 
mixed methods research should identify other informal dynamics that have clearly 
observable effects within the overall workplace. Ethnographic observations of the 
workplace should include a search for measures that can be used in surveys and 
a discussion of the dynamics that can guide the construction of models by their 
more quantitative colleagues. 


Notes 


1. This study was conducted in accordance with procedures approved by the Bloomington 
Campus Committee for the Protection of Human Subjects, Bryan Hall, Room 10a, 
Bloomington, IN 47405-1219 under protocol 497-1509. A more detailed report of results is 
still being compiled. Family Finance Corporation is a pseudonym. 


2. Discussion offered in this study revolves around my observations at FFC, but is augmented 
by years of observations during my prior career as a technician and manager of data 
processing systems. 


3. I note again that my conflicting commitments to research and to FFC became a 
consideration in my getting laid off. 


4. The clan informants who related this story to me were shocked by it, treating it as an 
example of despicable heresy. 


5. Hackman and Lawler (1971) found high correlations between employee, supervisor and 
researcher ratings for job characteristics such as variety, autonomy and task identity. This has 
supported the use of worker perceptions as an approximation for objective measures. 


6. These are computed using the communality and factor scores from SPSS's Principal Axis 
Factoring extraction method. Because Cronbach's α is known to be a lower limit, Heise and 
Bohrnstedt offer a more generalized reliability measure. 


Abstract 


The sociological study of the mental health of racial-ethnic minorities depends on 
the measurement quality of the instruments used to evaluate mental health. A 
commonly used instrument in research on mental health disparities, the Center 
for Epidemiologic Studies Depression Scale (CES-D), has not been thoroughly 
validated for use in the multiethnic and foreign-born populations currently living 
in the U.S. Using data from the National Longitudinal Study of Adolescent Health, 
this analysis provides the first multiethnic evaluation and psychometric analysis 
of the CES-D by acculturation level among youth ages 12-20. Correcting for the 
measurement problems contained in the CES-D improves the ability to detect 
differences in depression across ethnocultural groups, and to identify relationships 
between depression and other outcomes. 

In 1998, President Bill Clinton's initiative to eliminate racial and ethnic disparities 
in health, including mental health, became a national priority (Office of Minority 
Health 2002; U.S. Department of Health and Human Services 1998). Census 2000 
data, showing rapid growth in our foreign-born and non-white populations, make 
this initiative all the more imperative. Hispanic and Asian populations are the 
fastest-growing segments of American society (U.S. Census Bureau 2003). And 
by 2050, non-Hispanic whites will no longer be the majority population in the 
U.S. (U.S. Census Bureau 2000). 

To evaluate health disparities in an increasingly multiethnic population, social 
and health scientists require measurement instruments that provide culturally 
equivalent measures of health outcomes. Without such instruments, cross-ethnic 
comparisons of physical and mental health are problematic (Finch et al. 2002; 
McGee et al. 1999; Roberts 1990). 

Several health measurement instruments used in current U.S. community- 
based surveys were originally designed to measure health outcomes in a primarily 
European-American population with native English speaking abilities. These 
instruments have not been thoroughly validated for use in the multiethnic and 
foreign-born populations currently living in the U.S. (Flores 2002). 

With respect to mental health outcomes, the Center for Epidemiologic 
Studies Depression Scale (CES-D) is one of these key measurement instruments. 
Developed in 1976 for use in the general adult population (aged 18 or older), the 
standard CES-D is a 20-item self-report scale that measures depressive symptoms 
(Radloff 1977). Table 1 lists the items used in the CES-D. Although it may be 
used to identify populations at risk of developing clinical depression or anxiety 
disorders, it is not intended as a clinical diagnostic tool. Since its introduction, 
the CES-D has been used to assess depression risks in several populations 
(e.g., adolescent, elderly, ethnic and clinical populations) for whom it was not 
originally designed. Some of the national surveys that have used short (i.e., 4 to 
10 questions) and long forms (i.e., 20 questions) of the CES-D to measure the 
risk of depression include: the National Longitudinal Study of Adolescent Health 
(Add Health), the Health and Retirement Survey (HRS), U.S. National Alcohol 
Survey, the National Survey of Families and Households (NSFH), General Social 
Survey (GSS), the National Longitudinal Survey of Youth (NLSY), the National 
and Hispanic Health and Nutrition Examination Surveys (N-HANES, H-HANES), 
and The Commonwealth Fund Survey of the Health of Adolescent Girls. 

We aim to evaluate the validity of the CES-D as a tool for making cross- 
cultural comparisons regarding mental health in an adolescent population ages 
12-18. In this analysis, we focus on evaluating the measurement equivalence of 
the CES-D across four racial-ethnic groups (white, Asian, Hispanic and black). 
Within each racial-ethnic group, we also evaluate differences by immigrant 
generation (1st, 2nd and 3rd+). Through this systematic analysis of the CES-D 
we are able to identify a subset of five CES-D items that allow for comparisons 
in depression risk across the 12 ethnocultural groups in this sample. 

Table 1. Center for Epidemiologic Studies-Depression (CES-D) Scale Questions 

How often was each of the following things true during the past week? 
Answers range from 0 (never or rarely) to 3 (most of the time or all of the time). 

Negative Affect Items: 
(1) You felt depressed. 
(2) You thought your life had been a failure. 
(3) You felt sad. 
(4) You felt that you could not shake off the blues, even with help from your family and your friends. 
(5) You were bothered by things that usually don't bother you. 
(*) You felt life was not worth living. 

Positive Affect Items: 
(6) You felt hopeful about the future. 
(7) You were happy. 
(8) You enjoyed life. 
(9) You felt that you were just as good as other people. 

Somatic Complaint Items: 
(10) You didn't feel like eating, your appetite was poor. 
(11) You had trouble keeping your mind on what you were doing. 
(12) You felt that you were too tired to do things. 
(13) You felt fearful. 
(14) It was hard to get started doing things. 
(15) You felt lonely. 
(16) You talked less than usual. 
(17) I had crying spells. 
(18) My sleep was restless. 

Interpersonal Relations Items: 
(19) People were unfriendly to you. 
(20) You felt that people disliked you. 

* The item “life” is an addition to the CES-D first made by Garrison (1991). The items “crying” and 
“sleep” in numbers 17 and 18 were excluded from the CES-D contained in the Add Health survey. 
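
To make the response format in Table 1 concrete, the following Python sketch sums a set of item responses into a single score. It is an illustration rather than the scoring procedure used in our analysis: reverse-scoring the positive affect items (6 through 9) follows the usual CES-D convention and is an assumption here, since Table 1 does not state it, and the names `cesd_score` and `POSITIVE_AFFECT_ITEMS` are hypothetical.

```python
# Item numbers follow Table 1. Reverse-scoring these items is an
# assumption based on common CES-D practice, not on Table 1 itself.
POSITIVE_AFFECT_ITEMS = frozenset({6, 7, 8, 9})


def cesd_score(responses, reverse_items=POSITIVE_AFFECT_ITEMS):
    """Sum responses coded 0 to 3, reversing the positively worded items."""
    total = 0
    for item, value in responses.items():
        if value not in (0, 1, 2, 3):
            raise ValueError(f"item {item}: response must be 0, 1, 2 or 3")
        total += (3 - value) if item in reverse_items else value
    return total


# Hypothetical respondent who answers 1 to every one of the 20 items.
all_ones = {item: 1 for item in range(1, 21)}

# Hypothetical respondent with no symptoms: 0 on the negatively worded
# items and 3 on the positive affect items.
no_symptoms = {
    item: (3 if item in POSITIVE_AFFECT_ITEMS else 0) for item in range(1, 21)
}

print(cesd_score(all_ones))
print(cesd_score(no_symptoms))
```

A summed score of this kind treats every item as an equally good indicator in every group, which is the assumption that an evaluation of measurement equivalence puts to the test.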


To our knowledge, this is the first multiethnic evaluation of the CES-D for 
use with adolescents of various immigrant generations. Previous analyses have 
been limited to one or two ethnic groups including Japanese (Kikuzawa 2001), 
Chinese (Kuo 1976; Ying 1988), Koreans (Noh et al. 1998), American Indians 
(Chapleski 1997; Curyto et al. 1998), Russians (Dershem et al. 1996), and Mexican 
Americans, Puerto Ricans and Cuban Americans (Guarnaccia et al. 1989; Morton 
et al. 1989). Few analyses have evaluated the psychometric quality of the CES-D 
across acculturation levels or immigrant generations within an ethnicity (Roberts, 
Roberts and Chen 1997). 

By doing this analysis of the CES-D, we illustrate three methodological issues 
that should concern all social scientists developing scales for use in making 
cross-cultural comparisons. First, we demonstrate the importance of conducting 
multiple-group comparisons on the measurement structure of scales. Second, we 
illustrate the importance of creating scales from effect indicators (i.e., indicators 
that are determined by the latent variable or factor that they are presumed to be 
measuring). Third, we show how a failure to design cross-culturally equivalent 
measures and to model the measurement structure in analyses can lead to biased 
and inefficient estimates in statistical models. 

In the first section of this article, we discuss the use of the CES-D in the 
sociology of mental health and illness and review the psychometric literature 
on the CES-D. We then review measurement theory and the requirements for a 
cross-culturally valid measure of the risk for depression. In the second section, 
we provide information on the data, measures and analytic techniques used in 
this analysis. The third section summarizes our results, and the fourth section 
concludes. 


Background 


THE SOCIOLOGY OF MENTAL HEALTH AND ILLNESS 


In viewing emotional life as molded by social structures, Marx, Weber, Durkheim 
and Simmel began a long history of research on psychological distress (Smith- 
Lovin 1995). Since its development in 1977, the CES-D has become one of the 
most popular instruments for identifying psychological distress in community- 
based samples (Link 2002; Schwartz 2002; Vega and Rumbaut 1991). Sociologists 
have used it, with regard to socioeconomic conditions, to identify risk and 
protective factors for depression as well as the short- and long-term consequences 
of mental illness for individuals and their families (Cockerham 1989; Mirowsky 
and Ross 1989). 

Much of the research using the CES-D has been conducted on elderly 
populations, as depressive symptoms begin to increase in midlife and the elderly 
tend to have more risk factors that are associated with depressive symptoms 
(Akiyama and Antonucci 2002; Ranzijn and Luszcz 2000; Schieman et al. 
2001). Risk factors for the elderly, for example, include impairment in physical 
functioning, disability, poor self-rated health, low education, lack of social 
support, retired status and widowhood or living alone (Hughes and Waite 2002; 
Hybels et al. 2001; Schieman et al. 2001). Among adults in general, risk factors 
for depression as measured by the CES-D include homelessness, poverty and 
material hardship, homosexuality, work among single mothers, transitions into 
a care-giving role, divorce and female gender (LaGory et al. 2001; Lewis et al. 
2001; Marks et al. 2002; Petterson and Friel 2001; Sweeney and Horwitz 2001; 
Wilk 2001). Protective factors are represented commonly by the flipside of risk 
factors, with social support, network stability, volunteer work and marriage 
reducing symptoms of depression for adults (Bost et al. 2002; Simon 2002; Thoits 
and Hewitt 2001). Research that has focused on particular ethnic groups in the 
U.S. reports social support to be one of the most important protective factors 
and discrimination and acculturative stress important risk factors for depressive 
symptoms as measured by the CES-D (Finch et al. 2000; Vega, Kolody and Valle 
1986). 

The CES-D has also been used in numerous studies of risk and protective 
factors associated with adolescent or young adult emotional health, with a handful 
focusing on particular race or ethnic groups. Protective factors for adolescents 
that have been identified include the family context and in particular, parent- 
child relationships and parental support, while family risk factors include parental 
separation and divorce (Harker 2001; Videon 2002). Other family life stressors 
associated with depressive symptoms in young people include poverty and low 
socioeconomic status, single-parent household structure, and other negative life 
events (Gore et al. 1992; Harker 2001; Jackson and Finney 2002; Steinhausen 
and Metzke 2000). Involvement in certain risk behaviors has also been found to 
increase the risks of depression in youth, such as violence, especially in intimate 
adolescent relationships (Hagan and Foster 2001). Research on adolescents in 
specific ethnic groups finds that cultural orientation towards one's own ethnic 
group accompanied by low acculturation into U.S. society is associated with 
depressive symptoms among Asian American youth (Wong 2001). 

Much less research has been done on the consequences of depressive 
symptoms as measured by the CES-D, and almost all this research examines 
adult populations. For example, depression has been found to be associated with 
greater alcohol consumption in the short term, and the combination of alcohol 
use and depression increases the likelihood of disease and mortality in the long 
run (Greenfield et al. 2002). Similarly, depressive symptoms lower individuals' 
self-reports of quality of life items, such as satisfaction with family life, financial 
security and recreation (Michalos et al. 2000). Among adolescents, one study 
found higher levels of emotional distress to be associated with increased sexual 
risk behaviors and use of alcohol and drugs (Murphy et al. 2001). 

In sum, the sociological and broader social science literature has relied 
extensively on the CES-D as a reliable measure of depressive symptoms, both in 
the adult and adolescent populations and in multiple race and ethnic groups. In 
addition, various studies use the CES-D to make racial and ethnic comparisons 
in the protective and risk factors associated with depressive symptoms, as well as 
in the effects of depressive symptoms on outcomes. The starting point for such 
research, however, is establishing the different levels of depressive symptoms 
according to race and ethnicity. 

In their review of the literature on the mental health of racial-ethnic minority 
groups in the United States, Vega and Rumbaut (1991) find few consistent 
patterns. Our own review, focusing on ethnic youth, also finds few consistent 
patterns. Comparisons between African Americans and their European American 
counterparts variously find a decreased likelihood of mental health disorders or 
depressive symptoms relative to European Americans, no difference, or an increased 
likelihood of mental health disorders and symptoms (Office of the Surgeon 
General 2001; Siegel 1998). Similarly, while some studies of Hispanic and Asian 
Americans find that they are more likely to report depressive symptoms than their 
European American peers (Portes and Rumbaut 2001; Roberts and Sobhan 1992; 
Tienda and Kleykamp 2000; Ying 1988), others find quite the opposite (Office 
of the Surgeon General 2001). 

Although some of these ethnic differences in mental health may be explained 
by differences in the gender, age and socio-economic composition of each ethnic 
group (Office of the Surgeon General 2001), Vega and Rumbaut (1991) emphasize 
the importance of controlling for acculturation levels within racial-ethnic groups. 
Studies of immigrant youth have demonstrated that higher levels of acculturation 
to the U.S. are positively related to psychiatric disorder (Burnam et al. 1987; Portes 
and Rumbaut 1996; Vega et al. 1998), depressive symptoms (Harker 2001; Harris 
1999), and psychological distress (Kaplan and Marks 1990). 


Psychometric Evaluations of the CES-D 


Given the lack of consistent findings regarding differences in depression between 
ethnic youth, it is plausible that the scales used for detecting depression are 
measuring something different within groups being compared. Depression scores 
in various ethnocultural groups may be biased by response patterns that differ 
between immigrant and ethnic groups, not because a group actually has more/less 
disorders or symptoms, but because the group expresses psychopathology in a 
way not captured by measures developed from a Western conceptualization of 
depression (Vega and Rumbaut 1991). Previous studies suggest that Latinos and 
African Americans somatize their emotional/mental health into physical health 
constructs (Angel and Guarnaccia 1989; Angel and Worobe 1989; Finch 2002; 
Guarnaccia and Roberts 1990). Asians avoid expressing positive affect. They do 
not distinguish between emotional and physical well-being and identify a strong 
interpersonal component to depression (Kim 2002; Marsella et al. 1975; Noh 1998; 
Ying 1988). And some American Indian cultures have no concept of depression 
and instead focus on discouragement or demoralization (Roberts 1990). 

In addition, recent research suggests that language significantly affects 
psychopathology in ethnic populations. Feelings reported in a native language 
may be expressed with more precision and emotion than those expressed in a 
second language (Cuellar and Roberts 1984; Roberts 1989). In part due to this 
risk, the CES-D was originally not recommended for use in bilingual populations 
(Radloff 1977; Trieman 1975). 

Lastly, unfamiliarity with questionnaires, social desirability, extreme 
responding and acquiescence can produce artificial cross-ethnic differences 
(Van de Vijver and Leung 2001). Researchers have found that Hispanics tend to 
show more extreme responses than non-Hispanic whites, and that this tendency 
decreases with time in the U.S. (Hui and Triandis 1989; Marin et al. 1992; Ross 
and Mirowsky 1984); younger and better educated respondents of a variety of 
ethnic backgrounds use more midpoint responses (Stening and Everett 1984); 
and respondents who are older, Mexican or of a lower socio-economic status are 
more likely to provide affirmative answers to survey questions (Mirowsky and 
Ross 1984; Ross and Mirowsky 1984). 


Measurement Theory and the Use of the CES-D 


Use of the CES-D to measure mental health outcomes for different racial, ethnic 
and immigrant generation groups (i.e., 1st generation, 2nd generation and 
3rd+ generations) requires measurement equivalence (also called invariance) 
in the CES-D across these groups. There are two main levels of measurement 
equivalence: equivalence in model form and equivalence in model parameters 
(for more detailed expositions of these issues, see Bollen 1989). 

At the most basic level, a scale must have the same structural form within 
groups before meaningful group comparisons can be made. Comparisons of 
scale values across groups assume that scale indicators correspond to the same 
basic underlying concepts (i.e. factors or latent variables) in each group being 
compared. If these underlying concepts are equivalent across groups, then the 
factor structure is equivalent across groups. If the dimensionality of the scale 
(i.e. number of factors) differs across groups or if the patterns of fixed, free and 
constrained parameters differ between groups, then the assumption of equivalence 
in structural form is violated. 

Figure 1 illustrates the four most commonly identified structural forms for 
the CES-D: a single-factor model, a four-factor model, and two different three- 
factor models with correlations between each factor. If the CES-D is structurally 
equivalent across groups, then we would expect to confirm the existence of the 
same form in each group. In other words, one of these diagrams should hold 
for all groups. 

While structural equivalence is necessary for meaningful group comparisons, 
it is not sufficient. The next level of measurement equivalence requires that 
parameter estimates in the structural model be equivalent. These parameter 
estimates include factor loadings, factor co-variances and variances, factor means 
and intercepts, and error variances and co-variances. The equivalence of each of 
these sets of parameter estimates can be tested separately. It is up to the researcher 
to decide which parameter estimates to test for equality across groups and in 
what hierarchical order. Ultimately, the similarity in parameter values, like the 
similarity in structural form, is a matter of degree. 

Figure 1. Four Alternative Factor Patterns for the 19-item CES-D 

When the assumptions of both structural and parameter equivalence are 
violated, the scale in question may be measuring different constructs (e.g., 
discrimination or self-esteem vs. depression) in each group. In that case, it is 
inappropriate to compare group means on the scale, and common statistical 
procedures such as regressions are no longer meaningful. 

When scales are found to lack measurement equivalence across groups, 
researchers have two options. First, they can disaggregate the construct being 
measured into more discrete and equivalent sets of underlying concepts. Second, 
they can include a measurement model in their estimations. By explicitly modeling 
the measurement structure and allowing it to vary between groups, researchers 
can obtain unbiased and more efficient estimates of the primary relationships 
of interest. In this analysis, we use both approaches. 

To help us disaggregate the CES-D and identify a cross-cultural equivalent 
measure of the risk for depression in each group, we focus on the fact that the 19- 
item CES-D mixes effect indicators, cause indicators, and outcomes (Bollen and 
Lennox 1991). Techniques for the evaluation of multi-item scales (e.g., reliability 
estimates) rest on the assumption that all scale items are effect indicators. Effect 
indicators are determined by the latent variable or factor that they are presumed 
to be measuring and should be positively correlated. Causal indicators, in contrast, 
are indicators that determine the latent variable and may be either positively or 
negatively correlated with each other. Outcomes are not indicators at all but are 
consequences of the latent variable. 
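
The contrast between effect and causal indicators can be illustrated with a small simulation. This is only a sketch under assumed values: the loadings (0.8, 0.7, 0.6), the unit-variance normal errors, the sample size and the seed are all hypothetical and are not taken from the Add Health data.

```python
import random


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5


rng = random.Random(0)  # fixed seed so the sketch is reproducible
n = 5000

# Effect indicators: one latent variable drives both observed items.
latent = [rng.gauss(0, 1) for _ in range(n)]
effect_1 = [0.8 * eta + rng.gauss(0, 1) for eta in latent]
effect_2 = [0.7 * eta + rng.gauss(0, 1) for eta in latent]

# Causal indicators: two independent causes jointly determine the latent variable.
cause_1 = [rng.gauss(0, 1) for _ in range(n)]
cause_2 = [rng.gauss(0, 1) for _ in range(n)]
latent_from_causes = [
    0.6 * a + 0.6 * b + rng.gauss(0, 1) for a, b in zip(cause_1, cause_2)
]

r_effect = pearson(effect_1, effect_2)            # clearly positive
r_cause = pearson(cause_1, cause_2)               # near zero by construction
r_cause_latent = pearson(cause_1, latent_from_causes)  # positive

print(round(r_effect, 2), round(r_cause, 2), round(r_cause_latent, 2))
```

In this setup the two effect indicators correlate because they share a common cause, while the two causal indicators both influence the latent variable without being correlated with each other. Reliability estimates that rely on inter-item correlations would therefore understate the usefulness of the causal indicators.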

Figure 2 shows a hypothetical model of the 19-item CES-D with five effect 
indicators (depressed, life, happy, sad and blues), 11 cause indicators, and three 
outcomes (appetite, focus and start). As discussed by Bollen and Lennox (1991), 
“I felt depressed” and “I felt sad” are clearly effect indicators because we expect 
change in the latent depression variable to lead to a change in the response to these 
items. However, “I felt lonely” could be a cause indicator because loneliness may 
cause depression rather than vice versa. Similarly, feeling that people “disliked” 
you or were “unfriendly” could lead to depression. Appetite, focus and start are 
hypothesized to be outcomes because persons with depression often exhibit poor 
appetite, a reduced ability to concentrate or focus on what they are doing, and 
difficulty starting new projects. However, these outcomes may be associated with 
various other personality traits and/or physical and mental health problems. They 
are not exclusively indicators of depression. 

To illustrate the consequence of correctly modeling the measurement structure 
of the CES-D, we evaluate probit estimates of the effects of depression on suicidal 
ideation using the traditional CES-D scale; an all-effect indicator version of the 
scale; and a structural equation model that incorporates the measurement model. 
Results from this analysis are presented in the final section of this paper. 

Figure 2. Hypothetical Model of Effect Indicators, Cause Indicators, and 
Outcomes in the CES-D Scale 


Through these probit analyses, we show that treating variables as effect 
indicators when they are actually causal indicators leads to model specification 
error, which can bias parameter estimates and lead to incorrect assessments of 
the relationships between variables (Bollen and Ting 2000). We can best eliminate 
this bias by focusing only on effect indicators in the measurement of the CES-D 
and incorporating the measurement model into our estimation. 


Methods 


DATA 


The National Longitudinal Study of Adolescent Health (Add Health) is a 
nationally representative study of adolescents in grades 7 through 12 in the United 
States in 1995. The study was designed to explain the causes of adolescent health and 
health behavior, focusing on the multiple contexts in which young people live. 
Add Health used a multistage, stratified, school-based, cluster sampling design. 
Included in the sample were students from 80 high schools (both public and 
private) and a corresponding feeder junior high or middle school. While some 
minority ethnic groups were sampled in proportion to their size within the U.S. 
population, smaller ethnic groups, including Chinese, Puerto Rican and Cuban 
youth, were oversampled. This aspect of Add Health makes possible the 
analyses we conduct by generation within racial-ethnic group (Harris 1999). 

Add Health involves three waves of data collection and several data collection 
components. For the majority of this analysis, we use data from the in-home 
interviews conducted during the first wave of data collection. (For details on 
the Add Health design, see Bearman, Jones, and Udry 1997.) Interviews were 
completed with 20,745 youths at Wave I and 14,738 youths at Wave II. Due to 
missing data on sampling weights, we excluded 1,782 youths from the analysis. 
An additional 59 youths were excluded because data on gender, age, ethnicity 
or immigrant generation were missing, and further youths were excluded because 
their primary ethnic identity was American Indian/Native American (N = 331). 
Comparisons across immigrant generations were not appropriate for the American 
Indian/Native American subsample. Finally, after an analysis of non-response 
patterns to the 19-item CES-D measure contained in the Add Health survey, we 
excluded 113 observations for which data were missing on at least one of the 
CES-D items. The final analytic sample included 18,460 youths ages 12-21. 
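
The exclusion counts can be checked against the final sample size with a few lines of arithmetic. The sketch below assumes that the 59 youths with missing demographic data and the 331 American Indian/Native American youths are separate, non-overlapping exclusions; that reading is an assumption, although it is the one that reproduces the reported total. The variable names are illustrative only.

```python
# Wave I in-home interviews reported in the text.
wave1_interviews = 20_745

# Exclusions as described in the text (assumed to be non-overlapping).
exclusions = {
    "missing sampling weights": 1_782,
    "missing gender, age, ethnicity or immigrant generation": 59,
    "American Indian/Native American primary identity": 331,
    "missing at least one CES-D item": 113,
}

analytic_sample = wave1_interviews - sum(exclusions.values())
print(analytic_sample)
```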


MEASURES 


The primary variables of interest in this analysis are the 19 items of the CES-D 
measured in Add Health. Each item has a possible value ranging from 0 to 3 
(0 = rarely or none of the time, 1 = some of the time, 2 = a lot of the time, and 
3 = most or all of the time). As shown in Table 1, the items factor along four 
dimensions: negative affect, positive affect, somatic complaints and interpersonal 
relations (Radloff 1977). While some researchers have recommended scoring each 
dimension of the CES-D separately, most sum across all the items (positively 
worded items are reverse scored) to create a composite score ranging from 0 to 
60 (57 in our case). Higher scores indicate more prevalent depressive symptoms 
and a correspondingly greater risk of depression. Scores of 24 in females and 22 
in males are thought to represent clinical (DSM-defined) cases of depression in 
adolescents (Garrison 1991; Roberts et al. 1991). In clinical samples, these scores 
maximize the sensitivity (i.e., true positive rate) and specificity (i.e., true negative 
rate) of the CES-D instrument. (For a good overview of the properties of the 
CES-D and other widely-used symptom checklists, see Link and Dohrenwend 
1980.) 
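
The composite scoring rule described above can be sketched as follows. The item labels are placeholders (Table 1 defines the actual items), the assignment of four labels to the positive affect dimension is an assumption, and treating the cutoffs as "greater than or equal to" is likewise an assumption about how the thresholds are applied.

```python
# Placeholder labels for the 19 items; only some of these names appear in the text.
NEGATIVE_ITEMS = [
    "bothered", "appetite", "blues", "focus", "depressed", "tired",
    "failure", "fearful", "talked_less", "lonely", "unfriendly",
    "sad", "disliked", "start", "life",
]
POSITIVE_ITEMS = ["good", "hopeful", "happy", "enjoyed"]  # reverse scored


def cesd_score(responses):
    """Sum 19 items coded 0-3, reverse scoring the positively worded items."""
    expected = set(NEGATIVE_ITEMS) | set(POSITIVE_ITEMS)
    if set(responses) != expected:
        raise ValueError("responses must contain exactly the 19 expected items")
    total = 0
    for item, value in responses.items():
        if value not in (0, 1, 2, 3):
            raise ValueError(f"item {item!r} must be coded 0-3")
        total += (3 - value) if item in POSITIVE_ITEMS else value
    return total


def meets_cutoff(score, female):
    """Apply the adolescent cutoffs cited in the text (24 females, 22 males)."""
    return score >= (24 if female else 22)


# Extreme response patterns bracket the possible range of the composite.
highest = {**{i: 3 for i in NEGATIVE_ITEMS}, **{i: 0 for i in POSITIVE_ITEMS}}
lowest = {**{i: 0 for i in NEGATIVE_ITEMS}, **{i: 3 for i in POSITIVE_ITEMS}}
print(cesd_score(lowest), cesd_score(highest))
```

With 19 items each scored 0 to 3, the composite can range from 0 to 57, which matches the range noted above for the Add Health version of the scale.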

The CES-D instrument used in Add Health was missing two items: I had 
crying spells and my sleep was restless. Two items were worded differently than 
in the original CES-D: I felt that everything I did was an effort and I could not 
get going. One item was an addition to the original CES-D: I felt that life was not 
worth living. The two changes in wording were in keeping with the development 
of the CES-D for children (Faulstich et al. 1986; Weissman et al. 1980). The 
additional item (I felt that life was not worth living) was first added to the 
CES-D by Garrison and colleagues (1991) as an important component of depression 
in adolescents. 

Table 2. Sample Size by Race-Ethnicity and Immigrant Generation 
(and percentage of total sample)* 

                   Immigrant Generation
Race-Ethnicity     3rd+            2nd            1st           Total
White              9,332           527            65            9,924 (54%)
Black              3,731           205            77            4,013 (22%)
Asian              174             569            579           1,322 (7%)
Hispanic           1,013           1,418          770           3,201 (17%)
Total              14,250 (77%)    2,719 (15%)    1,491 (8%)    18,460

* unweighted 

For this analysis, we evaluate the measurement of the CES-D across 12 
ethnocultural groups constructed by cross-classifying each adolescent's primary 
race-ethnicity identification with his/her immigrant generation. Race-ethnicity 
was defined using the respondent’s self-reported ethnic identity (i.e., Hispanic/ 
Latino) and race (i.e., white, black/African American, Asian/Pacific Islander). 
The final variable included 4 ethnicity-race groups: (1) Hispanic of any race, (2) 
non-Hispanic Asian, (3) non-Hispanic black, and (4) non-Hispanic white. For 
immigrants and the children of immigrants, we validated their self-reported ethnic 
background with country of birth information (Harris 1999). Table 2 presents 
the distribution on this variable. Of the analytic sample, 54% were white, 22% 
black, 17% Hispanic, and 7% Asian. 
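
The percentages reported for the analytic sample follow directly from the unweighted totals in Table 2. The short check below is a sketch that uses only those totals; the dictionary names are illustrative.

```python
total = 18_460

# Unweighted row and column totals from Table 2.
race_ethnicity_totals = {"White": 9_924, "Black": 4_013, "Hispanic": 3_201, "Asian": 1_322}
generation_totals = {"3rd+": 14_250, "2nd": 2_719, "1st": 1_491}


def percentages(counts, denominator):
    """Percentage share of each count, rounded to the nearest whole percent."""
    return {name: round(100 * n / denominator) for name, n in counts.items()}


race_pct = percentages(race_ethnicity_totals, total)
generation_pct = percentages(generation_totals, total)
print(race_pct)
print(generation_pct)
```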

Immigrant generation is defined by a three-category variable signifying that 
the adolescent is foreign-born to foreign-born parents (1st generation), U.S.-born 
to foreign-born parents (2nd generation), or U.S.-born to U.S.-born 
parents (3rd+ generations or native) (Harris 1999). Adolescents in native-born 
families (3rd+ generations) may have grandparents or great grandparents who 
were immigrants, but because the immigration experience is much farther 
removed from the social context of their childhood and adolescent development, 
this category is considered the native population and the fundamental comparison 
group for immigrant children and the children of immigrants. The majority 
(77%) of the sample was native born (Table 2), and 2nd and 1st generation 
immigrants comprised 15% and 8% of the sample, respectively. Reflecting general 
demographic trends in the U.S., there were few blacks (N = 77) and whites 
(N = 65) and relatively larger numbers of Hispanics (N = 770) and Asians (N = 
579) within the 1st generation. 

For use in our probit models, we also included measures of age, gender 
(female = 1) and suicidal ideation. To measure suicidal ideation we combined 
data from Waves I (1995) and II (1996) of Add Health. Respondents were 
considered at risk for suicide in 1996 if they indicated that they had "seriously 
thought about committing suicide" in the 12 months preceding either their Wave 
I or Wave II interview. Due to missing data on suicidal ideation in either Wave 
I or II, our sample size for analyses using this variable was reduced to 18,397 
adolescents. Within this sample, 18% of the adolescents were at risk for suicide 
in either 1995 or 1996; 49% were female, and the average age was 16. 


STATISTICAL MODELS 


We conduct confirmatory factor analyses (CFA) and multiple group analyses 
(MGA) using a maximum likelihood structural equation program called Mplus 
Version 2.13 (Muthen and Muthen 2003). Both CFA and MGA analyses require 
the estimation of measurement models that use structural equations to describe 
the relationship between latent and observed variables while allowing for 
measurement error. In these models, all variables are measured as deviations 
from their means and, by assumption, the measurement errors are uncorrelated. 
To adjust for the complex sample design of the Add Health data, we use weights 
in all analyses and adjust the estimates for clustering at the school-level (Chantala 
and Tabor 1999). 

As with any SEM package, Mplus provides several measures of fit that can 
be used to evaluate the model. These include a χ² value, the Comparative Fit 
Index (CFI), and the Root Mean Square Error of Approximation (RMSEA). (For 
details on these fit statistics, see Bollen 1989:256-289.) We report all three of 
these statistics. The null hypothesis that the model fits is not rejected if the χ² value is less than 
expected given the degrees of freedom in the model, the CFI is approximately 
one, and the RMSEA is approximately zero. At a given level of accuracy, a more 
parsimonious model is preferable. Unlike the chi-square, the CFI and RMSEA 
are sensitive to parsimony. 
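
To make the role of parsimony concrete, the RMSEA can be computed by hand from a model's chi-square, degrees of freedom and sample size. The sketch below uses the common single-group point estimate, sqrt(max(χ² - df, 0) / (df (N - 1))); whether Mplus 2.13 used exactly this expression under weighting and clustering is an assumption. The inputs are the values reported in Table 3 for the 3rd+ generation white sample.

```python
import math


def rmsea(chi_sq, df, n):
    """Single-group RMSEA point estimate (assumed formula, see lead-in)."""
    excess = max(chi_sq - df, 0.0)  # misfit beyond what df alone would predict
    return math.sqrt(excess / (df * (n - 1)))


# (chi-square, df) pairs from Table 3, N = 9,332.
models = {
    "single-factor": (7287, 152),
    "three-factor": (5438, 149),
    "three-factor somatization": (3181, 149),
    "four-factor": (2616, 146),
}
rmsea_by_model = {name: round(rmsea(c, d, 9332), 3) for name, (c, d) in models.items()}
print(rmsea_by_model)
```

Because the excess chi-square is divided by the degrees of freedom, a model that achieves the same chi-square with fewer free parameters (more degrees of freedom) receives a lower RMSEA.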


ANALYTIC TECHNIQUES 


We begin our analysis by evaluating the dimensionality of the 19-item CES-D 
through a series of nested confirmatory factor models estimated among 3rd+ 
generation whites. Following Riddle (2002), four different factor structures are 
hypothesized (Figure 1). First, we estimate a single-factor model that represents 
the use of a composite CES-D score, and that assumes depression can be measured 
as a global, uni-dimensional construct. Second, we estimate the four-factor model 
first identified by Radloff (1977) and repeatedly replicated by others using both 
exploratory and confirmatory factor analyses (Golding and Aneshensel 1989; 
Hertzog et al. 1990; Roberts et al. 1989, 1990; Sheehan et al. 1995). Next we 
estimate two three-factor models that have been identified as superior for some 
populations (Riddle, Blais and Hess 2002). The first three-factor model treats 
the negative and positive affect dimensions as opposite poles of a single affect 
continuum. In the second three-factor solution, the negative affect dimension is 
combined with the somatic complaints dimension. This dimensionality has been 
supported by research in ethnic and non-Western communities (Guarnaccia 
et al. 1989; Iwata and Roberts 1996; Manson et al. 1990; Somervell et al. 1993; 
Ying 1988). 

To make statistical comparisons between a hierarchical pair of estimated 
CFAs, we set the metric for the latent variables by setting their variances equal 
to one. We then estimate an unrestricted CFA in which the correlations between 
latent variables are freely estimated. Finally, we estimate a second CFA in which 
we constrain the correlations between the latent variables to one. We can reject 
the null hypothesis that the latent variables are perfectly correlated if the change 
in χ² between the restricted and unrestricted models is significantly large. If we 
reject the null hypothesis, then we can conclude that the factors measure distinct 
dimensions of depression and must be treated separately. By making a series of 
these nested comparisons, we can identify the best fitting factor model for the 
native (3rd+ generations) white population. 
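
The nested comparison described above amounts to a chi-square difference test. The following is a minimal sketch of the plain (unscaled) version of that test; the fit values in the example are hypothetical, and any adjustment to the difference test required by weighting or clustering is not represented here.

```python
# Upper-tail .05 critical values of the chi-square distribution for small df.
CRITICAL_05 = {1: 3.841, 2: 5.991, 3: 7.815, 4: 9.488, 5: 11.070}


def chi_square_difference(chi_restricted, df_restricted, chi_unrestricted, df_unrestricted):
    """Return (delta chi-square, delta df) for a restricted model nested in an unrestricted one."""
    delta_chi = chi_restricted - chi_unrestricted
    delta_df = df_restricted - df_unrestricted
    if delta_df <= 0 or delta_chi < 0:
        raise ValueError("the restricted model should have more df and no smaller chi-square")
    return delta_chi, delta_df


def rejects_restriction(delta_chi, delta_df):
    """True if the restriction is rejected at the .05 level."""
    return delta_chi > CRITICAL_05[delta_df]


# Hypothetical example: fixing one factor correlation to one adds a single df.
delta_chi, delta_df = chi_square_difference(310.4, 35, 262.1, 34)
print(round(delta_chi, 1), delta_df, rejects_restriction(delta_chi, delta_df))
```

In this hypothetical case the restricted model fits much worse than the unrestricted one for a single added degree of freedom, so the hypothesis of perfectly correlated factors would be rejected and the two factors treated as distinct.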

After identifying the best fitting structural form for the measurement model, 
including its dimensionality in the native white population, we use MGA to 
evaluate the invariance of this dimensionality across our 12 ethnocultural groups. 
If we reject the hypothesis of structural invariance, this suggests that, at some 
probability level, each ethnocultural group manifests depression differently. 
Moreover, the CES-D is probably measuring something different within the 
groups being compared and mean values on the CES-D should not be used to 
make cross-group comparisons regarding the risk of depression. 

If we reject the hypothesis of structural invariance, we use regression analysis 
to explore differences between groups in factor loadings, variance and covariance 
patterns, and factor means and intercepts. Using the parameter estimates from the 
MGA for the native white population as the dependent variables, we run three sets 
of regressions on the parameter estimates for each of the 11 other ethnocultural 
groups. The first regression set evaluates the equivalence of the factor loadings 
and co-variances between native whites and the other subpopulations. The second 
regression set evaluates the equivalence of the variances and residual variances 
between the native whites and the other subpopulations. The third regression set 
evaluates the equivalence of the means and intercepts between the groups. Thus, 
a total of 33 regressions are estimated and the number of observations in each 
regression is equal to the number of parameter estimates being compared. For 
each regression, we can conclude that the estimated parameters are equivalent 
if the intercept in the regression equation is zero and the slope in the 
regression equation is one. 
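
The logic of these parameter regressions can be sketched with ordinary least squares on two short vectors of estimates. The loading values below are hypothetical and serve only to show what an intercept of zero and a slope of one look like, and what a departure from them looks like.

```python
def ols_line(x, y):
    """Return (intercept, slope) from a simple least-squares regression of y on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


# Hypothetical factor loadings for a comparison group (the regressor).
other_group = [0.62, 0.71, 0.55, 0.80, 0.67]

# Case 1: native white estimates identical to the comparison group.
native_white_same = list(other_group)
intercept_same, slope_same = ols_line(other_group, native_white_same)

# Case 2: native white estimates systematically shifted and compressed.
native_white_diff = [0.1 + 0.8 * v for v in other_group]
intercept_diff, slope_diff = ols_line(other_group, native_white_diff)

print(round(intercept_same, 3), round(slope_same, 3))
print(round(intercept_diff, 3), round(slope_diff, 3))
```

In the first case the regression line coincides with the 45-degree line, which is the pattern expected when the two groups share the same parameter values. In the second case the nonzero intercept and the slope below one signal that the estimates differ systematically between groups.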

As discussed earlier, factor analyses assume that all items in a scale are effect 
indicators. Therefore, one reason for a failure to find structural invariance across 
groups may be that a scale mixes effect, outcome and cause indicators. We 
hypothesize that only five of the CES-D items (depressed, life, happy, sad and 
blues) are effect indicators and that the remaining items in the scale are either 
outcomes of depression or cause indicators (Bollen and Lennox 1991). Using the 
Confirmatory Tetrad Analysis (CTA) designed by Bollen and Ting (2000), we test 
this hypothesis against the alternative hypothesis, assumed in the CFA, that all 
items are effect indicators. If, based on the Confirmatory Tetrad Analysis, we 
find that the new five-item CES-D is an improvement over the 19-item CES-D, 
we then repeat the previous analyses and evaluate the structural invariance of 
the five-item CES-D. 
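
The idea behind a tetrad test can be shown with model-implied covariances. A tetrad is a difference between products of pairs of covariances among four variables, for example σ12σ34 - σ13σ24, and when four items are all effect indicators of a single factor with uncorrelated errors, every such tetrad is implied to be zero. The sketch below uses hypothetical loadings and is not the Bollen and Ting test statistic itself, which also requires a sampling distribution for the sample tetrads.

```python
from itertools import combinations


def implied_covariances(loadings, factor_variance=1.0):
    """Off-diagonal covariances implied by a one-factor model with uncorrelated errors."""
    return {
        (i, j): loadings[i] * loadings[j] * factor_variance
        for i, j in combinations(range(len(loadings)), 2)
    }


def tetrads(cov, i, j, k, l):
    """The three tetrad differences that can be formed from four variables."""
    def s(a, b):
        return cov[(min(a, b), max(a, b))]

    return (
        s(i, j) * s(k, l) - s(i, k) * s(j, l),
        s(i, k) * s(j, l) - s(i, l) * s(j, k),
        s(i, l) * s(j, k) - s(i, j) * s(k, l),
    )


# Hypothetical loadings for four effect indicators of one factor.
cov = implied_covariances([0.8, 0.7, 0.6, 0.5])
vanishing = tetrads(cov, 0, 1, 2, 3)

# An arbitrary extra covariance between items 0 and 1 breaks the one-factor pattern.
perturbed = dict(cov)
perturbed[(0, 1)] += 0.2
non_vanishing = tetrads(perturbed, 0, 1, 2, 3)

print([round(t, 6) for t in vanishing])
print([round(t, 6) for t in non_vanishing])
```

Different hypotheses about which items are effect indicators, cause indicators or outcomes imply different sets of vanishing tetrads, which is what allows the competing models to be compared.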

If we do not reject the hypothesis that the structural forms are invariant 
across groups, we can move down the measurement hierarchy and formally test 
for equivalence in the parameter estimates of our measurement model. Our 
formal tests for equivalence in the parameters include the following: (1) a test 
for the equivalence of factor loadings, (2) a test for the equivalence of factor 
loadings and means, (3) a test for the equivalence of factor loadings, means and 
intercepts, (4) a test for the equivalence of factor loadings, means, intercepts and 
variance-covariances, and (5) a test for the equivalence of factor loadings, means, 
intercepts, variance-covariances and error variances-covariances. We conduct 
these equivalence tests using MGA and comparing the difference in fit statistics 
(χ², CFI, and RMSEA) of the more constrained model with equal parameters to 
the unconstrained model. For several of our analyses, differences in χ² values are 
significant but the CFI and RMSEA fit statistics suggest that the more constrained 
model fits well. Because of our large sample size, it is not unusual for the χ² 
difference to be large despite the fact that the constrained model fits well. This is 
similar to the problem in regression analyses of large samples where researchers 
find that coefficients are statistically different from zero although these differences 
are very small and not particularly meaningful. For this reason, it is more prudent 
to base our findings on the CFI and RMSEA. 

Lastly, to demonstrate the importance of correcting for measurement 
inequivalence and measurement error when using the CES-D, we estimate 
three probit regressions of suicidal ideation on depression. The first model is 
a standard probit model of suicidal ideation on the 19-item CES-D measure. 
The second model is a standard probit model of suicidal ideation on the five- 
item CES-D measure. The third probit model is estimated through structural 
equations and includes a measurement model of the five-item CES-D. To ensure 
the comparability of the coefficients on depression in these probit models, the 
standard deviations of the depression variable are adjusted to equal each other. 


Results 


EXPLORATORY ANALYSIS 


Although we question the appropriateness of Cronbach's alpha for this scale, with 
alphas ranging from .85 to .89, the 19-item CES-D appears to be highly reliable 
within each of our 12 ethnocultural groups. Moreover, these reliability estimates are 
consistent with Cronbach’s alpha measured in other samples (Doerfler et al. 1988; 
Roberts et al. 1990; Schoenbach et al. 1982; Tolor and Murphy 1985). Similarly, 
mean CES-D scores and case rates (i.e., the percentage of the sample identified with 
DSM-defined depression) are comparable to the means and rates found in other 
studies of adolescent youth (Radloff 1991; Roberts et al. 1990; Roberts, Roberts 
and Chen 1997; Vega and Rumbaut 1991). The mean CES-D score in our sample 
is 11, and 7.7% of the sample is at risk of clinical depression. A comparison of 
means across our 12 ethnocultural groups suggests that 3rd+ generation blacks; 1st 
generation Asians; and 1st, 2nd, and 3rd+ generation Hispanics are at significantly 
(p < .05) greater risk for depression than their white counterparts. Clinical cases 
of depression were lowest (4-5%) in 1st and 2nd generation blacks and highest 
(12%) in 2nd generation Hispanics. However, as discussed above, the Cronbach's 
alpha, means, and case statistics are not appropriately measured unless we find 
that the CES-D meets the criteria for invariance in structural form and parameters 
estimated for the measurement model. 
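
For reference, Cronbach's alpha is computed from the item variances and the variance of the summed score as alpha = k/(k - 1) × (1 - Σ item variances / variance of total). The sketch below implements that standard formula on hypothetical responses; it does not reproduce the weighted estimates reported above.

```python
from statistics import variance


def cronbach_alpha(items):
    """items: list of per-item response lists, one list per item, equal lengths."""
    k = len(items)
    totals = [sum(responses) for responses in zip(*items)]  # per-respondent sum score
    item_variance_sum = sum(variance(column) for column in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Hypothetical data: three items answered identically by five respondents.
identical = [[0, 1, 2, 3, 1]] * 3
alpha_identical = cronbach_alpha(identical)

# Hypothetical data: three items with imperfect agreement.
mixed = [[0, 1, 2, 3, 1], [1, 1, 2, 2, 0], [0, 2, 3, 3, 1]]
alpha_mixed = cronbach_alpha(mixed)

print(round(alpha_identical, 3), round(alpha_mixed, 3))
```

Because alpha rises with the average inter-item covariance, it presumes that the items covary as effect indicators of a common factor; a high alpha does not by itself establish that the same construct is measured in every group.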


Factor Structure Among 3rd+ Generation Whites 


The first panel of Table 3 shows the fit indices for the four hypothesized 
measurement models (i.e., the single-factor model, the three-factor model 
combining the somatic and negative affect dimensions, the three-factor model 
combining the negative and positive affect dimensions, and the four-factor model). 
The CFI value of .91 for the four-factor model is just above the conventional cutoff 
value of .90 (Bentler and Bonett 1980) and the estimated correlations between 
the four latent variables ranged between .52 and .84. While the two three-factor 
models present an improvement over the single-factor model, the chi-square 
difference test indicates that the four-factor model provides a statistically better 
fit than either the combined negative-positive affect three-factor model (Δχ² = 
2,420) or the combined somatic complaints-negative affect model (Δχ² = 319). 
Thus, we use the four-factor CFA model as the primary basis for our multiple 
group comparisons. Complete results for the four-factor model are reported in 
the online Technical Appendix on the Social Forces Web site (http://socialforces. 
unc.edu). 
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Table 3. Comparisons of Competing Models 
eem E e IR LENE, AE ETE NS EE 
Model N Chi-sq df CFI RMSEA 


K LL rtr uu, ee ee ee ee oe 


CFAs in 3rd+ Generation Whites 


Single-Factor Model 9,332 7,287 152 .804 .071 
Three-Factor Model 9,332 5,438 149 .855 .062 
Three-Factor Somatization Model 9,332 3,181 149 .917 .047 
Four-Factor Model 9,332 2,616 146 .932 .043 


o e a re ei a i i rg i t 


MGaAs in 12 Ethnocultural Groups 


Four-Factor Model (19-item CES-D) 18,460 9:256 20915752 .908 .053 
Single-Factor Model (5-item CES-D) 18,460 192 60 .992 .038 


0008 00 r 


Confirmatory Tetrad Analysis in 3rd+ Generation Whites 


All-Effect Model 9,332 1,907 119 — — 
Cause-Effect-Outcome Model 9,332 1,643 83 — — 
Cause-Effect Model 9,332 1,267 54 — — 





MGA Equivalence Tests for 5-Item CES-D in 12 Ethnocultural Groups 


(1) Factor Loadings (FL) 18,460 349 104 .985 .039 
(2) FL + Means (M) 18,460 400 115 .982 .040 
(3) FL+ M + Intercepts (I) 18,460 605 159 972 .043 
(4) FL + M +1 + Variances (V) 18,460 629 170 971 .042 
(5) FL -- M 4 I *- V 4 Error Variances 18,460 1,498 225 .920 .061 





Note: The power of the chi-square difference test is larger in big samples than in small ones. 

Given the large size of our sample, it is helpful to examine other measures of fit as well. The CFI and 
RMSEA statistics are less sensitive to sample size and take account of the degrees of freedom in the 
model. Mplus version 2.13, which was used for this analysis, did not allow estimation to simultaneously 
account for design effects and the categorical nature of the CES-D. Therefore, all models were estimated 
with and without clustering and weighting. Our results were robust to the estimation technique. 
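
The RMSEA values in Table 3 can be approximately recovered from the printed chi-square, degrees of freedom and sample size. The sketch below assumes one common definition of the RMSEA, sqrt(max(chi-square - df, 0) / (df x N)), multiplied by the square root of the number of groups for multiple-group models. The function name and the choice of N rather than N - 1 are illustrative assumptions, not a description of the software's internals.

```python
import math


def rmsea(chi_sq: float, df: int, n: int, groups: int = 1) -> float:
    """Root mean square error of approximation.

    Assumed definition: sqrt(max(chi_sq - df, 0) / (df * n)), scaled by
    sqrt(groups) for multiple-group models.
    """
    excess = max(chi_sq - df, 0.0)
    return math.sqrt(groups) * math.sqrt(excess / (df * n))


# (chi-square, df, N, number of groups) as printed in Table 3
table3_rows = {
    "Single-Factor Model": (7287, 152, 9332, 1),
    "Three-Factor Model": (5438, 149, 9332, 1),
    "Three-Factor Somatization Model": (3181, 149, 9332, 1),
    "Four-Factor Model": (2616, 146, 9332, 1),
    "Single-Factor Model (5-item CES-D)": (192, 60, 18460, 12),
}

rmsea_by_model = {name: rmsea(*row) for name, row in table3_rows.items()}
for name, value in rmsea_by_model.items():
    print(f"{name}: RMSEA = {value:.3f}")
```

Because the statistic divides the excess chi-square by both the degrees of freedom and the sample size, it is far less sensitive to sample size than the chi-square itself, which is the point made in the table note.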


MULTIPLE GROUP AND REGRESSION ANALYSIS OF THE 19-ITEM 
CES-D 


The second panel in Table 3 shows the fit statistics for the multiple group 
comparison of the 19-item CES-D. These results reveal that the four-factor 
measurement model does not fit the data well for the 11 other ethnocultural 
groups in our sample. We can reject the hypothesis that the structural form of 
the CES-D is equivalent within each of our ethnocultural groups. Thus, for one 
or more of our ethnocultural groups, one of the alternative measurement models 
pictured in Figure 1 may be more appropriate. 

As described above, we explore the sources of variation in the measurement 
models between 3rd+ generation whites and each of the 11 other ethnocultural 
groups using regression analyses. These results (not shown) indicate that the 
lack of structural invariance between groups may be driven not only by differences 
in the dimensionality of the CES-D within each group but also by differences between 
groups in the loadings, variance-covariance patterns, factor means and factor 
intercepts. The differences in parameter estimates from the MGA are greatest for 
Hispanics of all generations, 2nd and 3rd+ generation blacks, and 3rd+ generation 
Asians. In contrast, the measurement model identified for the 3rd+ generation 
whites appears to fit 1st and 2nd generation whites well.13 


CONFIRMATORY TETRAD ANALYSIS 


We hypothesize that the four-factor model does not fit the non-white groups 
well because the current CES-D mixes effect indicators of depression, causal 
indicators of depression and outcomes of depression. Because classic measurement 
theory and factor analyses presume that all indicators are effect indicators, the 
inclusion of causal and outcome indicators in the CES-D leads to model 
specification error, which can bias parameter estimates and lead to incorrect 
assessments of the relationships between variables (Bollen and Lennox 1991). 
Using Confirmatory Tetrad Analysis (Bollen and Ting 2000) with the native white 
subsample14, we tested whether all items in the CES-D were effect indicators 
(all-effect model); whether only the items “blues,” “depressed,” “happy,” “sad” 
and “life” are effect indicators and the other items are cause indicators 
(cause-effect model); and whether the items “appetite,” “focus” and “start” 
are better treated as outcomes (cause-effect-outcome model).15,16 

Because these models are nested within each other, identifying the best 
fitting model simply requires calculating three chi-square differences. The null 
hypotheses in these tests are that the cause-effect-outcome model is a better fit 
than the all-effect model; the cause-effect model is a better fit than the all-effect 
model; and the cause-effect model is a better fit than the cause-effect-outcome 
model. As reported in the third panel in Table 3, results from the Confirmatory 
Tetrad Analysis indicate that the cause-effect model is the best fitting model. 
Because the inclusion of cause indicators in the CES-D violates classic test theory, 
we can improve the measurement quality of the CES-D by eliminating these 
indicators. This leaves us with a new five-item CES-D. 
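
The three nested comparisons can be reproduced from the test statistics in the third panel of Table 3. The sketch below is illustrative only: it takes the printed statistics and degrees of freedom at face value, and it approximates the upper-tail probability with the Wilson-Hilferty normal approximation (an assumption made here to avoid external libraries, not the routine used in the original analysis).

```python
import math


def chi_sq_sf_approx(x: float, df: int) -> float:
    """Approximate upper-tail p-value of a chi-square statistic.

    Wilson-Hilferty normal approximation: adequate for a rough check,
    not a substitute for an exact chi-square routine.
    """
    z = ((x / df) ** (1 / 3) - (1 - 2 / (9 * df))) / math.sqrt(2 / (9 * df))
    return 0.5 * math.erfc(z / math.sqrt(2))


# (test statistic, df) from the third panel of Table 3
models = {
    "all-effect": (1907, 119),
    "cause-effect-outcome": (1643, 83),
    "cause-effect": (1267, 54),
}

# (less restricted model, more restricted model) pairs named in the text
comparisons = [
    ("cause-effect-outcome", "all-effect"),
    ("cause-effect", "all-effect"),
    ("cause-effect", "cause-effect-outcome"),
]

differences = {}
for less, more in comparisons:
    d_chi = models[more][0] - models[less][0]
    d_df = models[more][1] - models[less][1]
    differences[(less, more)] = (d_chi, d_df, chi_sq_sf_approx(d_chi, d_df))
    print(f"{less} vs. {more}: difference = {d_chi}, df = {d_df}")
```

Each difference is large relative to its degrees of freedom, which is consistent with the conclusion that the least restrictive specification, the cause-effect model, fits best.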


MULTIPLE GROUP ANALYSIS OF THE FIVE-ITEM CES-D 


By conducting a new MGA of the five-item CES-D, we evaluated the extent to 
which our shortened CES-D improved the structural invariance among our 12 
ethnocultural groups. As before, the measurement model fitted for the 3rd+ 
generation whites is set as the comparison group. Complete results for this 
model are provided in the online Technical Appendix.17 All the fit indices from 
the MGA clearly support the hypothesis that the five-item CES-D is structurally 
invariant across the groups (see Table 3, panel 2).18 

Since we do not reject the hypothesis that the structural form of the five-item 
CES-D is invariant across the groups, we move down the measurement hierarchy 
to evaluate the equivalence of parameters. The last panel of Table 3 provides the 
results of these equivalence tests. We fail to detect substantively large differences 
in the parameter estimates (i.e., loadings, covariances, variances, means and 
intercepts) for all indicators considered together.19 These results support the 
conclusion that all parameter estimates are equivalent across the groups. In 
contrast to the 19-item CES-D, the five-item index of depressive symptoms can 
be appropriately used to make inter-group comparisons.20 


ADDITIONAL PSYCHOMETRIC PROPERTIES OF THE FIVE-ITEM CES-D 


Since each of the items in our shortened CES-D scale is an effect indicator, we 
can appropriately evaluate the internal consistency of the scale by looking at 
inter-item correlations and standard measures of reliability such as Cronbach's 
alpha. The Cronbach’s alpha for the shortened CES-D is .78 in the full sample. 
The alpha is somewhat lower for non-white, first-generation immigrant youth 
(.74) and blacks (.73).21 
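
For readers who want to check internal consistency on their own data, the following is a minimal sketch of the standard Cronbach's alpha formula, alpha = k/(k - 1) x (1 - sum of item variances / variance of the total score). The response matrix is hypothetical and is not drawn from Add Health; the function name is likewise illustrative.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(rows[0])
    item_vars = [variance([row[i] for row in rows]) for i in range(k)]
    total_var = variance([sum(row) for row in rows])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical responses to five items scored 0-3 (not Add Health data)
responses = [
    [0, 0, 1, 0, 0],
    [1, 1, 1, 0, 1],
    [2, 1, 2, 1, 1],
    [3, 2, 3, 2, 2],
    [1, 0, 0, 1, 0],
    [2, 2, 2, 1, 3],
]

alpha = cronbach_alpha(responses)
print(f"Cronbach's alpha = {alpha:.2f}")
```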

We can also consider the appropriateness of using the scale items to create 
a linear composite as a substitute for the latent variable in regressions or other 
similar analyses (Bollen and Lennox 1991). One common linear composite is an 
equally weighted sum of the scores on each item. For our short CES-D measure, 
this would produce a score ranging from 0-15 for each respondent. For our 
sample, the mean on this score is 2.5. It varies from 2.0 among 1st generation 
whites to 3.1 among 2nd generation Hispanics. Comparisons of means across 
groups indicate that black 3rd+ generation youth and Hispanic 2nd and 3rd+ 
generation youth are at greater risk of depression than their white counterparts. 
Thus, the pattern of mean differences in the CES-D scores using the five-item 
scale is similar to the pattern identified using the 19-item scale. While the same 
general pattern for differences in means across ethnicity and generation was 
identified using the 19-item and five-item scales, this does not diminish the 
importance of using a structurally invariant measure of the risk of depression or 
any other scale. The lack of structural invariance can have substantial effects on 
mean differences. Moreover, as can be seen in the next section, the measurement 
structure of a scale can influence the relationships between other independent 
and dependent variables in multivariate analyses. 
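
The equally weighted composite described above can be sketched as follows. The 0-3 coding of each item follows from the stated 0-15 range for five items; reverse-coding the positively worded item “happy” is an assumption made here for illustration, as are the function and variable names.

```python
ITEMS = ["blues", "depressed", "happy", "sad", "life"]
REVERSE_CODED = {"happy"}  # assumption: the positively worded item is reversed
MAX_ITEM_SCORE = 3         # each item scored 0-3, giving a 0-15 composite


def composite_score(answers: dict) -> int:
    """Equally weighted sum of the five item scores."""
    total = 0
    for item in ITEMS:
        score = answers[item]
        if not 0 <= score <= MAX_ITEM_SCORE:
            raise ValueError(f"{item} must be between 0 and {MAX_ITEM_SCORE}")
        if item in REVERSE_CODED:
            score = MAX_ITEM_SCORE - score
        total += score
    return total


# Hypothetical respondent (not Add Health data)
respondent = {"blues": 1, "depressed": 0, "happy": 3, "sad": 1, "life": 0}
score = composite_score(respondent)
print(score)
```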

Using a composite score poses no problem as long as depression is being 
evaluated as the dependent variable. However, using a composite score as an 
independent variable may introduce bias into any estimation unless the squared 
correlation between the composite score and the latent variable depression is high 
(a correlation of at least .90 is recommended by Bollen 1980). 

To calculate the squared correlation of the index and its latent variable, we 
used the formula for reliability given by Bollen (1980): 


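\[
\text{reliability} = \frac{\left(\sum_i \text{factor loading}_i\right)^2 \mathrm{Var}(\text{latent variable})}{\left(\sum_i \text{factor loading}_i\right)^2 \mathrm{Var}(\text{latent variable}) + \sum_i \mathrm{Var}(\text{factor error}_i)}
\]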


The reliability of our five-item CES-D is .81. This means that 19% of the 
variance in the five-item CES-D is attributable to measurement error. The 
reliability is not high enough for a researcher to safely ignore the measurement 
error. Thus, whenever possible, researchers should follow our 
example below and incorporate a measurement model into their estimations. 
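
The reliability formula above can be computed directly from standard factor-analysis output. The sketch below uses hypothetical loadings and error variances, not the estimates from this analysis, and the function name is illustrative.

```python
def composite_reliability(loadings, latent_variance, error_variances):
    """Squared correlation between an equally weighted sum of items and the latent variable.

    (sum of loadings)^2 * Var(latent) divided by that quantity plus the
    sum of the item error variances.
    """
    true_part = sum(loadings) ** 2 * latent_variance
    return true_part / (true_part + sum(error_variances))


# Hypothetical values (not the estimates reported in the article)
loadings = [1.0, 0.9, 0.8, 1.1, 0.7]
error_variances = [0.5, 0.6, 0.7, 0.4, 0.8]

reliability = composite_reliability(loadings, 1.0, error_variances)
error_share = 1 - reliability  # share of composite variance due to measurement error
print(f"reliability = {reliability:.2f}, error share = {error_share:.2f}")
```

One minus the reliability gives the share of the composite's variance that is measurement error, which is how a reliability of .81 translates into the 19% figure in the text.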


ILLUSTRATIONS OF THE EFFECTS OF BIAS IN THE CES-D 


In Table 4, we present the results of our three probit estimations of suicidal 
ideation. The estimated relationship between suicidal ideation and depression 
changes substantially across the models. The 19-item composite has the second 
highest coefficient of the three approaches. This is not surprising given our 
support for the mixed cause-effect model where some of these measures were 
determinants of depression while others were effects of it. As such, the indicators 
are associated with depression. Adding indicators together results in an index that 
will correlate with the latent depression variable even if the indicators are not all 
effects of the construct. There are two main problems with this strategy: (1) even 
an index of 19 items is not a replacement for the latent variable, and (2) our earlier 
tests showed that the relation of these 19 indicators to the depression construct 
is not the same across groups. The former problem means that measurement 
error is present in the depression measure and will bias the estimates. The latter 
means that the composite scores are not directly comparable across groups, and 
this too can distort the estimates. 

The estimated relationship between depression and suicidal ideation is almost 
20% lower when we use the five-item CES-D in place of the 19-item CES-D. The 
estimated relationship is over 10% higher when the five-item CES-D is used and 
we adjust for the measurement error in the scale. These results demonstrate a 
sizable upward bias in the estimated relationship between depression and suicidal 
ideation due to the structural inequivalence of the scale across ethnocultural 
groups. In other words, the full CES-D predicts suicidal ideation better than 
the five-item form, but it achieves this because it taps constructs other than 
depression. 
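
The percentage differences quoted above follow from the depression coefficients in Table 4. The short calculation below takes the three coefficients as read from that table (.540, .438 and .608) and treats the 19-item estimate as the baseline; the helper name is illustrative.

```python
def percent_change(new: float, baseline: float) -> float:
    """Percentage difference of `new` relative to `baseline`."""
    return 100 * (new - baseline) / baseline


coef_19_item = 0.540     # standard probit, 19-item composite
coef_5_item = 0.438      # standard probit, five-item composite
coef_5_item_sem = 0.608  # SEM estimation, five-item measurement model

change_5_item = percent_change(coef_5_item, coef_19_item)
change_sem = percent_change(coef_5_item_sem, coef_19_item)
print(f"five-item composite: {change_5_item:.1f}%")
print(f"five-item SEM: {change_sem:+.1f}%")
```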

This bias not only affects the results on the depression variable but it also 
affects independent variables included in the model. For example, under the 
correctly specified model, we find that blacks are approximately 50% less 
likely to report suicidal ideation, whereas Asians are 100% more likely to report 
suicidal ideation. The effects of being in the 1st generation are reduced 14%; 
the effects of being female are increased over 50%; and the effects of age are 
reduced to near zero. These are sizable and substantive changes. Because of the 
correlation between other independent variables and measurement error in the 
depression index, the use of standard probit estimation techniques also results 
in considerable inefficiency in the estimates of other independent variables. The 
standard errors of the estimates in the SEM model are reduced considerably 
and significant relationships are more easily detected. Clearly, the structural and 
measurement problems with the original CES-D have important consequences 
for the conclusions we make regarding the effects of depression risk and other 
correlated variables on adolescent well-being. 


Discussion 


As the diversity of the U.S. population increases, researchers have sought to 
identify and explain mental health disparities between minority youth and 
the majority population. Based on this research, mental health providers and 
policymakers are designing treatments and interventions to eliminate mental 
health disparities. 

The legitimacy of these research findings and the programs and policies 
developed from them hinges, in part, on the validity of the instruments used 
to measure mental health outcomes. To allow cross-ethnic or cross-cultural 
comparisons, survey instruments designed to measure mental health must 
be evaluated for their psychometric or measurement equivalence within each 
ethnocultural group. 

In this analysis, we have shown that the most commonly used survey 
instrument to measure mental health, the CES-D, is not psychometrically 
equivalent across either race-ethnicity or immigrant generation. As a result, 
research using the CES-D may lead to erroneous conclusions about mental health 
disparities in the U.S. adolescent population. We have designed an abbreviated, 
five-item version of the CES-D that can be used for cross-cultural comparisons 
among adolescents living in the U.S. 

The five items included in our abbreviated CES-D (i.e., blues, depressed, happy, 
sad and life) are all effect indicators that loaded strongly on either the positive 
affect or negative affect dimensions of the full CES-D. Thus, the abbreviated 
CES-D has been stripped of causal indicators, outcomes of depression and items 
related to somatic complaints and interpersonal relationships. Previous research 
has shown that somatic complaint items and items regarding interpersonal 
relationships are more likely to be biased by health status, gender, age and race 
than other items in the CES-D (Cole 2000; Office of the Surgeon General 2002; 

Table 4. Probit Models for Effects of Depressive Symptoms on Suicidal Ideation 

                      Standard Estimation     Standard Estimation     SEM Estimation 
Variables             Betas     (s.e.)        Betas     (s.e.)        Betas     (s.e.) 
CES-D 19-Item(a)      .540      [illegible]   —         —             —         — 
CES-D 5-Item(a)       —         —             .438      (.017)        .608      (.008)** 
White                 —         —             —         —             —         — 
Black                 -.302     (.052)**      -.241     (.047)**      -.172     (.026)** 
Asian                 .064      (.084)        .095      (.080)        .131      (.056)* 
Hispanic              -.058     (.058)        -.012     (.058)        .050      (.031) 
1st Generation        -.409     [illegible]   -.357     (.074)**      -.350     [illegible] 
2nd Generation        -.097     (.061)        -.060     (.060)        -.051     (.031) 
3rd+ Generation       —         —             —         —             —         — 
Female                .204      (.034)**      .182      (.034)**      .316      (.016)** 
Age                   -.047     (.009)**      -.029     (.008)**      -.006     (.004) 
Constant              -1.000    (.147)**      -1.345    [illegible]   —         — 
Chi-sq (df = 21)      —         —             —         —             2,798     — 
F (8, 121)            140       —             130       —             —         — 
Observations          18,397    —             18,397    —             18,397    — 

(a) To allow for comparisons across models, the 19-item and five-item CES-Ds have been transformed 
so that their standard deviations equal .896, the standard deviation of the latent variable for 
depression in the SEM estimation. 

* p-value < .05, ** p-value < .01 





Vega and Rumbaut 1991). Moreover, these items are more likely to be related 
to other constructs (e.g., loneliness, discrimination and physical illness) and 
have less discriminant validity. On a more technical level, the use of causal 
indicators and outcome indicators in the CES-D violates the assumptions of 
classical measurement theory. As a result, standard psychometric statistics such 
as Cronbach’s alpha and factor analysis are not valid for the full CES-D. 

Previous research on the effects of reducing the length of the CES-D scale 
shows that, on average, five-item versions of the CES-D are nearly as sensitive 
and specific as the full 20-item scale (Shrout and Yager 1989). The selection of 
items for these abbreviated scales is typically based on high correlations with the 
total CES-D score, discriminant function analyses or factor analyses that focus 
on items with the highest loadings in each factor (Andresen et al. 1994; Shrout 
and Yager 1989). These selection methods, however, presume that all items are 
effect indicators and that the scale is psychometrically equivalent across groups. 
Although derived through different means, our five-item CES-D is quite similar to 
other abbreviated versions of the CES-D. The items “depressed,” “sad” and “happy” 
are commonly included in most abbreviated versions of the CES-D (Andresen et 
al. 1994; Kohout et al. 1993; Shrout and Yager 1989).23 The item “life” is not often 
included and may sometimes be unavailable to researchers using adult versions 
of the CES-D scale. Fortunately, this poses no problem since the “life” item can 
be eliminated from our abbreviated CES-D without undermining the invariance 
and reliability of the shortened scale. 

While our proposed five-item CES-D improves the cross-cultural 
comparability of the CES-D, researchers studying the causes and consequences 
of depression within a particular ethnocultural group may prefer using more 
culturally-grounded instruments for depression. This is especially true when 
researchers aim to diagnose clinical depression in minority populations. Through 
our analysis we identify five items that measure aspects of depression common to 
our ethnocultural groups, and that can be included for comparative purposes in 
population-based surveys. However, culturally bounded measures of depression 
may be more appropriate when the researchers do not intend to make cross- 
cultural comparisons. 

Finally, we do not claim that the five-item CES-D developed and evaluated 
here is the best cross-cultural measure of depression risk. Other researchers may 
extend this analysis by evaluating alternative measurement models using other 
items that are effect indicators and that allow for measurement equivalence 
across ethnocultural groups. Others may also extend this analysis by evaluating 
the equivalence of the scale within smaller ethnocultural groups and by gender 
within these groups. We have based this analysis on the pan-ethnic categories 
most commonly used by social scientists and health researchers. An important 
next step is to establish the equivalence of the CES-D within these pan-ethnic 
categories. 

Although focused on the CES-D, this research has called attention to the 
overall importance of evaluating the structural equivalence of multiple-item scales 
before meaningful group comparisons can be made. The common techniques 
used to evaluate the reliability and empirical validity of multiple-item scales are 
not sufficient to establish the appropriateness of a scale. As the diversity of the U.S. 
population grows and as researchers increasingly adapt measurement instruments 
for use in multiethnic populations, evaluations of the cross-cultural comparability 
of these instruments must become a priority. 


Notes 


1. The rate of item non-response did not differ across items. However, persons who did 
not respond to at least one item were more likely to be non-white and members of the 
first generation. In fact, logit analyses indicated that first generation immigrants were more 
likely than U.S.-born, 3rd+ generation immigrants to have missing data on all CES-D items; 
and 2nd generation immigrants were less likely to have missing data on three CES-D items 
(sad, good and hopeful). Controlling for immigrant generation, only three significant 
race-ethnicity differences in item non-response were found. Hispanics were more likely not 
to respond to the CES-D items failure, unfriendly and disliked. There were no significant 
differences in item non-response by gender or age. Because item non-response was so minor, 
these differences do not bias our analyses. 


2. The vast majority of adolescents in Add Health are ages 13-19. Adolescents younger than 
13 or older than 19 either entered school early or were held back during high school. 


3. In studies of item-response biases, the “crying” item, the item “talked less,” and two 
interpersonal items, “people unfriendly” and “people disliked me,” have been found to be 
gender-, age- and race-biased (Cole et al. 2000; Stommel et al. 1993). 


4. Using LISREL notation, the measurement model for observed independent and dependent 
variables can be written respectively as the following: 


\[ x = \upsilon_x + \Lambda_x \xi + \delta \qquad (1) \]
\[ y = \upsilon_y + \Lambda_y \eta + \varepsilon \qquad (2) \]


In equations (1) and (2), $\upsilon$ (upsilon) is the intercept, $\Lambda$ (lambda) is a matrix of factor loadings 
describing the expected change in the observed vector of x- and y-variables for a unit change 
in the exogenous latent variable vector $\xi$ (xi) and the endogenous latent variable vector $\eta$ (eta). 
The measurement errors are given by the vectors $\delta$ (delta) and $\varepsilon$ (epsilon). The covariances 
among these errors constitute $\Theta_\delta$ and $\Theta_\varepsilon$ respectively. The factor means are equal to $\kappa$ (kappa) 
such that $E(\xi) = \kappa$. The structural part of the model is given by the following: 


\[ \eta = \alpha + B\eta + \Gamma\xi + \zeta \qquad (3) \]


Regarding the above equation, $\alpha$ is the intercept, $\Gamma$ gives the expected effects of the exogenous 
latent variables on the endogenous latent variables, $B$ describes the relationship among endogenous 
latent variables, and $\zeta$ is a vector of errors. The covariance among exogenous 
latent variables is given by $\Phi$. In these models, $E(\zeta) = E(\varepsilon) = E(\delta) = 0$, $E(\zeta\xi') = E(\varepsilon\eta') = 
E(\delta\xi') = 0$, and E(,&;) = 0 for i ≠ j. 


5. Ideally, we would like to account for the fact that each CES-D item is a categorical (not 
continuous) variable. We used Mplus version 2.13. It and the other SEM packages that were 
available at the time of analysis were not able to simultaneously account for both the design 
effects and categorical nature of our observed variables. The use of clustering and weighting 
in SEM is an area of intense methodological research (Satorra and Muthen 1995). For this 
reason, we tested the sensitivity of our results to different estimation approaches. First, we 
estimated models that took into account the categorical nature of the CES-D items in lieu 
of the design effects. Our results were not altered. Second, we estimated all models with and 
without adjustments for clustering and weighting. Again, our results were not altered by 
differences in the estimation. For the interested reader, precise information about estimation 
with clustering and weights in Mplus can be found in Asparouhov (2004). 


6. Additional two-factor and three-factor models were evaluated. In the two-factor models, 
one factor was presumed to be one of the following: negative affect, positive affect, somatic 
complaints or interpersonal relations. All the other items were presumed to load on a single 
second factor. In the other three-factor models, the somatic complaints and interpersonal 
relations factors were combined; the somatic complaints and positive affect factors were 
combined; the negative affect and interpersonal relations factors were combined; and 
the positive affect and interpersonal relations factors were combined. Thus, we actually 
compared all possible single-factor, two-factor, three-factor and four-factor solution models 
by estimating a total of 12 CFAs. 


7. To estimate these models, we must also set the variance of the latent variables to 1. This 
sets the metric of the latent variables. 


8. Three separate regression sets are required because of the differences in the scales of each 
set of parameter estimates. 


9. Our focus on these five items as effect indicators was driven by previous empirical research 
and discussions with psychologists. This is a quasi-empirical approach designed to reduce 
the number of alternative hypotheses regarding the structure of the scale to be tested. A fully 
empirical approach would require testing every possible combination of effect, outcome 
and cause indicators using the nineteen items in the CES-D scale. There are several 
thousand possibilities. Thus, a fully empirical approach would be infeasible. 


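10. Using LISREL notation, these hypotheses can be written as the following: 
(1) $\Lambda^{1} = \Lambda^{2} = \dots = \Lambda^{12}$, (2) $\kappa^{1} = \kappa^{2} = \dots = \kappa^{12}$, 
(3) $\upsilon^{1} = \upsilon^{2} = \dots = \upsilon^{12}$, (4) $\Phi^{1} = \Phi^{2} = \dots = \Phi^{12}$, and 
(5) $\Theta^{1} = \Theta^{2} = \dots = \Theta^{12}$. 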


11. We have reported the Cronbach's alpha because it is so commonly used as a measure of 
reliability for the CES-D and other scales. However, Cronbach's alphas are only appropriate 
if a scale is uni-dimensional with no correlated errors between items, equal factor loadings 
across items and equal error variances. 


12. A Bonferroni adjustment for multiple group comparisons was made. 


13. Detailed results from the Multiple Group Analysis and the regression analyses are 
available in an online Technical Appendix (see the Social Forces Web site: 
http://socialforces.unc.edu). 


14. Bollen (1991) derived a test statistic, T, that tests multiple vanishing tetrads simultaneously: 
$T = N t' \Sigma_{tt}^{-1} t$, where N is the sample size, t is a vector of the independent sample tetrad differences, 
and $\Sigma_{tt}^{-1}$ is the inverse of the covariance matrix of the limiting distribution of t as N goes to 
infinity. The T statistic asymptotically approximates a chi-square statistic with degrees of 
freedom equal to the number of vanishing tetrads considered in the test. Therefore, differences 
between nested models can be evaluated as the change in chi-squares between the restricted 
and unrestricted model. 
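
To illustrate what a vanishing tetrad is, the sketch below builds the covariances implied by a one-factor model with four effect indicators and checks that the three tetrad differences are zero. The loadings and factor variance are hypothetical, and the code only shows the population-level identity; it does not compute the test statistic T.

```python
def tetrad(cov, i, j, k, l):
    """Tetrad difference sigma_ij * sigma_kl - sigma_ik * sigma_jl."""
    return cov[i][j] * cov[k][l] - cov[i][k] * cov[j][l]


# Hypothetical one-factor model: sigma_ij = loading_i * loading_j * phi for i != j
loadings = [0.9, 0.8, 0.7, 0.6]
phi = 1.0
# Diagonal entries omit error variances; tetrads use only off-diagonal covariances.
cov = [[li * lj * phi for lj in loadings] for li in loadings]

tetrads = [
    tetrad(cov, 0, 1, 2, 3),  # sigma_12 * sigma_34 - sigma_13 * sigma_24
    tetrad(cov, 0, 2, 3, 1),  # sigma_13 * sigma_42 - sigma_14 * sigma_32
    tetrad(cov, 0, 3, 1, 2),  # sigma_14 * sigma_23 - sigma_12 * sigma_43
]
print([abs(t) < 1e-12 for t in tetrads])
```

When some items are causes rather than effects of the latent variable, fewer tetrads are implied to vanish, which is why the competing models in Table 3 have different degrees of freedom.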


15. These types of analysis require a significant amount of computer memory. Due to 
limitations in the memory available for this analysis, only 17 of the 19 CES-D indicators in 
Add Health were included in the analysis. The two excluded indicators were the following: 
(1) You felt that you were just as good as other people, and (2) You talked less than usual. 
The item “talk less” has been found to be a gender-biased item in previous analyses (Stommel 
et al. 1993), and the item “good” is thought to measure self-esteem. Both items also had the 
lowest factor loadings among the native white adolescent population. 


16. This is a quasi-empirical approach that relies on first hypothesizing a model of effect, 
cause and outcome indicators. Several thousand models are possible. For example, future 
researchers could test whether a different 10-item measure might be an improvement over 
both the 19-item and our five-item measure. However, our discussions with a psychologist 
and our review of the empirical literature suggest that the five items identified are the only 
true effect indicators in the CES-D scale. 


17. Complete results for each of the ethnocultural groups are included in the online Technical 
Appendix, located on the Social Forces Web site: http://socialforces.unc.edu. 


18. The χ² statistic is particularly large due to our large sample size. For this reason, we also 
present the CFI and RMSEA. 


19. As discussed in a previous methodology section, our findings are based on the CFI and 
RMSEA. Because of the large N, differences in chi-squares are statistically significant without 
being analytically meaningful. 


20. Detailed results from these Multiple Group Analyses are available in the online Technical 
Appendix. 


21. Measures of reliability generally increase with the addition of items to a scale and decrease 
as they are removed. This is true even when those items are relatively poor indicators of the 
latent variable and indicators that are subject to measurement variance across ethnocultural 
groups. The mean of inter-item correlations is less sensitive to the number of indicators 
in a scale. For each of the ethnocultural groups in our study, the mean of the inter-item 
correlations for the five-item CES-D is higher than the mean of the inter-item correlations 
for the 19-item CES-D. 


22. This measure of reliability can be calculated using the loadings and variances of the 
loadings provided in any standard output for factor analyses. This measure of reliability is 
more general and less restrictive in its assumptions than the Cronbach's alpha. The Cronbach's 
alpha assumes that factor loadings for each item in the scale equal 1 and that the variances 
of the factors are equal to one another. 


23. The item “enjoy” is also commonly included in abbreviated CES-D scales and may be 
an effect indicator. However, including it in the abbreviated scale would result in a two- 
dimensional scale with correlated errors between the four negatively and two positively worded 
items. By including only one of the positively worded effect indicators in the model, we avoid 
this complication without any significant loss in the reliability of the abbreviated index. 


“...be a genuine homemaker in your own 
home”: Gender and Familial Relations in State 
Housing Practices, 1917–1922* 


PAUL C. LUKEN, University of West Georgia 
SUZANNE VAUGHAN, Arizona State University 


Abstract 


Using institutional ethnography we examine the Own-Your-Own-Home (OYOH) 
movement as a configuration of ideological practices designed to reorder gender, 
family and housing arrangements in the United States during the early 20th 
century. We describe the social organization of these practices, with particular 
emphasis on the coordinating activity of the OYOH Section of the Department of 
Labor, and provide specific examples from texts of the National Archives and 
Records Administration. These texts are part of an historic, ongoing process of 
work organization that coordinated other sequences of action at multiple sites of 
production within the housing enterprise. We demonstrate that the texts of the 
campaign organized white, working class, married couples with children as the 
owners of homes. 


Texts play an integral role in the organization of contemporary societies. They 
have emerged as forms of ruling, which mediate social relations and coordinate 
the activities of people translocally. Fundamental to the informing, regulating 
and ordering that occurs at various sites are textual practices that objectify and 
generalize knowledge, including knowledge of social organization. Although 
people experience and understand things differently, individuals’ perspectives are 
overridden through the practices of generalization and objectification; and, as 
they take up texts, people produce/reproduce institutional arrangements locally. 


* This research was partially supported by grants from the Scholarly Research and Creative Activities 
Grant, Arizona State University. We would like to thank Liza McCoy and the anonymous reviewers of 
Social Forces for their helpful comments and the women who graciously participated in our study. Direct 
correspondence to Suzanne Vaughan, Social and Behavioral Sciences, Arizona State University, 4701 
West Thunderbird Road, P.O. Box 37100, Phoenix, AZ 85069-7100. E-mail: svaughan@asu.edu. 

Furthermore, through textually mediated forms of communication, people 
connect with elements of the ruling apparatus, such as the state, businesses and 
other formal organizations (Smith 1993, 2003). In this article we examine textual 
practices of the state in the reorganization of gender and family relations through 
the manner in which homeownership was promoted early in the 20th century. 
Specifically, we analyze texts from the archives of the Own-Your-Own-Home 
Section of the U.S. Department of Labor, 1917-1922, and demonstrate their 
coordinating role in women's experience. 


Methodological Framework and Problematic 


We use institutional ethnography in our research (Smith 1987, 1990, 2002). As a 
method of inquiry it provides an empirical bridge between local and particular 
processes of change in the everyday world and the actual historical social relations 
that order everyday experience. Institutional ethnography's starting place is the 
everyday experience of particular people, and this becomes the standpoint from 
which to explore the extra-locally organized social relations that they participate 
in and that govern their social practice. Social reality is viewed as arising within 
people's activities. Included in these activities are language, thinking, concepts, 
beliefs and ideologies; and the social aspect is the ongoing coordination of 
these activities. How coordination occurs remains to be discovered. Institutional 
ethnographers are particularly concerned with the processes that coordinate the 
activities of people in multiple local sites of everyday activity. In this regard, texts 
— in the form of printed words, photographic images, electronic communications 
and so on — are crucial as organizers of social relations translocally (Smith 
2002). 

Dorothy Smith (1993) argues that textual discourse mediates relations between 
active, creative subjects and the market and productive organization of capital. 
Although discourse appears as objectified forms, it can be examined as "actual 
social relations ongoingly organized in and by the activities of actual people" 
(Smith 1993:160). Following the extra-local textual production and distribution 
practices of organizations, activities are coordinated in local settings as people 
talk about the discourse, make decisions and work to organize their lives in 
relation to the discourse. In the process, the relevancies of those authoring the 
texts transcend the local historical setting and become points of reference. Thus, 
institutional ethnography is particularly effective in investigating the processes 
and character of institutional change as it reorganizes social relations at a number 
of organizational levels (Luken and Vaughan 2003). 

This is not to suggest some form of textual determinism. When a text is 
taken up, a text-reader relationship develops; however, only one party in the 
conversation is fixed. The reader is not. The reader is active in interpreting and 
acting upon the text (Smith 2002). As Smith notes, “[T]he relation between 


Gender and Familial Relations in Housing Practices / 1605 


discourse and local practices is not causal. Rather, women are active, skilled, 
make choices, consider, are not fooled or foolish. Within discourse there is play 
and interplay.” (1990:203) 

“Central to Smith’s approach,” as Kamini Maraj Grahame makes clear, “is her 
insistence that the point of departure for inquiry is the situated activities of the 
subject in the everyday world; hence, we begin with the standpoint of women. Such 
a starting point is offered as a break with the standard practice of social science 
inquiry which begins with academic or ‘ruling’ discourses.” (2003:69; emphasis 
added) We began our research by interviewing several women and generating an 
oral housing history for each (DeVault and McCoy 2001). All of the women lived 
in the Phoenix metropolitan area at the time of the interviews, were at least 60 
years of age, and had lived alone for at least six months. After an initial contact we 
interviewed the women in their homes on at least four occasions for approximately 
two hours in 1992-93. We tape recorded and transcribed approximately 10 to 12 
hours of conversation with each woman. In this article we use pseudonyms and 
change some of the information to ensure the anonymity of the women. 

In this article we explicate a housing experience mentioned by two of the 
women we interviewed, Thelma Hay and Ursula Roberts. In 1920, when they were 
children, their parents purchased homes for the first time. It is this shift from 
renting to being homeowners of a particular form of gender-organized shelter 
that is our problematic. To examine the textually mediated discourse about home 
ownership organizing these particular women’s experience, we collected historical 
documents (speeches, correspondence, newspaper articles, advertising materials, 
policy statements) from the National Archives and Records Administration
in College Park, Maryland. Following McCoy (1995), we treat these texts as 
coordinators of courses of action whose commonly understood schema are 
intended through their assembly, production and distribution processes. 

In this study we draw on the women’s accounts; however, they serve principally 
as our point d’appui as we aim to map the social relations that organize the 
women’s experience but were not contained within that experience or known 
by the women. For institutional ethnographers, the goal is not to explain the 
behavior of the subjects or to simply describe the local social organization of 
their lives. Rather, we examine a segment of the institutional regime of housing 
that coordinates the activities of the subjects as well as of others. Thus, we will 
begin with brief descriptions of Thelma Hay and Ursula Roberts and see what 
they had to say, and this will be followed by an analysis of the Own-Your-Own- 
Home Section and its ideological practices. 

Thelma was born in Akron, Ohio, in 1911. Her mother was a housewife, her 
father was a handyman and hardware store employee, and a younger brother was 
born at the time they purchased their first home. Thelma tells us: 


Yeah, we rented in our first two places. Then I think we were back with 
Grandma for a little bit because I know we're building a home. And I 
don't know why we left that place and went to Grandma’s or anything,
but I know then we were building a home. Then we had our own home 
after that. A three-story. It had an attic and basement, and that was fully 
equipped, you know what I mean. It had electric and everything. I mean it 
was a home. It was a brand new house, uh-huh. We had somebody build 
it for us. It was a contractor who built it for us. And I know that we didn't 
have any sidewalks actually around the house, even from the sidewalk 
into the house, or anything, and my dad laid that whole sidewalk and 
clear around the house, laid the sidewalk and things like that, you know. 
He could do a lot of things like that. I was only — 11, I guess, when we 
moved in there. I lived there, I think probably, 12 or 13 years, I think. 


Ursula, the older of two daughters, was raised in a small steel manufacturing
town in western Pennsylvania. At the time when her parents purchased their 
first home, her mother was a housewife and her father was a sales manager for 
a car dealership. She relates a similar story about her family moving into their 
own home: 


And I lived there in the duplex until I was 5, and then I moved on the 
north side, which is the best side of New Castle, Pennsylvania. New Castle 
sits down in a little valley. There's an east side, north side, south side 
and west side, I mean, you know. And so the north side, and I lived on 
Maryland Avenue; they bought their house on Maryland Avenue when I 
was 5 years of age, and, let’s see, I lived there until after I was married . . .
And that's a standard home with a reception hall, living room, dining
room, kitchen and three bedrooms and a bath upstairs. Just a regular 
home. I don't know that they had it built, but I think it was rather new, 
and I remember the street wasn't paved at the time, but it was paved shortly 
after that. And it was on a hill, going down a grade . . . It was one of the 
nicest neighborhoods in New Castle at that time. It was really nice. 


What makes Thelma’s and Ursula’s narratives distinctive is that in the early 20th
century home ownership was out of reach for most U.S. families. Low-interest,
long-term mortgages with high loan-to-value ratios were not available. Most 
amortized mortgages from savings and loan associations, then known as building 
and loans, were for 10 to 15 years only. Although insurance companies offered 
5- to 10-year “balloon” loans and commercial banks provided “bullet” loans for 
five years or less (Carliner 1998:304; Weiss 1989:110-11), down payments were 
beyond the reach of most families. While there was significant regional variation 
in ownership, in 1910 only 38.4% of non-farm homes were owned and ownership 
increased slightly to 40.9% in 1920 (U.S. Department of Commerce 1923:39).


The Own-Your-Own-Home Campaign


In 1918 the U.S. government, through the Department of Labor, joined with 
lumber companies, lending agencies, real estate brokers and municipalities in 
the promotion of homeownership. This venture, known as the Own-Your-Own- 
Home (OYOH) campaign, was widespread yet short-lived. We examine the
OYOH movement as a configuration of ideological practices designed to produce 
the type of housing arrangement in which the women in our study actually lived 
during their childhood and which reorganized the relations of gender and family 
living arrangements during the early 20th century. 

The National Association of Real Estate Boards began the Own-Your-Own- 
Home campaign in 1917. The federal government became more involved with 
the campaign to promote home ownership in February 1919 when the Own- 
Your-Own-Home Section was established in the Department of Labor's Division 
of Public Works and Construction Development and in the Information and 
Education Division. Other interested parties included the United States League 
of Building and Loan Associations, the National Federation of Construction 
Industries, and various lumber associations. Some members of these organizations 
became dollar-a-year officials of the OYOH Section.

Initially Franklin T. Miller, chairman of F.W. Dodge (which handled 
information services for the construction industry) and vice-president of the 
National Federation of Construction Industries, headed the Section. Paul C. 
Murphy and K.V. Haymaker assisted him. Murphy was a real estate broker 
from Portland, Oregon, and spokesman for real estate boards. Haymaker was 
an executive for a savings and loan in Detroit and was connected to the United
States League of Building and Loan Associations. These men were brought to
Washington, D.C., to head the units within the section. Their efforts were largely 
directed at bankers, encouraging them to restructure mortgage-lending practices 
and to make more money available for home mortgages through builder and 
thrift institutions and directly to purchasers. In addition, the OYOH Section 
acted as a clearinghouse for promotional materials to encourage growth of the 
movement. We demonstrate that a dominant theme of its marketing campaign 
was the organization of gender and family life within the home. 


Examining Text Mediated Discourse 


One of their first promotional efforts on behalf of the Department of Labor 
was the production of a booklet that the department distributed to encourage 
the formation of additional OYOH campaigns in numerous communities and 
to furnish ideas for the local campaign operation (see Figure 1). The booklet 
was distributed widely. It was sent to 148 OYOH committees — including 
committees in Akron and Pittsburgh — and to thousands of city and metropolitan 

Figure 1. 1919 Own-Your-Own-Home Booklet


Source: Own-Your-Own-Home Section; Real Estate
Division; U.S. Housing Corporation, Record Group
3; National Archives and Records Administration,
College Park, MD.





newspapers, hundreds of Washington correspondents, speakers representing 
various government departments, labor publications, state governors, architectural 
and engineering associations, hundreds of building industry association
secretaries, and building trade papers.

Among the booklet's suggestions was a program developed by the Portland,
Oregon, committee. This involved an event that was held at a large bungalow that 
served as the office of the committee, and the occasion was a wedding: 


There was erected in the bungalow a very attractive altar banked with 
evergreens and flowers. The bride and groom, attended by the best man 
and the bridesmaids, marched into the room while the band played the 
wedding march. A popular pastor performed the ceremony and delivered 
a very fine sermon on the merits of the Own-Your-Own-Home movement 
and what it meant to the new couple to establish a firm foundation in life 
by building or owning a home of their own. The mayor, after the wedding, 
delivered a very practical address on what the home meant in stabilizing 
our Government. The secretary of the Oregon Industrial League, in 
presenting the presents to the bride and groom, gave a very lively talk on 
what it meant to the community to use homemade products. All of these 
addresses were, of course, published in the papers.


Figure 2. Daily Jewish Courier Ad


Source: Own-Your-Own- 
Home Section; Real Estate 
Division; U.S. Housing 
Corporation, Record Group 
3; National Archives and 
Records Administration, 
College Park, MD.


Advertising was also distributed to newspapers across the nation in multiple
languages to reach various ethnic groups. The Department of Labor produced 
ads in German, Polish and so on, including Yiddish for the Daily Jewish Courier, 
with marriage and family themes (Figure 2). Similar ads were run in the Akron 
and Pittsburgh newspapers. 

Many of the illustrations used in the advertising campaign by the OYOH 
Section were drawn and contributed by Fremont Wood, the secretary of Citizens’ 
Building and Loan Association of San Francisco and participant in the local 
OYOH committee. A whole series of posters, brochures and newspaper copy 
contained his drawings. The OYOH texts were easily interpreted by people since 
they used styles, words and symbolic representations that were similar to those 
of the larger genre of advertising images widely circulated in magazines and 
newspapers at the time (Marchand 1985).


Figure 3. Wood Illustration No. 1: An American Dream


Source: Own-Your-Own-Home 
Section; Real Estate Division; U.S. 
Housing Corporation, Record 
Group 3; National Archives and 
Records Administration, College 
Park, MD. 





In Figure 3 Wood sketches a white, well-dressed man and woman, presumably 
a heterosexual couple, where the woman is gazing into the man's vision of a 
detached, single-family, two-story bungalow. With an expression of confidence 
the husband holds the architectural plans for his dream. Similar images appear 
over and over again in Wood's drawings. The people are always white, well 
dressed, urban, heterosexual couples and families, while the houses are always 
single-family, detached bungalows with gracefully landscaped yards. 

Another of Fremont Wood's illustrations (Figure 4) shows a couple looking over
the architectural plans for their future home while sitting on a log on the cleared 
and staked, wooded housing site. Here the husband and wife are seated closely 
with the husband's arm placed protectively over the wife's shoulder. The picture 
suggests to us that homebuilding brings couples together and creates intimacy. 
The setting for the image is not the wilderness, an undeveloped area or farmland 
because in the distance is another detached residence with two birds flying over. 
The man and woman are dressed as urban residents. 

Figure 5 shows a young couple, courting, sitting on a bench. Again, they are 
well attired in urban clothing. She flirts demurely at us as he whispers in her ear. 
Perhaps he whispers about the single-family detached bungalow he is thinking 
about? Just as there is a connection between marriage and home ownership in the 
previous image, there is also a connection between courting and homeownership. 
What the OYOH section constructs in its advertising is that finding a committed 
stable relationship, being married in a heterosexual relationship, and legally taking 
on responsibility are steps to homeownership. Furthermore, unlike previous 
generations of couples, courting, as constructed here, portrays couples outside of 
family networks. Kinfolk do not surround couples in any of the images used by


Figure 4. Wood Illustration No. 2: Planning a Future


Source: Own-Your-Own- 
Home Section; Real Estate 
Division; U.S. Housing 
Corporation, Record Group 3; 
National Archives and Records 
Administration, College Park, 
MD. 





the Department of Labor. All the images depict a particular type of family: the 
nuclear family composed of a husband, wife and related children. None of the
illustrations depicts unattached people dreaming or thinking of buying a home
or alternative housing structures such as multi-household cooperatives.

In addition to these idealized visions of family, these illustrations provide a 
universalizing, standard category of femininity and womanhood for the 1920s. 
Womanhood is defined in relation to marriage, responsibilities to children and 
the home in the following images. In Figure 6, a woman holds her left arm out in 
a gesture that prominently displays her wedding band. Behind her is a bungalow 
with smoke rising from the chimney and a landscaped yard. Both the home and 
the ring are standard symbols of marriage in this sketch. 

Copies of solicited endorsements from the OYOH Section files present similar 
images of standardized womanhood. For example, from an interview with 
Margaret Illington, “America’s Foremost Emotional Actress,” the Department of 
Labor reports her comments: 


A woman’s simplest duty, one would say, is to preserve the family interests 
by keeping her Home clean, wholesome and comfortable, and caring for 
the children properly. Yet, if their dwelling is a rented one, she cannot 
fulfill these fundamentally simple responsibilities in a way which will do 
her justice... 


If women would effectually [sic] continue their old avocation in this 
progressive Twentieth Century they must take an interested part in new 
Movements to emphasize old truths. The Own Your Home Movement 
will give women an insight into what actual ownership of a home can


Figure 5. Wood Illustration No. 3: A First Step


Source: Own-Your-Own-Home 
Section; Real Estate Division; U.S. 
Housing Corporation, Record 
Group 3; National Archives and 
Records Administration, College 
Park, MD. 





mean, by giving women the opportunity to really supervise, as owners, 
those details in home operation and management which naturally belong 
to them, but which are constantly being overlooked or slighted in rented 
dwellings and apartments. 


If women as mothers wish to fulfill their traditional responsibility to 
their children, they can not sidestep this important factor of actual Home 
Ownership, and what it means in fostering life-long attachments that will 
be a help and inspiration to their future. Mothers, from time immemorial, 
have known to fight for the safety of dear ones. May I not fairly state 
that the American Mother needs this weapon of Ownership to preserve, 
intact, her Home?! 


Between 1917 and 1918, the Department of Labor Information and Education 
Division solicited similar endorsements from such prominent people as Douglas 
Fairbanks, President Wilson, General Pershing, Lloyd George, Chief Justice White, 
Billy Sunday, Samuel Gompers, James Cardinal Gibbons, Jane Addams, John D.
Rockefeller and John D. Rockefeller Jr., William Howard Taft, Luther Burbank,
Henry Ford, Judge Landis and Charles Schwab, among others.

Another suggested display copy designed for newspapers and produced by 
the OYOH Section details womanhood in relation to wifely and motherly duties 
in an article titled “LITTLE FEET ON THE STAIRWAY”: 


Figure 6. Wood Illustration No. 4: Symbols of Marriage


Source: Own-Your-Own- 
Home Section; Real 
Estate Division; U.S. 
Housing Corporation, 
Record Group 3; National 
Archives and Records 
Administration, College 
Park, MD. 





This is for you, Mrs. Home-Maker. 


Before Baby's little feet ever really pattered across the floor; before Baby 
ever came, you heard the little footsteps on the stairway. You pictured the 
tiny figure going up to bed, to Baby's own room, and tucked in by your 
own loving hands — safe and sound at Home. 


You have the grandest calling in the world. Little children love you; A good 
man adores you and his greatest ambition is your happiness. When your 
sons went forth to war it was you who inspired them to heroic devotion. 
Your memory shielded them in the land of the stranger. 


Ever since the world began, when men fared forth to war, upon you 
revolved the duty of defending the Home itself, of keeping it safe and 
intact against the time of their return. 


But in this war that is just over, what have you defended? An illusion, a 
dream that is only in your memory? 


If you are to maintain your ancient, glorious vocation, you need to be a 
genuine Home Maker, in your own Home. Not stuck in a pigeon-hole, or 
cliff-dwelling, tabbed and assorted like so many envelopes of a size; not 
cooped between the arbitrary walls of a rented dwelling, but under the 
sheltering roof of a real Home.


What is there in renting to appeal to any thinking Mother and Home-Maker?
Nothing! . . .


And all this time you are ambitious to administer the Home finances as 
well as possible; to keep Home comfortable; to justify your husband's 
pride in you. 


Think this over, Mrs. Home Maker. It is not all sentiment. It is mostly 
sound common sense. Let no trifling sacrifice, or misdirected ambition 
stand in the way of enabling you to Own Your Own Home. Think it over. 
Then come in and see us. 


“Appropriate masculinity” was also a theme in the department's marketing
campaign. Figure 7, a product of the Philadelphia Own Your Home Committee, a 
coalition of men, including local elected officials, executives from local businesses 
in banking, construction, real estate and insurance, is about “manhood” in the 
U.S.: being a worker, husband, father, citizen and real American. It tells how one 
can be better at these through owning one's own home. 


It should be a matter of personal pride with you to own your own home. 
Pride is defined as your proven ability to give your family a home of their 
own. Pride — in the ownership of real property, an ownership that places 
you among the bigger men of your community. A home-owner is a marked 
man — marked in the letters that can't be missed — “SUCCESS.” 


Your family looks to you to give them a home of their own. They know 
that there is more happiness and comfort in a home of one's own. Let 
your children grow up in a real “home” atmosphere. Give your wife an 
opportunity to show you that she can be a business woman and a loving 
wife and mother at the same time. 


Own your own Home! 
You can afford it. 
Think it over. 


Talk to your wife about it. She wants to own her home for a good many 
reasons you might never think of. Talk over the idea with her. 


Men are encouraged to write for a “handy booklet” to get the “necessary data” 
to consider and make a decision. 

Attaining masculinity through homeownership is further elaborated in an 
address by Paul C. Murphy before the Interstate Realty Convention in August
1917:


The most sacred institution is not the Church, whatever the preacher may 
claim, not the State, whatever the patriot may say, but it is the Home, 
and the man, who, in these times of high costs and hard conditions, 
buys a home, whether it be for cash or on the installment plan, secures


Figure 7. Philadelphia OYH Committee: A Successful Man


Source: Own-Your-Own-Home Section; Real Estate Division; U.S. Housing
Corporation, Record Group 3; National Archives and Records
Administration, College Park, MD.


for him a happy wife and raises therein patriotic well-educated American
children, and sends them out to high moral ideals, to be a blessing to the 
community, has done the most patriotic and religious thing possible and 
incidently [sic] the most opportune thing for the betterment of human 
conditions.


It is not enough to provide the family with a shelter or to make a wife happy;
men must reside in a “real home” appropriate for raising children. Figure
8 shows a father playing horsy with his daughter. The son is looking at a book 
while the mother is holding flowers, an idyllic picture of a family in their living


Source: Own-Your-Own-Home
Section; Real Estate Division; U.S. 
Housing Corporation, Record 
Group 3; National Archives and 
Records Administration, College 
Park, MD.


room. Here the Philadelphia OYH Committee presents in a woman's words a
testimonial to the value of homeownership to her husband and her: 


How glad I am that I agreed with John when he proposed that we own 
our home. I CAN look back and laugh now at the worries and discomforts 
that were always with us when we rented our house . . . 


Eleven years later we're still here. Eleven happy years, too... 


When repairing or decorating is needed, we have it done. Our home is 
always in order and much pleasanter and more livable than any rented 
house could be. Our lawn is well kept, our garden is blooming — in short,
we live in a real home atmosphere.


Her testimonial is followed by a course of action for the potential home owner: 
attend the Own Your Home Exposition at the First Regiment Armory and enter
a chance to win a bungalow. 

The Department of Labor's reformation efforts also cast owning a
home as a sign of better parenting. In a letter to national Women's Clubs, the
OYOH campaign argued that women's involvement in homeownership was
important because it meant a “desirable neighborhood and proper environment
in which to raise children.” The poster presented in Figure 9 declares “For Your
Children’s Sake,” directs readers to write to Roger Babson, chief, Information and
Education Service, for copies of the brochure, and instructs parents on how to go
about becoming homeowners. In a newspaper ad, a family is shown approaching a walled-off
territory (see Figure 10). The image also presents a vision of a single-family, 
detached bungalow on the other side of the wall. To get in, the family must pass 
through the gate of home ownership to the building and loan association. The 
father holds the passbook controlling the financial aspect of home owning and 
the family’s connection to the resources. The family strides forward confidently as 
the boy holds the keys to the locked gate. As the next generation of homeowners, 
the kids lead the way in. Note that the eyes of the father, son and daughter are on the
lock of the gate, but the mother’s eyes are focused on the home bubble. She is 
envisioning not just a house or a set of architectural plans, but a home, a warm 
one at that. Although the mother and daughter are portrayed more passively than 
the men, they are integral to the illustration since they create a sense of family 
and intimacy and their presence signifies for us a particular kind of family life. 

In “A LETTER TO LITTLE GIRLS AND BOYS — AND BIG ONES” the 
Department of Labor portrays proper parenting in remarkable newspaper 
copy: 

Every baby bird and animal that is born comes into the world in its
own little nest. Its father and mother choose a sheltered spot, and make
the home comfortable with loving care. At night they gather there for
protection against danger.


No bird or animal would even think of raising its little ones in a strange 
nest. Only the cuckoo, a very lazy bird, tries to save itself the trouble of 
building a nest, and lays its eggs in other birds [sic] nest. Birds hate the 
cuckoo, and will drive it away. But sometimes it lays its eggs unseen, and 
a bird mother hatches them as though they were her own, and feeds more 
little ones than she should have to. Or sometimes she finds that there are 
strange eggs in her nest and she builds over the top of it and lays more 
eggs, or chooses another nesting place altogether. 


Of course we must shelter chickens and cows and horses. We have tamed 
them for our own use, and must keep them near by for our use. But if they


Figure 9. For Your Children's Sake Poster


Source: Own-Your-Own-Home 
Section; Real Estate Division; U.S. 
Housing Corporation, Record Group 
3; National Archives and Records 
Administration, College Park, MD. 


were wild they would have the natural instinct that would tell them where
and how they should find shelter — how to make their own homes. 


Now what we are trying to make plain is this: If it is natural and right 
for little baby birds and animals to be born in their very own homes, it 
is the most natural thing in the world that you should have the right to 
be born in your own home. 


When your father and mother pay rent they are paying someone else for 
the use of the place they live in. It is not a good thing to do that. It is like 
wearing someone else's clothes. 


Besides, your father and mother want you to be healthy and strong. Now, 
in Oakland, California a doctor measured a lot of little boys and girls not 
long ago, and the children whose fathers and mothers lived in their own


Figure 10. Wood Illustration No. 5: The Gate of Home Ownership


Source: Own-Your-Own-Home 
Section; Real Estate Division; U.S. 
Housing Corporation, Record 
Group 3; National Archives and 
Records Administration, College 
Park, MD. 





homes were bigger and stronger and smarter than others whose parents 
were not so well situated. It does not mean that some were rich and others 
poor, but that when your parents own your home they can keep it healthy 
and sanitary and comfortable. 


When you live in a nice home you progress faster at school. If you change 
to a new school and new teachers every time your folks move you lose a lot 
of time, and must get used to a lot of new teachers and books and school- 
mates, and you don't get along as fast in your classes as you should. 


Your home ought to have a nice big sunshiny room for you, and a sand 
pile under the back porch for little brother and sister, and a big front 
porch where you can watch for Daddy when he comes home in the 
evening. It will be where there are a lot of nice playmates, nice children, 
your own father and mother will be glad to have you play and go with, 
if they choose it rightly. 


Read this letter all through, and ask your father and mother about 
anything in it you do not understand. And if they want more information 
themselves, have them ask us.

The well-being of children is further addressed in an OYOH press clipping 
from the Washington Star (May 17, 1919). In the article “Mothers Grow
Weary,” the writer asks parents to assess how well they are fulfilling their 
responsibilities to their children: 

The children must start all over again making friends and forming those
associations so dear to the heart of youth. It all seems so useless for already
their short lives are overcrowded with a jungle of friends made one year


Figure 11. The Parable of Renting and Ownership

Source: Own-Your-Own- 
Home Section; Real Estate 
Division; U.S. Housing 
Corporation, Record Group 3; 
| National Archives and Records 
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and lost the next — are they to have no permanent memories of home 
to go with them through life? 


“But what can I do?” you say. The answer is so easy if you just know how 
— “build your own home? Start building or buying today. 


Are you fair to your family — are you doing for them all that is within 

your power?? 

Figure 11 dramatically portrays the parable of renting and homeownership. 
Not only is it a matter of rags to riches, but in the prosperous household the 
husband is the dominant person in the foreground of the picture while the wife 
recedes and is largely hidden by him. In the sketch on the right, it is not clear 
what the wife is doing — other than being by her husband's side. In the sketch 
on the left, where the impoverished wife is dominant, there is considerably more 
distance between the wife and the husband. On the left the wife is the center of the 
family and the husband appears tangential. Owning one's own home transforms 
the social hierarchy from matriarchal to patriarchal. 


Discussion 


Social historians regard the Own-Your-Own-Home campaign within the 
Department of Labor as an attempt to stimulate a sluggish capitalist economy and 
dissipate a wave of labor strikes and social unrest after World War I by promoting 
consumption of housing and invigorating the homebuilding industry (Carliner 
1998:299; Weiss 1989:109). It was a deliberate attempt to promote a particular 
type of shelter for Americans, one that was a “suburban ideal, a single-family 
dwelling surrounded by a yard, a secure foundation for American citizenship, an 
essential consumer arena, and a model for planned, rational living.” (Hutchison 
1997:185) The state’s promotion of this ideal disregarded other viable alternatives 
being debated by reformers and public policy makers at the time, including multi- 
family and cooperative housing structures (Hutchison 1997:186). 

We should note that this ideal form of housing has comparable historical 
antecedents. Rybczynski (1987) credits the Dutch middle class with exemplifying 
the family home in the first half of the 17th century. This influence quickly spread 
throughout Northern Europe, but it took a new form in the rural landscape of 
the United States. As Hayden (2002) argues, the Land Ordinance of 1785 and the 
accompanying land surveys were instrumental in the destruction of the colonial 
vision of the ideal city and the substitution of the dream of the single-family 
home. This Act was consistent with the Jeffersonian vision of a country filled 
with land-owning small farmers. In the 19th century, architects, social reformers 
and plan book authors vigorously promoted family homes (Clark 1986; Smeins 
1999). Catharine Beecher's 1841 work, A Treatise on Domestic Economy for the 
Use of Young Ladies at Home and at School, and the 1869 book coauthored with 
her sister Harriet, The American Woman's Home, were significant texts advocating 
both an architecture and a gendered lifestyle for American families. After the Civil 
War, many building-related professions, taking advantage of mass-distributed 
magazines such as Good Housekeeping, Ladies Home Journal and Domestic 
Monthly, began to lay claim to housing expertise through the distribution of 
housing images — exalting simultaneously the nuclear family, homeownership 
and the gendered division of labor. These images were replicated over and over at 
various sites within the newly emerging housing industry, such as in professional 
architecture journals and advertising catalogues sent to building suppliers, 
hardware stores, carpenters and builders (Smeins 1999). However, while these 
ideological practices began to organize local housing activities along gender and 
familial lines, the state did much more. Through the Own-Your-Own-Home 
Section and its advertising campaign, it centralized, standardized and provided 
material support for these efforts. 

In constructing the text-mediated discourse of home ownership through the 
language of typification (e.g., clean, wholesome and comfortable home; real women 
and mothers; Mrs. Home-Maker; happy wife; good man; American children; real 
home atmosphere; the family interests) and the atemporal present, the OYOH 
campaign deployed what Smith (1999) calls “ideological codes.” Regardless of the 
complexities of people's actual lives, their gender and familial relations to one 
another and to a form of shelter “are mediated by the standardization of visual 
norms organized by this discourse.” (Smith 1993:176) The texts order courses of 
action and social relations among individuals who make up divergent audiences 
at multiple sites by locating people within the social institution of housing 
and instructing them on what to do next. Thus, the texts link “the active and 
creative subject and the market and productive organization of capital” (Smith 
1993:161). 

The OYOH textual practices organize women and men’s relation to homes 
and instruct families in the proper manner of living in the emerging “modern,” 
urban and industrial America of the early 20th century. By the turn of the century 
the old family economy in which all members contributed to the household 


1622 / Social Forces 83:4, June 2005 


was disappearing. Many of the traditional tasks of women and men had been 
commodified and their production transferred to factories. The OYOH ad copy, 
like the advertising tableaux of the day (Marchand 1985), speaks of women's 
homemaking and mothering work as impelled by a constant antediluvian dynamic 
“since the world began” and “since time immemorial,” while at the same time 
articulating its imagery to connect with “modern” discourses on the middle-class 
wife as domestic scientist and an effective “business woman” administering the 
“home finances” — as elaborated by the premier Taylorist home economist of 
the day, Christine Frederick (McGee 2001). The OYOH texts portray women as 
managers of homes and in the process suggest that homemaking is the proper 
and venerated career for women. On the other hand the texts indicate that men, 
whose productive work is increasingly outside the house, are to provide houses 
for wives to manage. The husband becomes the presider over and financier of the 
household while the wife becomes the manager and purchaser within a housing 
environment in which owned homes are considered to be economic assets. In 
this way gender relations to homes are configured. 

The conceptual aspects of the routine marketing efforts of the Department 
of Labor OYOH Section served as “‘organizers’ packaged in texts that transmit 
the organization” (Smith 1999:93) of standardized, normative gender and family 
relations to working-class households. The advertisements coordinated and 
organized relations for the housing industry and its various agents. These practices 
included white heterosexual couples with children purchasing single-family homes 
surrounded by yards. The families of the women we interviewed, Thelma Hay and 
Ursula Roberts, took up these practices and coordinated their activities with the 
OYOH campaign and others in the housing industries by building and buying 
homes in 1920. As more families sought shelter in detached, single-family homes, 
their actions, in conjunction with other agents within the housing regime, began 
to transform housing and family practices. By the 1930s these social relations 
were codified by public policy makers in the 1934 National Housing Act and 
further served to organize housing practices — lending, purchasing, household 
consumption and the geography of suburban space — from the 1920s on. 


Further Implications 


Although ideological practices similar to those discussed in this paper appear to 
be operative today, additional research is needed on the changes in the housing 
discourse and on the manner in which people are now being connected with this 
discourse. Our analysis suggests some important directions for exploring these 
processes. The Bush administration claims several initiatives that cut across various 
segments of the housing industry — real estate and mortgage finance industries, 
local housing agencies which counsel families on home buying, HUD, the Federal 
Trade Commission and credit agencies — and connect these organizations with 
potential homebuyers. The American Dream Downpayment Act, Single-Family 
Affordable Housing Tax Credit, the Self Help Homeownership Opportunities 
Program and America's Homeownership Challenge are presented as efforts to 
dismantle barriers and stimulate home buying among low-income and minority 
Americans (The White House 2004a). Furthermore, many of the themes found 
in the rhetoric of the OYOH movement prevail. While applauding the record 
high rates of home ownership, President George W. Bush asserts that we should 
move toward what he has termed an “ownership society.” During a speech in 
spring 2004 the president stated, “We want more people owning their own home. 
There is nothing like saying, this home is my home.” Later he added, “The more 
people that own their home, the more carpenters are going to be working. But 
more importantly, the more people that own their home, the more satisfaction 
there will be in America, in the American experience.” During public appearances, 
President Bush commonly elicits glowing testimonials for homeownership from 
new, first-time homeowners. Homeownership is still presented as good for family 
and community stability, good for the economy and good for America in general. 
However, the emphasis on home ownership by minorities has come front stage. 
As the president noted, “Not enough minorities own their own homes. And it 
seems to me it makes sense to encourage all to own homes.” (The White House 
2004b, 2004c, 2004d) In this commentary on homeownership, gender and family 
relations, which we focused on in this research, are taken for granted rather than 
made explicit. 

Of course, significant changes have occurred in government organization, 
family organization, gender relations, race and ethnic relations, housing options, 
public policies, home financing and other housing-related arenas since 1920. The 
massive impact of the New Deal legislation on the role of the state in housing 
is widely acknowledged. State interest in housing was episodic before that time. 
Since then the state has been continuously involved in a variety of ways that integrate 
individuals into the housing market. This integration is currently pursued by (a) 
soliciting new consumers and (b) repeatedly returning owners to the housing 
market. The patterns of consumption of women and minorities are being 
redirected toward this market. Single women, Hispanics, African-Americans 
— groups that were largely restricted from home ownership early in the 20th 
century — are now being encouraged to purchase homes, and home ownership 
is encouraged for new immigrants. Secondly, home ownership is not an end 
state, but a step in a progression of purchases. The virtual individuals referred 
to as homeowners in 1920 are often termed “first-time home buyers” today and 
their residences are called “starter homes.” Thus, while the housing discourse 
has changed somewhat in its content, it remains rooted in capitalist economic 
relations. 


Notes 


1. Records of the Own-Your-Own-Home Section. Finding Aid; Own Your Own Home Section; 
Real Estate Division; U.S. Housing Corporation, Record Group 3; National Archives and 
Records Administration, College Park, MD. 


2. Records of the Own-Your-Own-Home Section. RG 3; NARA. 


3. "Suggestions for Own-Your-Own-Home Campaigns." Department of Labor, 1919; Own 
Your Own Home Section; Real Estate Division; U.S. Housing Corporation, Record Group 
3; National Archives and Records Administration, College Park, MD. 


4. “Only A Real Home Will Safeguard Woman's Interest, Says Margaret Illington, America's 
Foremost Emotional Actress in Interview"; Own Your Own Home Section; Real Estate 
Division; U.S. Housing Corporation, Record Group 3; National Archives and Records 
Administration, College Park, MD. 


5. Correspondence. Own Your Own Home Section; Real Estate Division; U.S. Housing 
Corporation, Record Group 3; National Archives and Records Administration, College Park, 
MD. 


6. “Little Feet on the Stairway”; Suggested display advertising copy for ‘Own Your Own 
Home’ advertising campaigns; Own Your Own Home Section; Real Estate Division; U.S. 
Housing Corporation, Record Group 3; National Archives and Records Administration, 
College Park, MD. 


7. Address of Paul C. Murphy before the Realty Convention Held at Aberdeen and Hoquiam, 
August 9, 10, and 11, 1917, under the Auspices of the Grays Harbor Realty Board; Own- 
Your-Own-Home Section; Real Estate Division; U.S. Housing Corporation, Record Group 
3; National Archives and Records Administration, College Park, MD. 


8. “Letter to Little Boys and Girls — and Big Ones;” Suggested display advertising copy 
for Own-Your-Own-Home advertising campaigns; Own-Your-Own-Home Section; Real 
Estate Division; U.S. Housing Corporation, Record Group 3; National Archives and Records 
Administration, College Park, MD. 


9. “Build Your Home Now and Everybody Will Be Glad,” Washington Star, May 17, 1919; Press 
Clippings from Information & Education Service, Department of Labor; Own Your Own 
Home Section; Real Estate Division; U.S. Housing Corporation, Record Group 3; National 
Archives and Records Administration, College Park, MD. 


Abstract 


The outrage over revelations of torture and abuse at Abu Ghraib prison has faded 
from public discourse, but a number of questions remain unanswered. This paper 
criticizes official rationalizations offered for the abuse. We make the case that these 
abuses are systemic, resulting from dehumanization of the enemy and the long 
reliance on and refinement of torture by the United States national security agencies. 
We also consider the spread of torture in the current war "on terror," and we call 
on sociologists to become involved in the study of torture and prisoner abuse. 


Our title is a quote from Alberto Gonzales, White House counsel during the 
first George W. Bush administration, who was subsequently appointed to serve 
as attorney general in Bush's second term. When Gonzales¹ (2002) discussed 
“outrages against personal dignity” it was not to condemn them; it was to plan 
for them. The Bush administration viewed “outrages against personal dignity” 
as integral to interrogation. For this reason, Gonzales recommended that 
Taliban and al-Qaeda be defined as “enemy combatants” rather than prisoners 
of war, and therefore, not covered by Geneva Convention protections. The 
Bush administration heeded this advice. Through this legal maneuver, Gonzales 
(2002) asserted that U.S. soldiers committing “outrages against personal dignity” 
and those who commanded them could be shielded from “domestic criminal 
prosecution” and insulated from international prosecution as well (Gonzales in 
Human Rights Watch 2004). 

There have been documented cases of prisoner abuse in each war that the 
United States has fought. However, the Bush administration is unique for formally 
embracing this abuse and for conferring on the president the authority to set 
aside the Geneva Convention if he sees fit. The Red Cross and most signatories 
to the Geneva Conventions reject the Bush administration's legalese; they also 
condemn the abuse and torture of prisoners that has been pervasive in U.S.-run 
facilities holding “detainees” in Guantanamo, Afghanistan and Iraq. American 
public opinion reflects only a modest level of concern over these abuses and the 
Bush administration’s unilateralism; and government officials have attributed the 
atrocities to the proclivities of individuals or small groups of guards. Congress 
has been reluctant to investigate these abuses. In the wake of the 2004 election, 
with Republicans strengthened in both the House and Senate, it is even less likely 
that Congress will probe abuse and torture. (See Stolberg 2004.) 

For sociologists, it is taken for granted that individualist accounts fail to 
explain such a widespread and deep-seated pattern of abuse. But beyond 
skepticism of accounts that lay the blame on several low-ranking guards, sociology 
has done little to fill the void in the public understanding of these events. This 
is not only unfortunate, but it is surprising given that these events provide a 
stark and troubling manifestation of Weber’s classic imagery — the “iron 
cage of rationality.” In this essay, we provide an overview of detainee abuse and 
explanations of it. We believe that too much emphasis has been placed on the role 
of individuals — whether it is the actions of aberrant guards or the lawlessness of 
Bush administration officials. The abuse of detainees is systemic. The problem is 
not that a handful of troops were poorly trained and supervised; the problem is 
that the United States has developed and refined a callous and calculated method 
for extracting information and intimidating civilian populations. “Violence has 
been turned into a technique.” (Bauman 2000:98) The decision about when and 
how to deploy violence is made on the basis of efficiency calculations, not human 
rights conventions. 

As Abu Ghraib, Guantanamo and other well-documented cases in Iraq 
and Afghanistan reveal, the U.S. military violated established international and 
domestic conventions and policies. But these abuses did not occur because the 
chain of command failed. On the contrary, these abuses occurred because a 
highly rationalized system of interrogation is in place — and (in instrumentally 
rational fashion) this system of interrogation places a higher value on information 
extraction than it does on human rights. Thus, the widely publicized punishment 
of low-ranking military personnel and shielding of commanding officers who 
knew or should have known of the abuse at Abu Ghraib is not simply the 
protection of elites. Rather, we will demonstrate that this pattern of punishment 
reflects the systemic commitment to callous cruelty (Collins 1974) and the 
emphasis that U.S. military doctrine places on information extraction. 


Outrages Against Personal Dignity / 1629 


The Scope of Prisoner Abuse 


The Fay-Jones (2004) report (one of several official investigations undertaken 
after the Abu Ghraib abuses were publicized) identified 44 instances or events 
of detainee abuse committed by Military Police (MP) and Military Intelligence 
(MI) soldiers and civilian contractors at Abu Ghraib prison. The report alleged 
that these abuses fell into two categories: (1) intentional violent or sexual abuse, 
and (2) actions taken based on misinterpretation or confusion regarding law or 
policy. Among the actions in the former category were the following: 


a. a contest between two army dog handlers to see who could make the 
detainees urinate or defecate in the presence of dogs; 


b. three detainees stripped of their clothing, handcuffed together while nude, 
placed on the ground, forced to lie on each other and simulate sex while 
photographs were taken²; 


c. the rape of a female detainee and the rape of a 15- to 18-year-old male 
detainee; 


d. a detainee being forced to bark like a dog and crawl on his stomach while 
MPs spat and urinated on him; 


e. several instances of detainees being forced to wear women’s underwear, 
often on their heads; 


f. a detainee who was beaten with a broom and had a chemical light broken 
and poured over his body. MPs then used the broom to sodomize the 
detainee while two female MPs hit him, threw a ball at his penis and took 
photographs. 


Abu Ghraib was not an anomaly. Amnesty International asserts that the United 
States-led “war on terror” had produced the most sustained attack on international 
law in 50 years (Sengupta 2004). Not surprisingly, the Bush administration 
dismissed such criticisms and defended its actions as part of an effort to defend 
the nation from terrorism and to “liberate” other nations. White House Press 
Secretary Scott McClellan defended detention and killings around the world on 
the pretext that the “war on terrorism has protected the human rights of some 
25 million people in Afghanistan, and 25 million in Iraq.” (in Sengupta 2004:4) 
In fact, the very actions that Amnesty International condemned were trumpeted 
by President Bush in his 2003 State of the Union Address: “More than 3,000 
suspected terrorists have been arrested in many countries. Many others have met 
a different fate. Let’s put it this way — they are no longer a threat to the United 
States and our friends and allies.” (Blumenthal 2004) 

In this essay, we explore several explanations of these abuses. We begin 
with a discussion of individualist explanations — and criticisms of them. We 
close by advancing our view that these abuses are deeply ingrained in the social 
organization and policies of the United States. That is, these abuses are not 
aberrations inconsistent with American values as President Bush claims. They 
are, instead, integral to the rationalized system of violence that the United States 
has created. 


Justifying Torture and Abuse 


Prominent public figures have defended and excused the abuse and torture at 
Abu Ghraib. Senator Zell Miller (D-Georgia) trivialized the events by arguing 
that the sexual degradation documented in photographs was no different than 
what young people experienced in high school gym classes. “The two times I 
think I have been most humiliated in my life was standing in a big room, naked 
as a jaybird with about 50 others, and they were checking us out. Now that was 
humiliating. It didn't kill us, did it?" (Hitt 2004). Secretary of Defense Donald 
Rumsfeld deployed similar semantics: “My impression is that what has been 
charged thus far is abuse, which, I believe, is technically different from torture.” 
(Kadhim 2004:M1) Rush Limbaugh trivialized torture as follows: 


This is no different than what happens at the Skull and Bones initiation 
and we're going to ruin people's lives over it and we're going to hamper 
our military effort. . . . You know, these people are being fired at every 
day. I’m talking about people having a good time, these people, you ever 
heard of emotional release? You of heard [sic] of need to blow some steam 
off? (Rush Limbaugh³, in CBS News 2004) 


Some observers go further — arguing that detainees “got what they deserved” 
(Hitt 2004). Senator James Inhofe (R-Oklahoma) was “more outraged by the 
outrage than ... by the treatment” of detainees. Despite a report from the Red 
Cross (and confirmed by the Fay-Jones and Schlesinger reports) that as many 
as 90 percent of the Abu Ghraib detainees were “arrested by mistake,” Inhofe 
insisted that “these prisoners, you know, they’re not there for traffic violations. 
If they're in Cell Block 1-A or 1-B, these prisoners, they’re murderers, they’re 
terrorists, they’re insurgents. Many of them probably have American blood on 
their hands, and we're so concerned about the treatment of these individuals.” 
(Hitt 2004) Inhofe further claimed that the prisoners in the photographs “wake 
up every morning thanking Allah that Saddam Hussein is not in charge of those 
prisons.” (Williams 2004a: 10) 

Perhaps the “they are worse than us” rationale, or what Arendt (1979) referred to as 
“nihilistic relativism,” is more troubling still. In response to assertions 
that his policies contributed to the abuse at Abu Ghraib, and in light of the 
beheadings conducted by alleged terrorist groups, Rumsfeld said that the military’s 
treatment of prisoners was not as bad as what the terrorists had done: “Does it 
rank up there with chopping someone’s head off on television? It doesn’t.” (Jelinek 
2004) In an otherwise critical discussion of the Abu Ghraib situation, but one in 
which the authors incorrectly equated all Abu Ghraib detainees with terrorists, 
Newsweek reporters John Barry, Michael Hirsh and Michael Isikoff asserted that 
American forces “arrayed against the terrorists are being tarred with the same 
epithet [as Saddam Hussein]. That's unfair. What Saddam did at Abu Ghraib 
during his regime was more horrible, and on a much vaster scale, than anything 
seen in those images on Capitol Hill.” (2004) Perhaps the most definitive version 
of this rationale was offered by Mark Bowden who argued: “The photos are said 
to prove that American forces are no better than Saddam Hussein's jailers. Well, 
no: whatever the Americans did, it is not the equivalent of cutting out tongues, 
gouging out eyes, lopping off limbs, stringing people up with piano wire, and 
executing people by the tens of thousands. Any reader of the yearly reports on 
torture published by Amnesty International and Human Rights Watch would 
pay his weight in antique dinars to stay in an American military prison if the 
alternative was jail anywhere in the Arab world.” (Bowden 2004) 

The “Saddam was worse” refrain was often complemented by reminders that 
harsh rebukes of the U.S. military risked undermining support for the war “on 
terror.” Rumsfeld, for example, criticized a Washington Post editorial on the torture, 
saying it made soldiers question their mission; he also argued that the editorial 
would make people in Iraq and Afghanistan less willing to cooperate with the 
United States and could be used as an excuse for the torture of U.S. soldiers or 
civilians (Gee 2004). 


I'm Shocked, Shocked 


Official U.S. pronouncements on prisoner abuse in Iraq are reminiscent of Captain 
Renault (in the movie Casablanca). As a Vichy officer in Casablanca during 1941, 
Captain Renault (complying with the Nazis who were in control) announces that 
he is closing the bar owned by an American expatriate. Captain Renault offers a 
disingenuous explanation: he has learned that gambling was taking place in the 
bar. Just as he announces, “I’m shocked! Shocked that gambling is going on in 
this establishment,” the manager approaches Captain Renault and hands him 
his winnings. In the case of Iraq, those expressing shock are the very officials 
who organized and supervised a systematically cruel system of detention and 
interrogation. Instead of taking responsibility for their individual actions or 
finding fault with the U.S. chain of command, high-ranking officials place all 
responsibility on the soldiers of the lowest rank and with the least control over 
the situation. In concert with Captain Renault, those at the top of the chain 
of command and the “investigations” they authorized pronounce that they are 
“shocked” torture is occurring in U.S. detention facilities. President Bush set the 
tone when he declared that the abuse of detainees was inconsistent with American 
values. He attributed the acts to a few aberrant individuals and argued that the 
abuses revealed in the photographs did not reflect “the true nature and heart of 
America." (Jacques 2004; Danner 2004) 

In response to the revelations of abuse, a number of investigative commissions 
were created by the United States government (Fay-Jones 2004; Schlesinger 2004; 
Taguba 2004). These reports became official exercises in obfuscation. Given space 
limitations, we focus here on the findings of the Schlesinger Commission report. 
In the spring of 2004, Defense Secretary Rumsfeld appointed the Schlesinger 
Commission. Its report referred to the “aberrant behavior on the night shift on 
Cell Block 1 at Abu Ghraib” and asserted that the abuses would not have occurred 
“had the officers behaved more like those on the day shift.” (Schlesinger et al. 
2004:13) “In fact, the only exceptional aspect of the abuse at Abu Ghraib may 
have been that it was photographed.” (Human Rights Watch 2004) And similar 
to the beating of Rodney King in Los Angeles that was videotaped by a citizen 
bystander, the Schlesinger report lamented that diplomatic fallout was “magnified 
by the fact that the shocking photographs were aired throughout the world.” 
(Schlesinger et al. 2004:13)⁴ 

The Schlesinger report downplayed the significance of the events at Abu 
Ghraib and other detention facilities run by the United States by placing them in 
a statistical context. “Since the beginning of hostilities in Afghanistan and Iraq, 
military and security operations have apprehended about 50,000 individuals. 
From this number, about 300 allegations of abuse in Afghanistan, Iraq, or 
Guantanamo have arisen. As of mid-August, 2004, 155 investigations into the 
allegations have been completed, resulting in 66 substantiated cases.” (Schlesinger 
et al. 2004:5) 

Although the Schlesinger and Fay-Jones reports insisted that the guards 
— and not their commanding officers — must face judgment for the abuses 
that occurred, both reports identified deficiencies in the training of military 
personnel and asserted that this poor training contributed to the abuse. In fact, 
one MP indicated that he had to learn about the Geneva Conventions on his 
own by searching the Internet (Weiner 2004). Despite acknowledging lapses 
in training, the Schlesinger report echoed President Bush’s assertion that the 
torture and abuse did not represent U.S. culture or official policy: “Our nation 
has been and will continue to be a strong supporter of Geneva and its principles. 
As a matter of policy, the United States Armed Forces shall continue to treat 
detainees humanely, and, to the extent appropriate and consistent with military 
necessity, in a manner consistent with the principles of Geneva.” (President Bush, 
in Schlesinger et al. 2004:34) 

Journalists have looked to social scientific research to understand the abuse 
in Iraq, Afghanistan and around the world. These accounts move away from an 
emphasis on a few “bad apples” and call into question an emphasis on punishing 
the lowest ranking soldiers. Zimbardo’s (1972) research figures prominently in 
these accounts. Zimbardo conducted an experiment in which “normal” college 
students were divided into guards and prisoners. In a context in which the 
“guards” received uniforms and were provided the trappings of authority, the 
“prisoners” were depersonalized and dehumanized as is the case in most prison 
contexts. But the guards displayed such cruelty that Zimbardo aborted the 
experiment for fear of mental and physical harm to subjects. At a minimum, the 
social context (and not the failures of “bad apples”), i.e., the chaos and stress of 
war coupled with the dynamics of prisoners and guards, figures prominently in 
understanding these abuses. (See for example, Donn 2004; Mattimore 2004.) But 
Zimbardo goes further. He rejects out of hand the “bad apple” thesis, suggesting 
instead that the barrel is bad. Zimbardo faulted the Bush administration with a 
“failure of leadership” and emphasized that the abusive interrogation techniques 
and harsh treatment of prisoners were “authorized from the top down” by military 
commanders and by the highest-ranking officials in the Bush administration. 
(Zhang 2004; Reiland 2004; Saletan 2004) We share Bauman’s view (2000:151-52)
that Stanley Milgram’s research offers singular insight into systemic abuse. For 
Milgram (1974), the question is why lower-ranking personnel follow orders that 
result in callous and cruel actions. His findings are disturbing: those in authority 
can manipulate the context such that compliance occurs at a very high rate. Thus, 
social science provides a clear lesson, i.e., abuse of prisoners does not emanate 
from the lowest-ranking guards and interrogators. Rather, high-ranking officials 
secured the compliance of the guards and interrogators to implement a policy 
of detention and interrogation that was systematically cruel. 

Protestations by the Bush administration and the Fay-Jones and Schlesinger reports
notwithstanding, blaming the lowest-ranking personnel lacks credibility. For
example, a memo from Lt. General Ricardo S. Sanchez approved techniques 
such as hooding, imposing stress positions and using dogs to terrify detainees 
(Washington Post 2004). Further, Hersh (2004) documents that Pentagon officials 
and military commanders discussed Arabs’ vulnerability to sexual humiliation 
prior to the invasion of Iraq. And although both the Fay-Jones and Schlesinger 
reports implied that the abuse of detainees began in 2003, the Red Cross 
formally complained about mistreatment “regularly and consistently over a 
lengthy period” dating as far back as the U.S. invasion of Afghanistan in 2001
(Thomas 2004:24). In addition to the warnings by the Red Cross, a group of 
military lawyers issued a memo in January of 2003 informing the Pentagon that 
prisoner interrogation practices at Guantanamo violated the Geneva Convention 
(Moniz and Squitieri 2004). Several months before the Abu Ghraib abuses were 
revealed, Iraqi prisoners suffered abuse at the U.S. detention center outside of 
Baghdad known as Camp Cropper (Jehl 2004). Several sources documented the 
mistreatment of the “American Taliban” John Walker Lindh in late 2001 (Danner 
2004). Further, there were at least nine suspected homicides of prisoners held by 
the U.S. in Iraq and Afghanistan in late 2002; and in several of the 37 documented 
deaths of prisoners in the two countries, the army did not even bother to conduct 
autopsies and claimed it was unable to determine the cause of death (Jehl, Myers, 
and Schmitt 2004). 

A Dotted Line from President Bush to Abu Ghraib: Explanations
Emphasizing Elites 


Neither the Fay-Jones nor the Schlesinger report assigned responsibility for the 
Abu Ghraib abuses to senior government officials — despite abundant evidence 
of their culpability. For example, Secretary of Defense Rumsfeld acknowledged 
that he had ordered an Iraqi suspect held for seven months without registering 
the individual on the prison rolls or notifying the Red Cross⁵ (Schmitt and
Shanker 2004). Members of the Schlesinger Commission did more than downplay 
Rumsfeld’s responsibility; they actively defended him as an “honest, decent,
honorable man, who’d never condone this type of activity. This was not a tone
set by the Secretary.” (Tillie Fowler, in Myers and Schmitt 2004:A1) Schlesinger
rejected calls for Rumsfeld’s resignation on the grounds that this would “be a
boon to all of America’s enemies.” As Danner observes, collectively, these reports 
“serve to isolate the acts they depict . . . from any inference that they might have 
resulted, either directly or indirectly, from policy.” (2004:40) 

Presidents have long claimed expansive powers during a time of war. During 
the Civil War, President Lincoln suspended habeas corpus and allowed alleged 
traitors to be tried before military courts. Similarly, after the Japanese attack 
on Pearl Harbor, President Roosevelt authorized the internment of Japanese 
Americans (Savage and Schmitt 2004). In both instances, the president sought 
and secured congressional approval. But President Bush has acted unilaterally with 
regard to Congress and the international community. Under Military Order #1, 
issued in November of 2001, Bush granted himself the authority, in open defiance 
of international law, to detain indefinitely any non-U.S. citizen, anywhere in the 
world (Hilton 2004). 

White House Counsel Gonzales advised President Bush that “the war on
terrorism is a new kind of war . . . [that] renders obsolete Geneva’s strict
limitations on questioning of enemy prisoners and renders quaint some of its
provisions.” (Danner 2004:42) In August 2002, the Justice Department went
further and declared that the administration was exempt even from U.S. laws 
against torture. The memo, written by then Assistant Attorney-General Jay S. 
Bybee, stated, “Congress may no more regulate the President's ability to detain 
and interrogate enemy combatants than it may regulate his ability to direct 
troop movements in the battlefield.” (Eaves 2004:6) In this memo, torture 
referred to “extreme acts . . . where the pain is physical it must be of an intensity 
akin to that which accompanies serious physical injury such as death or organ 
failure . . . because the acts inflicting torture are extreme, there is [a] significant 
range of acts that, though they might constitute cruel, inhuman, or degrading 
treatment or punishment, fail to rise to the level of torture.” (Eaves 2004:6) Even 
after establishing a threshold for torture that is far higher than human rights 
conventions establish, this memo claimed that torturing al-Qaeda captives “may 
be justified” and that international laws against torture “may be unconstitutional 
if applied to interrogations.” (Scheer 2004:B13). Recognizing that its assertions
would be rejected in international courts and forums, the Bush administration 
promulgated “Order 17” providing troops and personnel with immunity from 
prosecution by Iraqi courts for killing Iraqis or destroying property after 
“sovereignty” was returned to Iraq (Wright 2004). 

Looking at the evidence presented in the Schlesinger report as well as other 
information available to the commission, the New York Times (2004) rejected 
the Schlesinger Commission’s conclusions. It placed responsibility at the top 
and discerned a “dotted line from President Bush’s decision to declare Iraq a 
front in the war on terror, to government lawyers finding ways to circumvent the 
Geneva Conventions, to Mr. Rumsfeld’s bungled planning of the occupation and 
understaffing of the ground forces in Iraq, to the hideous events at Abu Ghraib.” 
(A5) This responsibility was not simply for allowing torture in the abstract, but
extended to the specific interrogation tactics and acts of torture (Diehl 2004).


Callous Cruelty and Demonization: Systematic Origins of Abuse 


“We must now fear the person who obeys the law more than the one who breaks it.” 
(Bauman 2000:151) Bauman began a chapter entitled “The Ethics of Obedience” 
with the preceding paraphrase of Dwight MacDonald — and Bauman did so to 
emphasize that the Holocaust was not implemented by aberrant psychopaths nor 
was the Holocaust “illegal.” Germany passed laws mandating it. As outlined above, 
there is little doubt that high-ranking Bush administration officials, especially
neoconservatives such as Vice President Dick Cheney, Secretary of Defense Rumsfeld
and Deputy Secretary of Defense Paul Wolfowitz, bear responsibility for abuses
at Abu Ghraib. But ultimately, we cannot understand these events as the result of 
the actions of such individuals — nor even a conspiracy of high-ranking officials. 
A handful of neoconservatives holding high office might steer the United States 
towards a risky and bellicose foreign policy, but these elites did not put in place 
the enduring organizational structure that led to systematic abuse of prisoners 
detained in the United States and in prisons around the world. Bauman (2000)
demonstrates that the Holocaust was the manifestation of modernity and formal 
rationality — it was a manifestation of modern Germany, not an aberration. By 
the same token, the systemic abuses perpetrated by American forces around the 
world are not isolated aberrations; they represent a highly rationalized system that 
has been put in place over the course of decades. We emphasize two dimensions 
of U.S. militarism: (1) callous cruelty (Collins 1974) that permeates U.S. military 
doctrine, and (2) the demonization/dehumanization of prisoners. 


1636 / Social Forces 83:4, June 2005 


Callous Cruelty 


Collins (1974:18) argues that “we can find a key to cruelty in the connection
between morality and the boundaries of group inclusion and exclusion.” Violence
more generally is practiced by one group against another to “dramatize the
fact that the human community and its ties extend only to a certain limit, and
that persons outside are alien and subordinate.” (Collins 1974:420) The type
of violence that guides U.S. detention and interrogation is callous: “cruelty
without passion, the kind of hardship or violence people may inflict on others
without special intent to hurt . . . [this type of violence] arises from structures
that cut off the possibility of personal empathy.” (Collins 1974:431) United
States military doctrine has put in place administrative structures that anticipate 
abusive treatment of prisoners in order to accomplish organizational goals. The 
United States minimizes its casualties by fighting from a distance. By maintaining 
technological superiority (especially in the realm of air power) the U.S. is able 
to inflict staggering numbers of casualties on enemy forces while keeping U.S. 
casualties to a minimum. This military doctrine places a premium on information 
— knowing where critical military assets (human and physical) and liabilities 
are located. Reflecting the growing importance of intelligence, the United States 
has continued to refine its techniques for compiling information through 
technological means and for extracting information from detainees. As McCoy 
comments, “[the Abu Ghraib] photos are snapshots, not of simple brutality or 
a breakdown in discipline, but of CIA torture techniques that have metastasized 
over the past 50 years like an undetected cancer inside the U.S. intelligence 
community.” (2005:209) 

At the height of the Cold War, the Central Intelligence Agency prepared 
the top-secret KUBARK Counterintelligence Interrogation report (U.S. 
Central Intelligence Agency 1963). Chapter IX of this report, “The Coercive 
Counterintelligence Interrogation of Resistant Sources,” provides guidance on 
when and how to use coercion. This decision hinges on utilitarian — not human 
rights — considerations (Bauman 2000). That is, coercion is “simply a method 
for obtaining correct and useful information.” (U.S. Central Intelligence Agency 
1963:85) The KUBARK manual suggests that physical abuse of prisoners should 
be used only as a last resort because such techniques were less likely to yield 
reliable information. Instead, based on prior research into the effectiveness of 
various techniques, the CIA favored sensory deprivation and forcing prisoners to 
hold uncomfortable positions for extended periods. Thus, isolation, humiliation 
and forcing prisoners to hold uncomfortable positions for hours — techniques 
that were used systematically from Guantanamo Bay to Abu Ghraib — were not 
“invented” by the guards at those institutions. These techniques had been laid 
out in manuals first prepared in the 1960s. 

Over the course of the Cold War, this approach to interrogation was refined, 
as is outlined in a 1983 CIA manual that: 

drew heavily on the language of the earlier [KUBARK] manual, as well
as on Army Intelligence field manuals from the mid-1960s generated 
by “Project X" — a military effort to create training guides drawn from 
counterinsurgency experience in Vietnam. Recommendations on prisoner 
interrogation included the threat of violence and deprivation. . . . The 
interrogator “is able to manipulate the subject’s environment,” the 1983
manual states, “to create unpleasant or intolerable situations, to disrupt 
patterns of time, space, and sensory perception.” (National Security 
Archives 2004:1) 


In the 1980s, in addition to refining coercive techniques, the United States 
translated this information into Spanish and incorporated these techniques 
into training provided to Latin American military personnel, including those 
attending the School of the Americas (National Security Archives 2004; see also 
Pollari 1991). The utilitarian justification for cruelty points to a strong element 
of continuity. Information is of central importance — quality information 
allows the U.S. to avoid casualties and reduce conflict for all concerned. Harsh 
treatment of prisoners for reasons of revenge or catharsis is unacceptable, but 
comparable treatment that extracts information is justified. Stated in Collins’s 
(1974) terms, callous cruelty — that which is meted out in a cool and calculated 
fashion — is condoned, but ferocious cruelty — attacks on prisoners driven 
by revenge or emotional catharsis — is condemned. Bauman makes a similar 
point when discussing the “social production of killers” (2000:240-47); that is,
Germany did not recruit anti-Semites to implement the Holocaust, and the 
United States does not recruit people who enjoy killing to implement the death 
penalty. In bureaucracies designed to deploy violence, killing and inflicting pain 
are separated from feeling; the individuals implementing these callous policies 
are ordered to separate their actions from moral assessments of killing and 
harming other people. 

The United States is well aware that its detention and interrogation procedures 
are at odds with international norms. In the introduction to the 1983 manual, the 
CIA declares that the “use of force, mental torture, threats, insults, or exposure 
to inhumane treatment of any kind as an aid to interrogation is prohibited by 
law, both international and domestic; it is neither authorized nor condoned.” 
(U.S. Central Intelligence Agency 1983) That disclaimer notwithstanding, the 
United States translated this manual into Spanish and continued to use a manual 
with instructions on coercive interrogation to train U.S. and allied forces. In a 
similar vein, although created to examine abuses at Abu Ghraib, the Schlesinger 
Commission defends callous cruelty. The report insists that techniques used at 
Abu Ghraib were integral to the “war on terror.”⁶ The Schlesinger report forcefully
criticized the International Committee of the Red Cross (ICRC), noting: “If we 
were to follow the ICRC’s interpretations, interrogation operations would not be 
allowed. This would deprive us of an indispensable source of intelligence in the 
war on terror.” (Schlesinger et al. 2004:85) Schlesinger also criticized the ICRC 
for not being in tune with current geo-political realities, stating that the ICRC
“needs to adapt itself to the new realities of conflict, which are far different from 
the Western European environment from which the ICRC’s interpretations of the 
Geneva Conventions was drawn.” (Schlesinger et al. 2004:88) Finally, echoing the 
above-mentioned theme that the exposure of abuse could be more damaging than 
the abuse itself, Schlesinger noted that “while the ICRC has no enforcing authority 
and its reports are supposedly confidential, any public revelation regarding 
standards of detainee treatment can have a substantial effect on international 
opinion.” (Schlesinger et al. 2004:85) 

The Bush administration did more than dust off interrogation manuals 
drafted decades earlier. The Guantanamo Bay detention facility was used to detain 
(nominally) dangerous prisoners in the war “on terror” and to systematically 
extract intelligence. As the Iraqi insurgency gathered momentum in 2002 
and 2003, the Bush administration made a conscious decision to apply the 
Guantanamo techniques for interrogation of tens of thousands of detainees in 
Iraq (and, apparently, at several other secret U.S. detention facilities in other 
countries). These cruel interrogation methods were employed despite the fact 
that few detainees at Abu Ghraib were of any intelligence value, and fewer still 
had any known connection to al-Qaeda or the Iraqi insurgency. Moreover, the 
Bush administration saw potential in exploiting Arab modesty over sexuality. The 
well-documented sexual humiliation — being forced to stand naked, masturbate 
and perform sexual acts — was not dreamt up by the night shift. These acts were 
photographed in order to blackmail detainees. As Hersh notes, “It was thought 
that some prisoners would do anything, including spying on their associates — to 
avoid dissemination of the shameful photos to family and friends.” (2004:44) The 
sexual abuse which shocked the world — and for which the lowest level military 
personnel are being punished — is the logical consequence of the callous cruelty 
that guides U.S. military doctrine — as extended by the Bush administration. 


Demonization/Dehumanization 


The abuse documented at Abu Ghraib and other U.S. detention facilities goes 
beyond the callousness of training manuals and interrogation techniques. Guards 
and other military personnel assaulted prisoners in cruel fashion — and did so 
without concern for extracting information. Researchers have long documented 
that the dehumanization of prisoners (the “enemy”) contributes to abuse. And 
clearly, over the course of history the United States (and other “great powers”) 
has demonized and dehumanized its “enemies.” The Japanese attack on Pearl 
Harbor seemingly justified the internment of more than 100,000 Japanese 
Americans during World War II and the dropping of atomic bombs in Hiroshima 
and Nagasaki. A letter written by Paul Glen, a military specialist from the 
Americal Division, which was involved in the My Lai Massacre, explains how
dehumanization of the Vietnamese contributed to the rape and massacre of men,
women and children. “Far beyond merely dismissing the Vietnamese as ‘slopes’ or 
‘gooks’ in both deed and thought, too many American soldiers seem to discount 
their very humanity.” (Goff 2004) 

The Bush administration used the 9/11 attacks to justify the invasion of Iraq. 
In so doing, it reinforced the demonization and dehumanization of Iraqis (and all 
Arabs) among the military — and American citizens more generally (Goff 2004). 
For example, Captain Douglas Zembiec told the Los Angeles Times, “From day 
one I’ve told [my troops] that killing is not wrong if it’s for a purpose. If it’s to 
keep your nation free or to protect your buddy . . . one of the most noble things 
you can do is kill the enemy.” (Perry 2004:120). Although he apparently later 
regretted the fact that he had killed innocent civilians, Sergeant Jimmy Massey 
acknowledged that he and other soldiers were inspired by Bush’s characterization 
of Iraq. “My president told me they got weapons of mass destruction, that 
Saddam threatened the free world, that he had all this might and could reach 
us anywhere,” Massey said. “I just bought into the whole thing.” (Floyd 2004). A 
guard who served at Bagram (Afghanistan) suggested that this demonization was 
pervasive in the U.S. military: “We were pretty much told that they were nobodies, 
that they were just enemy combatants. I think that giving them the distinction 
of soldier would have changed our attitudes toward them. A lot was based on 
racism, really. We called them Hajis, and that psychology was really important.” 
(Jehl and Elliott 2004:A1)⁷

In their book The Interrogators, Chris Mackey⁸ and Greg Miller report that
members of an interrogation unit received mail from schoolchildren that served 
to “brighten everyone’s day.” Among the messages were: “Go get the bad men;” 
and “I hope you kill them all,” accompanied by pictures of soldiers shooting at 
men in beards (2004:182). One drawing, sent by a 9-year-old girl from a Catholic 
grade school, “Our Lady of Peace,” had pictures of airplanes dropping bombs 
and “small figures at the bottom of the picture, all in turbans, fleeing for their 
life and flailing their arms.” This drawing came with the following caption: “We 
are praying for you and saying the rosary in class for you today.” (Mackey and 
Miller 2004:182) 

Perhaps the most obvious manifestation of this dehumanization is the fact 
that the Pentagon has not conducted body counts in the current Iraq war. While 
the death of the 1,000th American soldier in Iraq during September of 2004 
generated considerable media attention, far less attention has been paid to the 
number of Iraqi citizens killed.⁹ As of December 2004, conservative estimates are
that 10,000 to 30,000 Iraqis have lost their lives (Mroue 2004). Many place this 
figure at more than 100,000 (Roberts et al. 2004). As Hedges commented, “While 
we venerate and mourn our dead, we are curiously indifferent about those we 
kill. ... Our dead. Their dead. They are not the same. Our dead matter, theirs do 
not.” (2003:13-14) While U.S. officials claimed that many of these deaths were 
attributable to “insurgents and criminals,” a representative of the Human Rights 
Organization in Iraq stated the obvious: “[T]he main responsibility behind those
Iraqi civilian deaths lies with the occupation because those victims would not 
have fallen had there not been an occupation.” (Mroue 2004:A13) 


Conclusion 


The notorious Abu Ghraib photographs spawned several official investigations. 
Nevertheless, “as the months passed and more evidence accumulated the images 
have increasingly . . . [helped] to block a full understanding of how the scandal 
arose and how what Americans did at Abu Ghraib was ultimately tied to what 
they had been doing in Afghanistan, Guantanamo, and elsewhere in the ‘war 
on terror’ — and, finally, to what officials had been deciding in Washington.” 
(Danner 2004:xiii) We have turned to Bauman's (2000) account of the Holocaust 
to understand the systemic nature of U.S. abuses that occurred at Abu Ghraib 
and in the “war on terror” more generally. Bauman’s lesson is not restricted to
episodes of genocide.¹⁰ On the contrary, Bauman points to a much larger lesson,
one that applies to a number of social situations and certainly to the systematically 
callous violence that is deployed by the United States: “[I]n our modern society, 
people who are neither morally corrupt nor prejudiced may partake with vigor 
and dedication in the destruction of targeted categories of human beings; and 
that their participation, far from calling for the mobilization of their moral or 
any other conviction, demands on the contrary their suspension, obliteration 
and irrelevance.” (Bauman 2000:250)

For nearly two millennia, the reliance on torture has been associated with 
empires and tyrants (McCoy 2005). The Bush administration has publicly 
embraced torture, where necessary, to prosecute the “war on terror.” This embrace
of abuse and torture is manifest in the promotions awarded to high-ranking 
officials responsible for managing a global network of detention facilities that 
rely on abusive treatment of detainees, in contrast to the trials and convictions of
low-ranking soldiers for abuses at Abu Ghraib. President Bush has used the “political
capital” he ostensibly gained in the 2004 election to appoint individuals who 
have well-documented records of either explicitly or implicitly condoning 
human rights abuses to key cabinet positions: Rumsfeld remains as Secretary 
of Defense; Gonzales, who, as noted above, declared the Geneva Conventions 
to be obsolete, was appointed attorney-general; Michael Chertoff, a key figure 
in the detention of several hundred (primarily Arab and South Asian) innocent 
immigrants (McGovern 2005), was confirmed as secretary of homeland security;
and John Negroponte, who was a political officer at the U.S. embassy in Vietnam 
during a period of gross human rights abuses and more recently was involved 
in covering up CIA involvement in human rights atrocities while serving as 
ambassador to Honduras (Cohn 2005), was Bush’s choice to serve as director of 
national intelligence. 

While those who controlled and managed a global network of abusive
detention facilities received promotions, lowest-ranking soldiers have faced 
stiff punishment. Soldiers are not being punished because they tortured Iraqi 
detainees — coercion and torture are condoned and systematically used by the 
United States. But these soldiers made two inter-related mistakes: (1) they failed 
to focus on the instrumental goal of information extraction, and (2) they abused 
prisoners for reasons of emotional catharsis or revenge. Had they committed the
same actions in a cold and calculated fashion, especially if these abuses yielded
information (as outlined and defended by CIA training manuals, the Schlesinger
report, Secretary of Defense Rumsfeld and President Bush), the horrific actions
of U.S. soldiers at Abu Ghraib and elsewhere would have been fully acceptable. Seen in
this light, perhaps President Bush's comment in August of 2004 was not really a 
faux pas, but a Freudian slip: “Our enemies are innovative and resourceful, and 
so are we. They never stop thinking about ways to harm our country and our 
people, and neither do we.” (Morford 2004) 


Notes 


1. Gonzales’s phrasing is a slight alteration of the Geneva Convention, Article 3. In the wake 
of the atrocities and carnage of World War II, this protocol was one of the United Nations’ 
first accomplishments and explicitly prohibits “outrages upon personal dignity.” (United 
Nations [1949] 2004) 


2. Although the intention of those taking the photographs is unknown, for the detainees, 
the photos had the potential of exposing their humiliation to family members and friends, 
thereby serving as a “shame multiplier.” (Danner 2004:19) 


3. Although it is obviously not possible to determine whether President Bush agreed with 
Limbaugh’s comments, it is perhaps notable that, in celebrating the “liberation” of Iraq, 
Bush commented, “Today in Iraq, more than 170 newspapers are being published, and I 
saw the other day that they even got talk radio. I don't know if they've contacted Rush yet.” 
(Postman 2004:B3) 


4. A similar theme appeared in the response of military officials to additional charges of 
abuse against 26 soldiers resulting from an investigation into the deaths of two detainees 
at the U.S. base in Bagram, Afghanistan. The officials claimed that they did not expect the 
cases to have as powerful an impact as those from Iraq. “This is in part because the alleged 
offenses occurred two years ago and appear to be fairly limited in number, but also because
no photos of abuse have been found by investigators.” (Ricks 2004)


5. Later estimates placed the number of “ghost detainees” at possibly 100 (Guardian Unlimited 
2004). 


6. Defense Secretary Rumsfeld shared such views, at one point commenting that complaints 
about prisoner abuse amounted to “isolated pockets of international hyperventilation.” 
(Hersh 2004:17) 


7. Demonization is by no means restricted to soldiers. Consider these letters to the editor 
of Newsweek published shortly after the revelations of abuse at Abu Ghraib. “Having a few 
Iraqi prisoners walking around naked doesn't even come close to the actions of the Iraqis.
These are the same people who danced in the streets after the attacks of September 11. These 
are the same people who videotaped beheading an innocent young American. These are the 
same people who cheer every American death. And the media are screaming bloody murder 
after seeing a few prisoners in naked poses? Give me a break.” (Caskey 2004:7) “Personally, I 
have little sympathy for these prisoners. If Iraqis can generalize their hatred to all Americans, 
then I can certainly generalize to blame all Iraqis. We might as well try to make friends with 
rattlesnakes.” (Smith 2004:7) 


8. Chris Mackey (a pseudonym) served as an interrogator in Afghanistan. 


9. One of the first things the military did upon entering Falluja was to take over a hospital, 
apparently because officials believed that doctors were inflating the number of deaths of 
Iraqis, thereby “milking sympathy for untoward ends.” (Williams 2004b) 


10. Ritzer (1996), for example, discusses Bauman’s work when introducing the concept 
of McDonaldization. Ritzer includes a discussion of Bauman because the process of 
rationalization that leads to callous and calculated violence has a great deal in common with 
the social organization that produces fast food (and a market to consume fast food). 


Contributing to Scholarship and Theory through
Public Sociology* 


DONALD W. LIGHT, University of Medicine & Dentistry of New Jersey 
and Princeton University 


Abstract 


The derivative nature of public sociology resting on academic research and theory 
is widely perceived as a central challenge to establishing its legitimacy for faculty 
recruitment, promotion and graduate training. While public sociology can be 
derivative, it need not be. This brief essay explains how, with three illustrations 
of large public campaigns that succeeded in changing social institutions and also 
produced articles in major journals, grants and a book. 


In the Social Forces symposium on public sociology, Michael Burawoy outlined 
four types of sociology — professional, critical, policy and public. He observed 
that “an effective public or policy sociology . . . depends upon the professional 
sociology that lies at the core of our disciplinary field.” (2004:1609) Training 
and certification depend on it, he explained, and the essays by practitioners in 
the Boston College symposium on public sociology bore this out (Burawoy et 
al. 2004). This derivative character of public sociology poses a central problem 
to establishing its academic legitimacy, a problem that kept cropping up in 
discussions during the thematic sessions of the 2004 ASA convention. For 
example, a graduate student expressed her frustration at being told to keep a lid 
on her desire to serve the public, and William Julius Wilson replied, “Wait until 
you get tenure. I’m sorry, but that’s the way it is.” (2004)

It need not be that way, however. Public sociology can be practiced so that it
makes original contributions to scholarship and theory. That graduate student can
work for a better society and also do first-rate work that earns her tenure. This
is the kind of public sociology I have been practicing for more than 20 years. 
By using for one’s “foundation of legitimacy” a significant public problem, and
by using as one’s “basis for accountability” dialogue and communications with 
relevant publics, one can design research to assess the nature and extent of the 
problem. This process often leads to re-conceptualizing the problem in ways 
different from prevailing academic approaches, and, thus, to making original 
conceptual contributions as well. This process is an important way in which, as 
Burawoy writes, “professional sociology becomes stronger, more vital and less self- 
referential from continuous contact with its publics.” (2004:126) Tittle expresses 
concern about endangering “what little legitimacy sociology currently has,” 
but good public sociology can actually strengthen and broaden that legitimacy 
(2004:1641). 

Often, elites use their knowledge “to mystify and subdue their people,” as 
Frances Fox Piven (2004:33) has described. Demystifying the accounts of elites 
and democratizing arcane knowledge are critical tasks of public sociology; such 
work unveils the false consciousness created by elites for their publics, and is fully 
compatible with good research and theory. The intensely challenging process of 
researching the accuracy of elite accounts, exposing false knowledge and designing 
structural changes, in fact, helps generate new research findings and new ways 
to conceptualize the sociological nature of a problem. 

As an example of an effective project, let us look at the successful, four-year 
campaign against a Blue Cross Blue Shield plan that tried to abandon the social 
and legal mandate to provide equitable and affordable coverage for individuals 
and small groups. Blue Cross Blue Shield of New Jersey (“Blue Cross”) had 
mounted a large lobbying campaign in the mid-1980s to persuade the legislature 
and public that it was going bankrupt from losses on its equitable risk-blind 
policies for individuals and small groups. Providing affordable, community-rated 
policies (i.e., the same premium regardless of health profile or risk) was a major 
purpose for creating Blue Cross in the 1930s. Based on related research for the 
state legislature, I knew from financial records of Blue Cross that its losses were 
not coming from its community-rated policies for small groups and individuals 
but from its big, risk-rated commercial policies, which Blue Cross was blaming 
on its mandated policies. The false campaign, further research revealed, was being 
orchestrated by a new generation of ambitious leaders that wanted to transform 
Blue Cross into a large commercial insurance company and get rid of its historic 
non-profit mission. They proposed “demographic community rates” that varied 
by age, gender and zip code so that they discriminated against older subscribers, 
women and the poor, often minorities. It was risk-rating using “demographic” as a 
fudge on “community rating.” A similar change in corporate culture transformed 
Blue Cross Blue Shield plans across the country. 

Initially a research assistant and I translated our research into press packets 
to inform the legislators and news media that Blue Cross was violating its public 
mandate, creating a false crisis of bankruptcy, and using this to charge higher 
rates to women and disadvantaged minorities. Since Blue Cross had long, deep 
ties with the Department of Insurance, the governor's office and the legislature 
— and was not accustomed to incisive challenges with counter facts and figures 
— it was not long before my dean called me in and informed me that the 
governor's office had spoken to the university president about getting me to 
stop. He also told me not to conduct research or related activities on this project 
during work nor to mention my academic position as a professor of health care 
policy. I complied and turned to the New Jersey Public Health Association as a 
new base for the campaign. Its board quickly understood the issues, and together 
we built a large, statewide coalition that included the NAACP, NOW, AARP 
and the Council of Churches — organizations that together represented about 
one in every seven adults throughout the state. Getting such a large and diverse 
coalition together was difficult and benefited from sociological studies of social 
movements. We created, in effect, a public. Granted that much public sociology, 
as Burawoy writes, “focuses on solutions to specific problems defined by clients,” 
most of my campaigns alert and mobilize a public to a specific problem that it 
does not know is being imposed on it (2004:1608). 

Because of my training, I did most of the research. The key findings were 
translated into simple bar graphs and phrases for TV and press conferences. 
With the help of the NAACP Legal Defense Fund, we also began to mount a 
legal challenge to the discriminatory rates as a violation of Blue Cross’s enabling 
legislation and charter. We then appealed to the Public Advocate to use taxpayers’ 
money for the costly court case to defend the public interest by suing BCBS for 
violating its legal mandate. When the New Jersey public advocate realized 
the breadth of our coalition and heard its legal team’s arguments, he agreed 
to prosecute. The case led to an appellate court ruling to reaffirm Blue Cross’s 
enabling legislation and require BCBS to roll back premiums to reflect the true 
community rate for about 600,000 people. 

During the next two years, the Blue Cross executive team tried other tactics 
to drive subscribers out of its historic community-rated policies, first by filing 
premium increases that were four times the increase in medical costs, and then by 
dropping coverage for costly services used by seriously ill patients. We mounted 
two more statewide campaigns to oppose these tactics, and defeated each. These 
three campaigns set the stage for a successful effort to design a new way for all 
insurers to offer community-rated policies. 

These three campaigns took constant effort and did not contribute to 
professional sociology. But they led me to realize that graduate training at the 
University of Chicago and Brandeis had not, as Tittle points out, provided me 
with any basis for making moral decisions, what Burawoy aptly calls “foundational 
knowledge” about social justice (2004:1607).¹ I therefore applied for the faculty 
fellowship at Green College, Oxford, to read moral philosophy relating to the 
issues raised by this campaign. Study at Oxford helped me convert advocacy to 
scholarship, theory and growth as an intellectual. It led to a widely used article in 
the Journal of the American Medical Association on actuarial versus social fairness 
and on the techniques used by commercial insurers to minimize enrolling higher 
risk people, to minimize how much of the medical bills among those enrolled is 
covered by their policies, and to minimize how much they actually pay of what 
they owe (Light 1992). This article, together with the reading at Oxford and a 
detailed understanding of health insurance as a social institution, became the basis 
for being selected as the DeCamp Visiting Professor in Ethics and the Biological 
Sciences at Princeton University. That position led to a grant to develop a set of 
benchmarks for Rawls's theory of social justice. The resulting book, Benchmarks 
of Fairness for Health Care Reform (Daniels, Light and Caplan 1996), transposes 
a theory of justice into specific criteria and measures for comparing different 
proposals from right to left. The benchmarks have been subsequently developed 
by WHO and the Rockefeller Foundation as an evidence-based practical tool 
for allocating scarce resources fairly in poorer countries. So far these standards 
have been used in Mexico, Guatemala, Cameroon, Zambia, Thailand, Yunnan, 
Bangladesh, Pakistan and Vietnam (Daniels et al. 2000, 2004). 


Demythologizing Elite Accounts 


Demystifying is a major public role that sociologists and other social scientists 
can perform, and it can contribute to serious theoretical and empirical work at 
the same time. In my experience, the term, “demystifying,” is inaccurate, because 
the problem is usually not a fog of mystery. It is usually clear, misleading accounts 
or myths constructed by power elites to pursue their own interests at the expense 
of the public interest. Behind the myths often lie biased research or buried facts, 
so that the major task is to de-mythologize and democratize knowledge. This 
often requires a complete re-framing of what the “problem” really is. Theoretical 
development naturally follows, usually based on existing theory but advancing 
new insights or applications — just as academic theoreticians do, except done in 
the service of public sociology. The ways in which a number of sociologists have 
been contributing to research and theory in this manner need fuller attention 
because such work holds the key to giving public sociology academic respect. 

Policy and public sociology can also be quite often intertwined. In Burawoy’s 
four-fold scheme, policy sociology is characterized as “directed at patrons 
or clients,” using “concrete” knowledge (Burawoy 2004:1607). However, the 
contributions of a good academic in policy work are often more conceptual or 
research-based than this characterization implies, and it may be directed at the 
public rather than at patrons. 

An example of combining policy work for elite patrons with public sociology 
occurred in a successful effort to preserve the equitable health insurance scheme 
in Ireland. That country has a universal health care system with a widely used, 
community-rated supplementary insurance system that offers subscribers quicker 
and upgraded services. Under new rules from the European Union, this system 
had to be opened to competition. A large, international insurer found loopholes 
through which it could risk-rate the market and attract away younger, healthier 
subscribers, thus leaving the older, more ailing subscribers with the traditional 
non-profit insurer so that its medical expenses would then continuously exceed 
the community-rated premiums and lead to bankruptcy. Leaders of the old system 
asked me to help them close the loopholes and restore equity. 

The campaign goal was to use a sociological analysis of the market to persuade 
the minister of health and key legislators that these new policies should be banned 
and the loopholes closed. We organized a campaign on the societal consequences 
of risk-rating, and wrote articles and editorials explaining what the problems 
were. Within weeks, the minister of health stopped the new policies from being 
sold, announced that the loopholes would be closed, and restored the relative 
equity of Ireland's two-tier system (Light 1998). This experience led to thinking 
of new designs for equitable health insurance because flat community-rating is 
not sustainable over the long run. 


Class Discrimination and Waiting 


My longest and most complex undertaking in public sociology has involved 
attempting to change the infamous waiting lists of Britain's NHS (National Health 
Service). Long waits are probably the best-known and most infamous characteristic of 
the NHS. Field work began in 1990, and I concluded that organizationally, waiting 
lists could be avoided altogether by giving patients appointments. Although they 
would not shorten the wait, appointments are much more humane than having 
to stay close to home, week after week for a year or two, waiting for word that 
the specialist can see you. I titled my first article "Medical House Arrest" (Light 
1990). But it took several years of further interviews and field work while working 
on other projects in the UK to realize that so-called waiting lists were a myth, an 
account by a power elite that dated back to the origins of the NHS. By degrees it 
became clear that they were not lists because there was no queue. They were more 
like waiting pools into which patients were tossed to tread water until a consultant 
pulled them out. Everyone with an urgent need gets treated right away in the NHS, 
but elective cases are “put on the waiting list.” But “the waiting list,” I discovered, 
was a myth, too. My research identified five main types of the thousands of 
individual “lists” or pools. Furthermore, consultants (senior specialists) used their 
“lists” to build private practices alongside their NHS jobs and increase funding for 
their units. This paternalistic system not only discriminated by class and gender, 
but it also increased patients’ anxiety with uncertainty over when treatment for 
a diagnosed condition would be made available. 

Eventually, this research culminated in an entirely new account of “waiting 
lists,” not as an unavoidable part of a national health service, but as a complex 
institution of networks that maximize the ability of specialists to practice as they 
wish and build private practices on the side. An official investigation found that 
surgeons operated on only four to six NHS patients a week. No wonder patients 
had to wait so long! “Betrayal by the Surgeons” in The Lancet was the first of a 
series of efforts in journals, newspapers, television interviews and ultimately a 
Parliamentary hearing, that demythologized the standard waiting list account 
and offered a practical alternative that would shorten waiting times and reduce 
discrimination against the lower and working classes (Light 1997, 2000). Finally, 
after several years of conspicuous silence from the leaders of the world’s largest 
health care system, the minister of health embraced much of the new account, 
spoke of the past “sins of the NHS,” and began a series of basic reforms much 
like our organizational redesign. Waiting lists were taken out of the hands of 
consultants so that elective referrals have now become more fair, humane and 
free of conflicts of interest. Referral and appointment centers have been set up 
with incentives and disincentives to shorten waiting times. Although vestiges 
of the old system still remain, waiting periods have decreased dramatically. It 
takes a long time for fundamental changes to work their way through one of the 
world’s largest organizations.² So far, this work has led to a new theory of English 
waiting lists and a special issue of Sociology of Health and Illness on rationing 
(Light and Hughes 2001). 

The above are only three of many examples that illustrate the potential for 
public sociology to contribute to the professional research and theory worthy 
of promotion and tenure. Nielsen (2004) warns that practitioners of “public 
sociology” can reduce it to ideological clashes and destructive academic politics, 
but so can great academics. I have watched them do it for years; so would it follow 
that we should avoid academic sociology as well? Bray’s (2004:1629) five reasons 
for why “Burawoy’s public sociology may fail” constitute a solid list of issues 
and flaws that need to be addressed, but these are not fatal. I see no compelling 
critique thus far, so my advice to that graduate student: “Serve the public, but 
in ways that will get you tenure.” 


Notes 


1. Perhaps professional sociology thinks itself too unscientific to teach graduate students 
about just and unjust social and political structures. 


2. The NHS employs about one million people. 


Latin America: Societies, Markets and Politics 


JUDITH BLAU, EDITOR 


As an introduction to this supplement, which centrally deals with the responses 
of various Latin American countries to globalism, I clarify the reasons why 
Social Forces is devoting space in this journal as well as on the website, and then 
explain how this project evolved. This is important because it illustrates how 
international collaboration evolves around shared themes and complementary 
interests. 

As American readers of Social Forces realize, the U.S. academic community 
is enormous by virtue of the size and wealth of the country and the sheer 
number of institutions of higher education and research centers. Its members are 
productive, well endowed with research funds, and have little difficulty accessing 
literatures in their native English language. As a result, the members of the U.S. 
academic community often err, paradoxically, both on the side of provincialism 
and of hegemonic attitudes — mistaking the United States for the world and 
blithely exporting American knowledge to the far corners of the globe. While 
we may think we are being generous, our concepts, perspectives and viewpoints 
are not universally shared. There are important exceptions. Anthropologists have 
struggled for decades with pluralism and otherness, the ethics of cross-cultural 
studies, and have been guided by an unusual modesty. The field of international 
relations has undergone great changes with the decline in the significance of 
national borders and economic globalization. Other important exceptions are 
the “hard sciences.” We might say a quark is a quark is a quark (even more 
fundamentally so than a rose is a rose is a rose). 

However, sociology is particularly susceptible to provincialism, and American 
sociologists, in my view, often fail to recognize the differences between societies. 
Thus I asked Professor Alcántara, Director of Instituto de Estudios de Iberoamérica 
y Portugal, Universidad de Salamanca, to be guest editor of this supplement. He 
may not share my views about American sociology, and in fact we have never 
discussed these epistemological issues, nor have we ever met. I did discover 
that Alcántara has close connections with the Latin American specialists at the 
University of North Carolina, Chapel Hill, and they were extremely helpful in 
obtaining support for translations and copyediting from the university's Institute 
of Latin American Studies (ILAS), the Marjorie M. Schwab Fund through the 
Institute for the Arts and Humanities and the College of Arts & Sciences. Without 
this support we could not have launched and carried out this project, and I 
especially want to thank Arturo Escobar and Robert Anderson from ILAS, and 
Alberto Moncada, who introduced me to Manuel Alcántara. 

What made this comparative project possible is that Manuel Alcántara has 
collaborative research ties with colleagues throughout Latin America. He asked 
some of them to write individual case studies devoted to the changing political, 
economic and social conditions in each of seven countries — Argentina, Bolivia, 
Brazil, Venezuela, Mexico, Colombia and Chile — as well as one on Central 
America. He also asked Esther del Campo to write an essay on women’s political 
representation in Latin America, and Fátima García to write a review of recent 
books on Latin America. 

By showcasing the works of Latin American social scientists, we believe we 
provide our American readers with a special treat, not only with the opportunity 
to learn more about Latin American countries and the impact of globalization on 
many of them, but also with a window into Latin American social science as a set 
of perspectives that are somewhat different from those in U.S. social science. 

Five of 11 articles written by Latin American social scientists are listed in 
the table of contents and published in this issue. All 11 will be published on our 
website and archived by Project Muse and JSTOR, including the following: 


Petroleum and Democracy in Venezuela 
Roberto Briceño-León, Laboratorio de Ciencias Sociales (LACSO) 
Universidad Central de Venezuela 


Social Capital and Democracy in Mexico: The Social Limits of the Political 
Change 
David H. Corrochano 
Facultad Latinoamericana de Ciencias Sociales (FLACSO) México 


Institutions and Citizenship in Colombia, The Ups and Downs of a Difficult 
Relation 
Gabriel Murillo-Castaño and Carolina Victoria Gomez Segura 
Universidad de Los Andes 


The Chilean Dilemma: Between Economic Development and the Deepening 
of Democracy 
Antonia Santos 
Universidad de Salamanca 


New Actors, Democracy, and Challenges Facing Central America on the 
Threshold of the 21st Century 
Juany Guzmán León 
Luis Guillermo Solís Rivera 
Universidad de Costa Rica 


Book Reviews 
Fátima García 
Universidad de Salamanca 


Politics and Society in Latin America at the Start 
of the New Millennium* 


MANUEL ALCÁNTARA, Universidad de Salamanca 


Abstract 


This article discusses the existing relationships between politics and society at the 
start of the twenty-first century in Latin America, a region that is characterized by 
its heterogeneity and yet is addressed here as a homogeneous whole. Politics is treated 
in terms of the extension of democracy, the changing role of institutions, the weakness 
of political capital, difficulties of representation, and the new regionalist 
integrationism. When engaging in social analysis, this work addresses divisions among 
social classes, large and growing inequality, poverty, lack of security, and multiculturalism. 


Introduction 


Reflection on the relationship between politics and society in Latin America is 
not new. It has always sought to explain the relationship by using theories that 
accentuate factors from outside the region as well as others of a more domestic 
character. Without going into an exposition of these, it is enough to remember 
the accounts of Latin America made from the point of view of populism, 
modernization, elitism, political culture, dependency, and neo-institutionalism, 
among others. The different efforts carried through have always had to face the 
heterogeneity of the countries that make up the region. As a consequence of an 
excess of attraction to the Southern Cone, they have had to generate cognitive 
frameworks that are much too broad and that project inaccurate conclusions 
and resounding gaps of knowledge. Besides, from the political perspective it is 
difficult to find periods in which the enormous differences between countries 
were not the dominant characteristic. 

* Direct correspondence to Manuel Alcántara Sáez, Universidad de Salamanca. E-mail: 
malcanta@usal.es. This article was translated by Silver Moon, Duke University. 

However, at the start of the new millennium, and after having witnessed the 
profound transformations in the region due to neoliberal reforms in the last 
decade of the twentieth century, things are different in that, at least in politics, 
Latin American countries have shown outstanding homogeneity. Although there are 
still differences, to various degrees, in constitutional designs, in the character 
and number of existing political parties, in the degree of decentralization, in 
an almost unrestricted acceptance of the market in the face of the formerly all- 
powerful and central state government, and in the patterns of political culture 
of their citizens, democracy as polyarchy is the logic behind the workings of 
today's political systems for all the countries except Cuba. So much so, that this 
practical uniformity has opened an unusual academic discussion that is centered 
on democracy¹ itself, which would not come up if its existence were not taken 
for granted. 

At the same time a desire has grown to escape regional stereotypes of a 
generalizing character. Now the desire is to focus on studies of national cases, 
albeit from a comparative perspective, and using similar frameworks of analysis. 
The very integrationist impulse of continental blocs helps to combat globalizing 
visions, as well as emphasize the singularity of strictly national projects. In this 
way, other countries helping to define fully individualized models have imitated 
the different road followed by Chile in its international commercial strategies, 
started more than a quarter century ago. 

The following pages describe a complex reality that places emphasis on the 
more conspicuous points brought up by those doing deep analysis. Some causal 
relationships can be sketched. First, the text lends space to a characterization of the 
politics of the region upon the basis of the unquestionable advance of democracy 
as a ruling model to the development of the different national political systems. 
Second, it tackles aspects that define types of societies where marginalization, 
poverty, inequality, violence, and scarce attention to current multiculturalism are 
extensive. Lastly, it makes a case for an interconnection of both spheres, political 
and social, from a perspective of an incomplete citizenship, which demands an 
articulated process that is still to be accomplished in Latin American countries. 


Politics in Latin American Countries 


From among the differentiating aspects that can be used to understand politics, 
five are used here: (1) the expansion of democracy, (2) the changing role of 
institutions, (3) the weakness of political capital, (4) the difficulties of political 
representation, and (5) a new integrationist regionalism. Each of these contains 
a complex of interrelated elements that benefits from looking at them jointly to 
be able to comprehend the political system. 


Democracy in Latin America 


The report produced by the United Nations Development Programme 
(PNUD, by its Spanish acronym), upon whose data this work is based, confirms that 
democracy is firmly in place (2004a). Democracy is understood as the way in which 
political personnel are chosen through periodic, free, competitive, and clean electoral 
processes under universal suffrage. This report places all the countries considered during 
the period of 1990-2002 inside the index of electoral democracy (defined by these 
variables: right to vote, integrity and freedom of elections, and elected public 
posts) within a bracket of .80 and 1. The only exceptions are Colombia (.57) and 
Guatemala (.60). The expansion of the democratic formula to the totality of Latin 
American countries, and for a considerable period of time besides, is a historic 
moment, given that this situation had never occurred before. 

However, the levels of democratic development are very different according to 
country. A recent study (Poliat 2003) establishes a classification through which, 
in 2002, 9 of the 17 countries, according to the classic classification table, would 
stand between 1 and 10, and 3 countries would earn an outstanding grade: 
Uruguay, Chile, and Costa Rica. These marked differences between countries 
are also supported by the annual studies done for Freedom House and other 
organizations like it. If democracy is indeed present in the region, the intensity 
of its presence, according to quality, shows differences between countries. From a 
comparative perspective, history always speaks of the exceptionalism of the three 
aforementioned countries. However, differences have become manifest within 
each of these countries, from period to period, without many years passing in 
between. According to this, if one considers democracy in Peru during the last 
years of the 1990s, and compares these years with the first years of the following 
decade, surprising changes toward amelioration emerge. Almost in the same way, 
and within the same time period, one could consider the case of Argentina. The 
personal reliable seal of a governing figure or a profound socioeconomic crisis 
can be at the base of these changes. 


Political Institution Reform 


Countries in Latin America share presidentialism as a generalized mode of 
government. However, differences among them are marked with regard to 
relations between the executive and the legislative branches of government, the 
number of political parties, the existence of one or two houses, and the degree 
of regional influences, making speaking of a single model very difficult. There 
are also variations, although fewer, with regard to the formula for presidential 
elections, where ballots have gained precedence, and to other changes affecting many 
countries, such as changes in reelection prohibitions and the incorporation of new institutions 
such as the General Councils of Judicial Power, the Public Attorney and others² 
that introduce enormous heterogeneity to the region. 

In general, the obsession with holding institutions as a panacea that can 
articulate better governments, along with a feverish search for governability, 
have been the common denominators throughout the 1990s, when practically all 
the constitutions and principal political laws were reformed to different degrees. 
Following an intense process of change, the continuous cycle of political reform 
has not stabilized and remains open at the close of 2004, such as is the case in 
Colombia, Ecuador, and Bolivia. The neo-institutional push remains present and 
maintains its strong fetishist component in such a way that it is estimated that 
“Latin American countries must increase significantly the capacity of political 
institutions to reach the objectives for which they were created so that poverty and 
inequality could be diminished and they can become more competitive.” (Graham 
et al. 1999:1) Animated, in large part, by multilateral organizations, some with 
vocation for cooperation toward development, and encouraged notably by the 
international bureau, which had found a fertile territory for its expansion, the 
demands for reform have been a mantra that has consumed financial resources, 
efforts, illusions, and vain promises with immediate but precarious effects. 
Without devaluing the need for changes with very different meanings for the 
needs of each country, this homogenous wave of reforms, at times because of 
imperatives of organizations and of donating countries, and others because of 
mimetic demands by national political elites,³ has not ceased. 


Political Capital 


Political capital, understood as the degree of trust in the different institutions 
and political actors, supposedly suffers a process of de-capitalization due to two 
factors: the poor performance of the economy, with regard to maintenance of 
growth rates through the 1990s, and failure to diminish the inequality gap, as 
well as an increased awareness of generalized corruption. In fact, the regional 
index of trust in institutions (the three state powers as well as municipal powers) 
and actors (political elites and parties) was 1.91 in 2002 (on a scale of 1 to 4), 
with clear differences between countries. In that particular moment, Argentina 
(1.43), immersed in a serious crisis, was clearly at the bottom of the spectrum, 
far beneath Guatemala (1.71), Ecuador (1.72), and Colombia (1.77). The highest 
values belonged to Honduras (2.24) and Costa Rica (2.23) (PNUD 2004b). 

If, on the one hand, the health of the regional economy improved considerably 
at the start of the new century, with practically generalized and constant growth 
rates (breaking the negative stretch of the second half of the 1990s), the same 
cannot be said of two other factors. Without trying to prejudge what we will 
see next, here is certain empirical evidence about the state of affairs. The Gini 
coefficient for 1999 showed values greater than .516 for all the countries of Latin 
America except Uruguay (.440), Costa Rica (.473), and Venezuela (.498), with a 
world average of .381 (PNUD 2004a). 

Indicators of corruption awareness in 2002, as well as studies by organizations 
such as Transparency International and the World Economic Forum, show a region with 
high values that also registers, nevertheless, differences between the countries, with 
Argentina, Bolivia, El Salvador, Guatemala, Honduras, Nicaragua, Paraguay, and Venezuela 
placed below the regional average (PNUD 2004a). 


Political Parties and Representation 


In some countries politics are still understood more in terms of participation 
instead of representation (Touraine 1989). This explains the turbulence of movements, 
arising largely in the Andean world, which, notwithstanding common 
origins, acquire different components. Venezuela stands at one end because it 
sustains itself upon oil revenues, and because it has raised itself over a caudillistic 
leadership. Bolivia and Ecuador stand at the other end because they are more 
diffuse and heterogeneous expressions. Colombia and Peru confront a situation 
of the practical disappearance of their parties. In Colombia, the Uribist leadership 
fights for an unusual presidential re-election, in historic terms, among discredited 
Parties, but regional clientist leaderships, as well as enormous difficulties for the 
stabilization of a political space for the left, also atomize them. In Peru, the crisis 
of the end of the decade of the 1980s only became deeper during Fujimori's 
period. Today the country finds itself facing wide party emptiness, where only 
the Aprista Party rebuilds itself on a platform based on bringing Alan Garcia to 
the presidency, three glorious years after his ignominious exit. 

Nevertheless, political parties remain and continue to articulate a large
part of political life in most countries (to the above-mentioned exceptions in the
Andean countries we should add Guatemala). There are still firm party identities,
and parties with clear ideological semblances are split among those that show 
very institutionalized modes of action and those with a more joint-electoral 
character (Alcántara 2004). 

It is true that a representative sample of the political elite is found, barely, 
in Uruguay and Honduras (PNUD 2004a). This aspect is complemented by the 
trust index that parties receive from Latin Americans. Only Uruguay, Costa Rica 
(1.89), and Honduras (1.80) approximate intermediate values on a scale from 1 to 
4. The low profile shown by parties, in spite of the large share of Latin American
public opinion that considers them indispensable to the proper
functioning of democracy, perhaps has to do with historical reasons as well as the
successful implantation of a neoliberal model. The first refers to the
continuous interruptions of democratic political life, the proscription of parties by
nondemocratic governments, and the systematic persecution of a large number
of party leaders. With regard to the success of the neoliberal model, particularly
in its political aspect, its ability to weaken significantly the
bases of organized class struggle and the effective channels for mobilizing
popular discontent is well proven (Portes and Hoffman 2003).


Integrationist Regionalism 


Integrationist rhetoric has again reached an apogee owing to the different patterns of
subregional integration, as well as of insertion into continental blocs or insertion
into the European Union, so far without concrete results. Bilateral strategies of special
accords with the United States and the European Union, in which Chile is
one of the most active countries, are accompanied by others of multilateral
character that seek to revitalize old integrationist patterns in Central America,
the Andean region, or more recently Mercosur, without abandoning the continental
patterns of free trade that the United States has supported since the start
of the 1990s.

National political changes in recent years have brought about innovative
proposals of regional scope that, however, have not yet become well defined. On
the one hand, there is the strategic alliance between Havana and Caracas, which,
based on a cooperation agreement signed in December of 2004, seeks to propel
an initiative of Latin American integration named ALBA (Bolivarian Alternative
for the Americas). ALBA aims to be the response to ALCA (Free Trade Area
of the Americas), which is promoted by the United States. On the other hand,
Brazil has emerged in the international arena under its President Lula
da Silva. He has strengthened Brazil’s role in the World Trade Organization, while the
country maintains its aspiration to become a permanent member of the United Nations
Security Council and acts as an unequivocal Latin American point of reference, in contrast to
Mexico's positions, inevitably closer to the United States. The preferential visit to
Brasilia made by the Chinese president in November of 2004 is another important
sign of recognition of the voice that Brazil has in the international arena.


Latin American Society 


Although neoliberal reforms in the political sphere made a notable impact, such 
as the de-structuring of the bases of political mobilization just mentioned, it is in 
Latin American society that the reforms have had the greatest repercussions. Some 
traditional characteristics of the social map have worsened, such as inequality, and 
others have been profoundly modified, as has happened with communitarianism 
to the benefit of much more individualistic guidelines of action. 

If the previous section signaled the difficulty of separating the aspects chosen
for analyzing the politics of Latin American countries, even greater complexity
arises in tackling the social problems, due to the high degree of
interaction among the selected variables. The enormous and growing inequality, the
scathing poverty, the disquieting insecurity, and the existence of divisions by social
class and ethnicity, which introduce fears into the project of multiculturalism, are
the five axes that compose the pentagon in which the inhabitants of Latin American
countries are located.


Social Classes 


The analysis of social structure through the use of the concept of social class 
continues to be pertinent if it is conceived not only with the traditional criteria 
by which a class is formed by those that control the means of production, but 
also by those that have control of the labor power and by those rare intellectuals 
that contribute new forms of knowledge. From this perspective, social classes 
in Latin America are different than in other regions because the great majority 
of its population is not incorporated into the formal labor market (Portes and 
Hoffman 2003). In this way, informality, which does not fail to have an impact 
on institutionalized politics, comes to be one of the principal characteristics of 
Latin American societies. 

A 2002 study of the structure of social classes in Latin America has thrown
into relief that, of the total labor force, 45.9% were classified as informal proletariat,
compared to 23.4% as manual formal proletariat; 12.4% were nonmanual formal
proletariat; 8.5% were petit bourgeois, consisting of self-employed professionals
and technicians and small-business owners; 2.8% were elite workers, composed of
salaried university professionals and medium-sized business owners and workers
in public institutions; and 1.8% were capitalists, property owners, and managers
of large and medium-sized businesses (Portes and Hoffman 2003:46-49).

The data show that the dominant classes hardly reach 10% of the population,
while the subordinate classes comprise over 80%. They also register a notable
decline in the number of workers in the public sector, ruining whatever
optimistic image of the existence of middle classes was made popular
in the 1960s. Moreover, the differences between the different Latin American
countries are not very significant. Among the eight cases selected, the dominant
classes ranged between 13.9% in Venezuela and 5.0% in El Salvador. The informal
proletariat oscillated between 50% and 38% in those same countries (Portes and
Hoffman 2003). The data evidence a regional pattern that extends to all the
countries and relates to the themes of the following sections.

Although poverty is historically present in Latin America, it has
increased due to the economic slowdowns that occurred in the region throughout
the last two decades, interrupting a half-century of continuous growth and putting
an end to the dream of modernization. In 1980 poor individuals represented
40.5% of the population; in 1999, 43.8%. Certainly, the differences between
countries are extreme. In Uruguay, Chile, and Costa Rica, the percentage of
the population under the poverty line in 2001 was below 22%, while
in Argentina, Panama, and the Dominican Republic, it hovered around 30%.
The majority of the other countries showed values above 50% of the
population: Bolivia (61.2%), Colombia (54.9%), Ecuador (60.2%), Guatemala
(60.4%), Honduras (79.1%), Nicaragua (67.4%), and Paraguay (61.8%) (PNUD
2004a).

The responsibility of macroeconomic crises has been made manifest
because “not only is the actual life of the poor affected, but their capacity to
leave poverty” (Lusting 2004:61). However, these crises have a complex origin.
In the first place, we find the aspects of each country’s own institutions relative 
to the definitions of its economic policies. The confrontation between the state 
and the market plays the starring role. However, also relevant are the difficulties 
in putting into motion effective control mechanisms that would assure the 
development of secure rules and reliable organizations with a goal of eradicating 
distinct perversions stemming from cronyism, clientelism, privileged information, 
and other practices that lubricate rampant corruption. In second place we see 
exogenous factors derived from the historic weakness of many of Latin America’s 
countries when faced with international capital. 

Each combination of phenomena projects a shameful situation based on the 
bitter contradiction that allows for poverty to reside in the middle of extreme 
wealth. On the other hand, poverty has a triple component that links it to the 
rural, the feminine, and the ethnic, giving it a dimension that is not exclusively 
economic. The effects of poverty on these sectors of society acquire a structural 
component that has a singular influence on the development of social integration 
in Latin America. 


Forms of Social Inequality 


Although Latin America has reached the stage of legal equality of citizens, reality 
is very different. The public’s own opinion is that 82% of the poor and 77% of 
the indigenous population will almost never, or never, come to assert their rights 
(PNUD 2004a). Economic division, and therefore the abovementioned structure
of classes, as well as the empire of poverty and African or indigenous origins, are
patterns of quotidian segregation that, moreover, are strongly correlated in that
the poor are often identified as “non-white.” With respect to women, the forms of
inequality have been tempered thanks to the progressive incorporation of women
into the labor market, which rose from 28.8% of all women
working in 1990 to 33.9% in 2000 (PNUD 2004a). However, women continue
to earn significantly less than men.

A strong component of social inequality is also registered in the sphere of 
those aspects of the lives of individuals that affect the potential to develop one’s 
basic capacities. The precarious access to the vital necessities by the great majority 
of the population — which translates into high rates of malnutrition and infant 
mortality, illiteracy, the impossibility of universal sanitation and of living in a 
dignified way, and negligible assistance to the elderly — brings with it serious 
problems of social integration. 


Lack of Security 


The homicide rate for the year 2000 placed Latin America as the most
violent region in the world, with 25.1 homicides per one hundred thousand
inhabitants, compared to 22.2 for Africa or the world average of 8.8. Honduras
(154) and Colombia (70) have higher than average rates. Panama (2.0), Chile
(4.5) and Uruguay (4.6) are the three countries in the region with the lowest
homicide rates (PNUD 2004a). This level of criminality stems from the enormous
inequality in incomes, the impunity of offenses, and the existence of drug
trafficking networks that pervade society. Other structural aspects must be taken
into account, such as the massive rural exodus, which reached its high point in
the 1980s and generated a kind of marginal urbanization of broad sectors of
society; the widespread diffusion of firearms and the absence of effective policies
to control them; and the weakening of the state, which is seen as
incapable of exercising the monopoly on legitimate violence (Kurtenbach 2002),
with financial resources of scarcely 2.5% of the national budget dedicated to
the administration of the justice systems as a regional average for 2001.

Everyday violence is the principal social problem, and it has become stronger
because of the multiplication of the actors of violence. The state’s own security
forces, which frequently extort citizens, stand out among them. However, public policies meant
to tackle this violence are not articulated, with the resulting distancing of politics
from citizens’ concerns. For one in five interviewed in a 2002 survey (PNUD
2004b), crime and public security are principal interests not addressed in electoral
campaigns, ranking as the fifth-highest item of interest, after unemployment
(36.6%), corruption (23%), low wages (22.8%), and poverty/inequality (21%).
Responsibility for crime is nonetheless directly attributable to the actions of the
state, which should have the monopoly of legitimate violence. But
states in Latin America suffer a severe financial crisis in a region
where the payment of taxes is a rare phenomenon. Moreover, fiscal pressure
in 2000 was 15.1% of GDP (PNUD 2004b), something that contributes to
weakening the role of the state, which is seen as driven in recent
times by the empire of the market.


Multiculturalism 


The multicultural dimension, which heightens tensions in other parts of the world, 
seems explicitly absent from the main array of Latin American public problems, 
despite the multiethnic character of several of the region's countries, the advance 
of Protestantism that introduced religious pluralism, and the slow awakening 
of regional movements brought forth by recent policies of decentralization and 
the development of local electorates. The low-end estimates for the indigenous 
population of Bolivia reach 55.6%, Guatemala 49.6%, Peru 37.4%, Ecuador 
23.0%, and Honduras 11.2%. The average for all of Latin America is 7.4% (PNUD 
2004b). Although this multiethnic character is recognized in the state constitutions 
— which timidly reserve representational quotas for indigenous minorities in the 
congresses of Colombia and Venezuela — there is hardly any impact on everyday
coexistence. On the other hand, political parties of an indigenous bent only have 
relevance in Ecuador and Bolivia, channeling a very small part of the electorate. 
However, the use of vernacular languages by hundreds of thousands who have 
difficulty expressing themselves in Spanish frequently generates situations of 
scorn and disgrace because various public institutions do not afford indigenous 
peoples sufficient protections for their rights. 

Moreover, we must note the existence of eight indigenous groups whose region 
extends through at least two states, overflowing strict national borders. So, speakers 
of Quechua live in five countries; speakers of Maya, Aymara, and Garifuna in 
four; and of Nahuatl, Mapuche, Guajiro and Misquito in two (PNUD 2004b). 
The Latin American nation-state does not appear to be globally questioned save 
by minority sectors of Guatemala, Bolivia, Ecuador, and Mexico that are still 
politically irrelevant. However, it is evident that the challenge is there. 


The Challenges of an Incomplete Citizenship 


In Latin America, with minor national differences, it can be considered that 
political citizenship is practically achieved from the perspective of the vertical 
accountability that is exercised by the vote. The mechanisms of horizontal 
accountability, which suppose the development of the rule of law, have achieved 
somewhat less. Questions regarding the professionalization and independence 
of judges, the control of management of public functionaries, the use of 
mechanisms of direct democracy, and the elimination of practices of corruption 
and clientelism in the management of public affairs (PNUD 2004b) are still 
pending implementation owing to the fact that they must at times confront a
lack of public will to tackle them and, on other occasions, the heavy inertia of
historical legacy.

Although civil citizenship has been developed in doctrine and norms — with 
women and indigenous communities having full recognition of their civil rights, 
and having advanced a good deal in the area of human rights — reality is very 
different. De facto inequality based on economic, cultural, and ethnic motives, 
and the ruling violence that strangles security and frequently puts physical 
integrity in danger are daily constants in the face of which the functioning 
of justice administration does nothing because of a lack of means or capacity 
(PNUD 2004a). 

Social citizenship represents one of the most serious challenges confronting 
Latin America. The lack of basic social services is generalized to practically all 
sectors, coinciding with other deficits of civil citizenship since the groups most 
marginalized by one or by the other coincide. 

This panorama offers an incomplete citizenship with a slant that, for the first 
time, favors the instance of political citizenship, which has achieved important 
goals in the last two decades. However, the success of political citizenship may 
cease to be relevant if social citizenship does not advance and if civil citizenship, 
the recent success of which has been merely formal, does not become real. 
Not progressing in the pursuit of these two latter types of citizenship could 
conclude in the failure of political citizenship, since the framework on which 
electoral democracy is mounted could collapse as a consequence of generalized 
disenchantment turning into political apathy, and at times, frank hostility towards 
the democratic system. The success of the challenge depends on a fortunate 
combination of political will and effective institutions because, as O'Donnell 
(2003:125) has indicated, “[H]istory abundantly shows that a complete package
of these capacities and rights could never be established — not to mention 
implemented — simultaneously.” 


Notes 


1. See articles compiled in the central part of the October 2004 Journal of Democracy 
15(4). 


2. PNUD, which is used for some citations in this article, is the Spanish acronym for the 
Program of the United Nations for Development. Please refer to the bibliography. 


3. See data in Payne et al. (2003).


4. A study of 231 Latin American political leaders has shown that “political reform” is
the main problem faced in the strengthening of democracy, as claimed by 45% of those
interviewed. The gap with respect to other questions, such as “fight against inequality” (18%),
“economic policy” (10%), “fight against corruption” (9%), and “social policy” (8%), stands
out (PRODDAL 2002).


Patterns in Contingencies: The Interlocking of
Formal and Informal Political Institutions in
Contemporary Argentina


Abstract


This article explores how the interlocking of formal and informal political
institutions has affected the dynamics and performance of the Argentine democracy.
Key institutional features of the Argentine political system have been a competitive
form of federalism, loosely structured political parties that are not ideologically
unified, neopatrimonial practices, and clientelist exchanges. These features have
interacted with the pluralist political landscape resulting from the vertical and
horizontal constitutional division of powers. In combination with the distribution
of power resulting from electoral results at different levels of government, these
features have severely constrained the possibilities for coordination in and across
institutional arenas.


Introduction


When examining the recent social and political evolution of Argentina, observers
will immediately recognize several features that recur in its history. Perhaps
the first one lies in the inability to find a stable, widely accepted, and sustained
model for economic growth and social development. Over the last few decades,
Argentina has witnessed the implementation of very different socio-economic
programs, and although some of them have been temporarily successful, they
have eventually collapsed under adverse economic circumstances. The economic
policies conducted by the military dictatorship from 1976 to 1983 effected deep
social and economic changes, but failed to build a sustainable economic model
for the country. After the 1983 democratic restoration, President Alfonsín led an
unorthodox program of economic reforms. After some successes, this program
failed amid economic collapse and hyperinflationary episodes. Economic crisis
paved Carlos Menem's way to the Presidency after an early transfer of
power, one that took place after Alfonsín failed to draw Menem’s support for a
coordinated response to the crisis (Palermo and Novaro 1997: 113).

President Menem undertook an ambitious program of economic reforms 
that combined deregulation, widespread privatization, commercial opening, and 
foreign investment. Menem’s reforms were followed by vigorous economic growth 
and anti-inflationary success. However, as Menem’s second presidency reached its 
end, a number of economic and political circumstances (economic stagnation, 
high unemployment, high public debt and deficit, and the divisions within 
the Peronist party that pitted Menem against the former Vice-President Eduardo
Duhalde) made the Argentine prospects quite somber. In 1999, Fernando de la
Rúa, the candidate of the Alliance for Jobs, Education, and Justice (henceforth, the
Alliance), formed by the Radical Civic Union (UCR) and the center-left FREPASO,
attained victory in the electoral contest against the Peronist candidate, Eduardo
Duhalde. Nevertheless, in the years 2000 and 2001 the De la Rúa presidency
had to face an endless economic and political crisis, the latter mainly derived
from the strong divisions within the ruling coalition and the President's political
party, the UCR. At the end of 2001, in a context of deep economic crisis,
internal political conflicts, and rising social protests, the De la Rúa presidency
collapsed. President De la Rúa resigned in December 2001, and, in January 2002,
the Peronist Eduardo Duhalde was elected President by Congress. In
2003, the Peronist Néstor Kirchner was elected President, after Menem refused
to participate in the runoff election against Kirchner. Under the Duhalde and
Kirchner presidencies, Argentina saw the end of the fixed exchange-rate system 
and therefore the crumbling of one of the main pillars of the Menemist economic 
model. It also experienced a substantial reversal of the economic policies and 
strategies followed during the Menem years. 

The strong swings in the character and outcomes of economic policies in
Argentina have been matched by important political variations. Argentina has
had to face profound political and institutional crises, which have made political 
prospects very uncertain. This country has also known stable periods in which 
Presidents have enjoyed both political power and popular support. In spite of the 
difficult economic and political circumstances occasionally faced by the country, 
no severe legitimacy crisis of the democratic system as a whole has erupted, and 
no significant sociopolitical actors have moved to reject and subvert democratic 
institutions. Fitting Linz and Stepan's description of democratic consolidation,
in Argentina democracy has become “the only game in town” (Linz and Stepan
1996).

The resilience of democratic values indicates the importance of the political 
changes experienced by the country since the return to democracy in 1983. 
Perhaps the most important and positive development is the fading of the
military as an anti-democratic political actor and its final subordination to
civilian government, partly as a result of the experiences of terror and economic
failure that marked the 1976-1983 military dictatorship. Also important has
been the gradual erosion of the acute and politically destabilizing conflicts
between Peronism and anti-Peronism, a change to which different political
experiences greatly contributed (including the last military dictatorship and the
strong political and cultural turns triggered by President Menem). Yet, in spite
of these important changes, the Argentine democracy has failed to deliver a
stable economic and political environment. Its performance falls below what
other indicators, such as its per capita wealth, human development indexes, or
educational levels, would lead us to expect.

This piece aims at identifying some of the key political-institutional factors 
that have conditioned the dynamics and performance of the Argentine democracy. 
The first part of this article examines some of the main institutions affecting the 
workings of the Argentine horizontal (the president vis-à-vis the legislatures) and 
vertical (between the national executive and provincial governments) divisions 
of power. This section considers not only formal and constitutional rules, but 
also several informal rules governing political life in governments, legislatures, 
and political parties. This analysis pays special attention to several properties of 
the Argentine institutional configuration (such as noncooperative federalism,
loosely organized and decentralized parties, clientelism, and neopatrimonialism)
that, following O'Donnell's (1997) insights, have characterized the type of
institutionalization experienced by the Argentine democracy. These characteristics
have made coordination in and among some of the main political actors in 
the Argentine institutional system (the legislatures, the national and provincial 
governments, and political parties) costly and, under some circumstances, even 
impossible. 

The second part of this piece analyzes the performance of the Argentine 
democracy in several key dimensions of democratic governance. This section 
presents the evolution of several performance indicators, including economic 
growth, human development, democratic governance, and public attitudes, and 
tries to identify how the combination of persistent patterns and conjunctural
political configurations has favored or constrained the political ability to respond
to the economic and political challenges faced by the Argentine democracy. In 
so doing, this contribution aims at identifying some of the key political and 
institutional patterns affecting the dynamics and highly variable outcomes of 
Argentine political life. 


The Interlocking of Political Institutions 


A first important issue for the analysis of the Argentine political-institutional
framework concerns the competencies, powers, and relationships of the
President and the legislature. On first inspection, the Argentine President enjoys
comparatively moderate constitutional legislative (both proactive and reactive)
and partisan powers (Payne et al. 2002: 216). If we focus on partisan powers,
we observe that in the period analyzed by Payne et al., the President's governing
coalition has controlled a majority of the Lower Chamber seats 33.3% of the
time and a majority of the Senate 55.6% of the time, holding an absolute majority in
both chambers only 11.1% of the time. However, the average percentage of seats
of the governing coalition in the period analyzed by Payne et al. equaled 48.5 in
the Lower Chamber and 48.6 in the Upper Chamber (Payne et al. 2002: 214).
This indicates that on average governing coalitions have been extremely close to
achieving parliamentary majorities in each chamber.

In the Lower Chamber, this possibility has been enhanced by the significant 
presence of third forces. Thus, the average effective number of parties in the 
Lower Chamber has been 2.68 during the period analyzed by Payne et al. (2002: 
145). Overall, the partisan powers of Argentine presidents have been greater
than those of presidents in Uruguay, Brazil, and Chile. As for the
constitutional legislative powers, according to Payne et al., Argentine Presidents
fare very slightly below the Latin American mean (2002: 202). In Argentina,
Presidents have enjoyed legislative constitutional powers lower than in Brazil and
Chile, but slightly higher than in Uruguay (and much higher than in the US).
Finally, as for the non-legislative constitutional powers of Argentine presidents
(cabinet formation, cabinet dismissal, and censure), they are higher than the Latin
American mean, but lower than those of the Chilean and Brazilian presidents.
Overall, if we consider the constitutional and partisan variables addressed by
Payne et al., Argentine presidents seem to be in positions similar to those of the
presidents of Mexico and the US, if slightly more powerful than the latter due to their superior
legislative powers.

In-depth analysis of the workings of the legislative-presidential relationship
provides us with a more fine-grained picture of Argentine institutional life.
According to Mustapic (2000), the general image is one of presidential leadership
and legislative control. On the one hand, presidents have played a very active
role in legislative activity, and have enjoyed high discipline levels in Congress
(Mustapic 2000; Jones 1997). On the other hand, presidents have had to work
hard to get legislative support, even in their own political parties. The process of
coalition building led by presidents in the legislature has often been complex and
costly, not least due to the need to win the support of legislators belonging
to the President's party or governing coalition (Llanos 1998; Mustapic 2000).

In order to understand why the process of coalition building can be difficult
even for presidents supported by political forces that enjoy a solid position in
the legislature, it is necessary to consider two other critical Argentine institutions:
the political parties and party system, and the workings of the federal system.
Due to the characteristics of these two institutional realms, presidents have often
had to negotiate with legislators and political incumbents belonging to their own
political forces and governing coalitions. National party leaders, including the
presidents, have not had unified, hierarchical, and centralized party machines
at their disposal, but have had to face plural, decentralized, and often factional 
political organizations. The process for the selection of electoral candidates 
has strengthened this characteristic of Argentine party politics. Although the 
Argentine congress is elected with a proportional system and closed lists, the 
fact is that most of the times party candidates have not been selected by national 
leaders, but in primary elections held by militants and in some cases voters 
(Mustapic 2000). The internal pluralism of political parties has had more to do 
with the strength of personal and local leaderships (Mustapic 2000) than with 
ideological or programmatic divisions.’ This view of internal party politics is 
consistent with survey data on party politicians gathered at the Universidad 
de Salamanca under the direction of Manuel Alcántara in the 1999-2000 
period. According to this survey, a clear majority of the Peronist and Radical 
politicians saw their respective parties as constituted by several streams under 
different leaderships (see Table 1). Comparisons based on this survey between 
the Argentine and other Latin American parties reveal that the former ranked 
among the group of least disciplined parties in Latin America (Alcántara 2004: 
197-199). Furthermore, as shown by Table 2, in the two main traditional parties, 
internal debates on the selection of electoral candidates have not been likely to 
revolve around programmatic issues. Eighty-one percent of Peronists and 78.8% 
of Radicals pointed out that the selection of candidates hardly ever involves 
intense programmatic debates.⁴ 

By contrast, there is strong evidence of the important role that clientelist 
exchanges and patronage networks play in the internal life of political parties.⁵ 
Thus, Levitsky’s analysis of Peronism (2003) has shown the crucial importance 
of the provision of basic goods, like food and medicine, and jobs⁶ in the internal 
life of that political force. These exchanges were parallel to the lack of ideological 
commitment of Peronist militants to the economic course adopted by President 
Menem. The existence of such strong differences between the policies chosen 
by the Menem government and those favored by Peronist militants was made 
possible by the highly decentralized structure of the Peronist party.⁷ 

Finally, even if the success of this practice is far from uniform across territories, 
there is evidence that both Peronists and Radicals have made extensive use of 
direct material incentives (and public resources) in order to appeal to voters 
(Levitsky 2003; Brusco, Nazareno and Stokes 2004).⁸ In this neopatrimonial 
environment (insofar as it involves the presence of “patron-client linkages” 
and “the blurring of private and public purposes and practices within the 
administration”) (Hartlyn 1998: 14),⁹ access to state resources at the national, 
provincial, and local levels has become critical. This access facilitates both the 
internal balance of power between different party factions and the general ability 
of parties to compete for support and votes. 

Control of national executive power has certainly put strong premiums 
and selective incentives into the hands of presidents. 

Table 1. To what extent is the respondent's party integrated by several 
party currents following different leaderships? (Frequencies by 
political party) 

                                PERONIST PARTY   CIVIC RADICAL UNION   FREPASO 
1. Very different from what 
   happens in my party               4.8%               0%               0% 
2. [Somewhat different]              0%                 6.3%            23.5% 
3. [Not different]                  14.3%              18.8%            14.7% 
4. [Somewhat similar]               61.9%              37.5%            38.2% 
5. Very similar to what 
   happens in my party              14.3%              37.5%            14.7% 
DK/DA                                4.8%               0%               8.8% 
Total (N)                           21                 32               34 


Table 2. How often does the selection of candidates in the respondent's 
party involve intense debates on political programs? (Frequencies 
by political party) 

               PERONIST PARTY   CIVIC RADICAL UNION   FREPASO 
Hardly ever        81%               78.8%             50% 
Sometimes          19%               21.2%             30.6% 
Very often          0%                0%                8.3% 
DK                  0%                0%               11.1% 
Total (N)          21                33                36 


However, as a result of the pluralism, clientelism, and limited programmatic 
cohesiveness of political parties, presidents have also had to bargain skillfully in 
the legislatures in order to see their legislative programs advanced, even in those 
cases in which their parties or party coalitions have enjoyed legislative majorities. 
In this context, coordination within parties or political coalitions has sometimes 
been as difficult as interparty coordination. This was, for instance, the case 
during the Presidency of De la Rua, which was decisively marked by permanent 
disputes within the ruling coalition and the President’s party (Novaro 2002).¹⁰ 

The limited role of ideological and programmatic commitments in internal 
party divisions, and the importance of decentralized and clientelist leaderships 
are factors that have also affected the contents and reach of bargaining processes. 
In some instances, the lack of centralized control may have even given rise to 
vote buying in the legislature by branches of the national executive, as in the 
case of the bribes to Senators in order to find support for labor reforms in the 
year 2000.¹¹ 

In order to understand how the characteristics of parties may have affected 
national legislation and politics, it is necessary to consider how they have 
interacted with the institutions of Argentine federalism. Argentina's federal 
framework assigns provincial governments very important roles in the realms of 
social and economic policy. In addition to this, because of the highly pluralistic 
character of parties and the importance of provincial governments, provincial 
party leaders play very important roles in national politics when they are in 
charge of their respective provincial administrations. 

The importance of political decentralization becomes clear when we consider 
both the competencies and resources enjoyed by subnational actors. Provinces are 
the main providers of primary and secondary education, as well as of different 
types of social services, such as health programs and funds to fight poverty. 
They also play significant roles in the provision of infrastructure and employ 
substantial numbers of public service workers. In the late 1990s, subnational 
administrations accounted for nearly 50% of consolidated public expenditures 
and two thirds of public expenditures excluding pensions (Tommasi, Saiegh, 
Sanguinetti 2001: 150). 

A second critical feature of Argentine federalism derives from the 
economic flows that link the national and provincial governments. Although 
public expenditure is highly decentralized, provinces are highly dependent on 
transfers of resources from the national government. In 1999, on average, 
more than one half of the provincial expenditures were financed by the national 
executive (Tommasi, Saiegh, Sanguinetti 2001: 150-51). In that same year, 
in eight provinces, the national government financed more than 80% of the 
provincial expenditures (Tommasi, Saiegh, Sanguinetti 2001: 151). This situation 
is closely connected to the fact that the extraction of fiscal resources is mainly 
conducted by the national government. Transfers of resources are mainly 
based on the coparticipación regime, which establishes general mechanisms for 
the reallocation of federal funds extracted from taxes. The 1994 constitutional 
reform established that a new law should regulate coparticipación, but as of 2004, 
no new law has been passed, and coparticipación has been regulated by the 1988 
law and several other adjustments introduced during the 1990s. In addition to 
this, the federal government has had at its disposal different highly discretionary 
mechanisms to reallocate funds to the provinces based on economic and political 
considerations.¹² This situation has given rise to strong provincial demands 
and complex multilateral and bilateral political negotiations and disputes.¹³ 
Economically, these characteristics have not encouraged fiscal responsibility on 
the part of provincial governments; rather, in combination with other institutional 
and economic features (such as the high provincial borrowing autonomy and 
the high dependence of certain provincial economies on public expenditure and 
employment), they have facilitated the development of high provincial fiscal 
deficits and debts (Tommasi, Saiegh, Sanguinetti 2001).¹⁴ In some cases, this 
escalation of provincial expenses was caused by intense intra-party competition 
for national leadership.¹⁵ This trend aggravated the vulnerability of the Argentine 
economy at the end of the 1990s.¹⁶ 

All these characteristics identify the Argentine federal system as one of 
competitive rather than cooperative federalism. However, in this case, competition 
among the federation units does not aim at lowering regulations and taxes in 
order to attract capital, but at extracting resources from the federal level, largely 
in order to strengthen the political positions of provincial political leaders and 
their parties. This kind of competitive federalism has been reinforced by the 
previously mentioned characteristics of party politics. The pluralistic structure of 
parties, the importance of provincial party leaders in national party life (especially 
when they hold power in provincial governments), the clientelist use of public 
resources, and the weak role of ideological and programmatic factors¹⁷ have 
weakened nationwide intraparty coordination, encouraged competition among 
the regions in their relationships with the national government, and aggravated 
some of the vulnerabilities of the Argentine economy. 

To the extent that provincial leaderships have enjoyed privileged access to the 
Senate, they have made use of their institutional location to exact concessions 
from the national government. This scenario has been enhanced by the strong 
demos-constraining features, in Alfred Stepan's terms, of the Argentine federal 
system (2002: 341ff.), which result in the high overrepresentation 
of sparsely populated districts in the upper chamber, the symmetrical policy scope 
of both chambers, and the loose character (organizational as well as ideological) 
of political parties.¹⁸ 

Finally, it is worth reconsidering the position of Argentine presidents in 
light of the previous considerations on political parties and Argentine federalism. 
As indicated above, the Argentine formal institutional setting has neither 
precluded nor encouraged the development of stable presidential leaderships, 
but set the conditions and costs for the development of cooperation within 
and across institutional arenas. There have been periods in which 
presidents have managed to build sufficient support inside their respective 
parties and governing coalitions, in the national legislatures, and from provincial 
governments, and there have also been phases in which presidents have failed to 
find sufficient support in all these institutional arenas. These possibilities have 
been conditioned not only by the share of seats of the Presidents' parties in the 
Upper and Lower Chambers, but also by the internal politics of their political 
forces, as well as by the relationships between the national government and the 
provinces. 

These general institutional patterns have had important implications for the 
dynamics of democratic governance in Argentina. The next section turns its 
attention to this issue by examining the evolution and outcomes of democratic 
governance in Argentina, and by exploring the links between Argentine political 
dynamics and the institutional framework analyzed above. 

Graph 1. Evolution of the Human Development Index in Argentina, 
Brazil, Chile and Uruguay. (Data excerpted from PNUD: 
La democracia) 

Table 3. Annual per capita GDP growth (%). Excerpted from the World Bank 
Human Development Report 2004. 

                                 1975-2002   1990-2002 
Argentina                           0.4         1.7 
Brazil                              0.8         1.3 
Chile                               4.1         4.4 
Uruguay                             1.3         1.4 
Latin America & the Caribbean       0.7         1: 


The Performance of Argentine Democracy 


Although many of the characteristics of Argentine society are individually 
present in other Latin American societies, they have combined and evolved in 
a very specific, if not unique manner, in the case of Argentina. One important 
characteristic of this country lies in its comparatively high level of economic and 
human development. From 1975 to 2001, the Human Development Index has 
been higher in Argentina than in any other Latin American nation. Per capita 
purchasing power parity has also been consistently higher in Argentina than in the 
rest of Latin American societies. In the year 2002, purchasing power parity equaled 
10,880 US dollars in Argentina, versus 9,820 in Chile, 8,840 in Costa Rica, and 7,830 
in Uruguay.¹⁹ On the other hand, economic growth has been highly variable 
and, over the long run, lower than that of Latin America and the Caribbean and 
of its three most similar neighboring countries (Brazil, Chile and Uruguay). 
(See Table 3.) Moreover, although Argentine Human Development Indexes have 
remained higher than those of any other Latin American country, over the 
long run they have experienced comparatively modest improvements (from .78 
in 1975 to .85 in 2000). Among Argentina’s neighboring countries (see Graph 
1), the Human Development Index improved at a faster pace in Brazil (from .64 
to .77 in the same period), and Chile (from .70 to .83) than it did in Argentina, 
and very similarly to Argentina in the case of Uruguay (from .76 to .83). 
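
The comparison of Human Development Index gains can be checked with a few lines of arithmetic. The following is only an illustrative sketch in Python: the 1975 and 2000 values are the ones quoted in the paragraph above, while the names `hdi` and `hdi_gains` are hypothetical and introduced purely for this example.

```python
# HDI values (1975, 2000) as quoted in the text; nothing here is new data.
hdi = {
    "Argentina": (0.78, 0.85),
    "Brazil": (0.64, 0.77),
    "Chile": (0.70, 0.83),
    "Uruguay": (0.76, 0.83),
}


def hdi_gains(values):
    """Return the absolute HDI gain per country, rounded to two decimals."""
    return {
        country: round(end - start, 2)
        for country, (start, end) in values.items()
    }


gains = hdi_gains(hdi)

# List countries from the largest to the smallest improvement.
for country, gain in sorted(gains.items(), key=lambda item: item[1], reverse=True):
    print(f"{country}: +{gain:.2f}")
```

On these figures, Brazil and Chile each gained roughly twice as much as Argentina and Uruguay, which is the pattern described in the text.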

Argentina has experienced episodes of both rapid growth (for instance, in 1987, 
from 1991 to 1995, and in 2003, with per capita GDP growing faster than 5% each 
year) and very severe economic downturns (as in 1986, 1990 and 2002, with per 
capita GDP dropping by more than 8%). From 1991 to 2002, the Argentine rate 
of economic growth showed the highest variability in all of Latin America, with a 
standard deviation of 6.6, slightly higher than that of the also highly unstable 
economies of Uruguay and Venezuela (with standard deviations of 5.6 and 5.5, 
respectively), and much higher than that of Chile and Brazil (with standard 
deviations of 3.23 and 1.98). 

The comparative examination of data on economic inequality in Argentina, 
Brazil, Chile, and Uruguay reveals more troubling changes. As shown by Table 4, 
whereas in other countries inequality remained nearly constant (Brazil, Chile) or 
even decreased (Uruguay) in the 1990-2002 period, in the metropolitan Buenos 
Aires area (the only Argentine region for which the CEPAL reports values for 
the Gini index) inequality consistently increased during this time frame. From 
1990 to 2002, the evolution of inequality was worse than that of economic 
growth: whereas Argentine economic growth outperformed that of Brazil 
and Uruguay, inequality increased much more in the former case than in any of 
these neighboring countries. 

Table 4. Gini index for income inequality 

                     1990   1996   1997   1999   2000   2001   2002 
Gran Buenos Aires    .501          .530   .542                 .590 
Brazil               .627   .638          .640          .639 
Chile                .554   .553                 .559 
Uruguay              .492          .430   .440                 .455 

Source: Comisión Económica para América Latina y el Caribe. Panorama Social de América Latina 
2002-2003. 

These economic and social changes have not been unrelated either to 
the characteristics or to the evolution of Argentine politics. First, and most 
clearly, because economic dynamics have had strong implications for political 
life, affecting the workings of political institutions and the public attitudes of 
Argentine citizens. And second, because political and institutional variables have 
conditioned both the economic responses chosen by politicians and their degree 
of success. 

Some of the links between economic changes and political life can be grasped 
by comparing the democratic features and performance of Argentina to those of 
its socio-economically closest neighboring countries, Brazil, Chile and Uruguay, 
a purpose for which the World Bank indicators for democratic governance can 
prove very useful. Graphs 2, 3, 4 and 5 show the evolution of each of these four 
countries in four critical dimensions of democratic governance.²⁰ Values for 
each indicator reflect the percentage of countries worldwide that rank below the 
country of reference in each of the following variables: popular participation in 
the selection of governments and media freedom (Voice and Accountability), 
the likelihood that the government will be destabilized or overthrown by 
violence (Political Stability), the quality of public service provision (Government 
Effectiveness), the extent to which citizens trust the rule of law and abide by it 
(Rule of Law), and the perceptions that public means are used for private goals 
(Control of Corruption). These indicators are based on different types of survey 
responses. Even though, due to their large confidence intervals, these indicators 
must be approached with caution,²¹ they provide us with very useful 
leverage for interpreting the characteristics and implications of Argentine political 
dynamics. 

Graph 2. Evolution of the World Bank Index for Political Stability in 
Argentina, Brazil, Chile and Uruguay 

Graph 3. Evolution of the World Bank Index for Voice and Accountability 
in Argentina, Brazil, Chile and Uruguay 

1684 / Social Forces 83:4, June 2005 

Graph 4. Evolution of the World Bank Index for the Rule of Law in 
Argentina, Brazil, Chile and Uruguay (vertical axis: Rule of Law, scale 20 to 90; 
horizontal axis: Year, 1996 to 2002) 

Graph 5. Evolution of the World Bank Index for Control of Corruption in 
Argentina, Brazil, Chile and Uruguay 

Two main conclusions can be drawn from these data. The first one is the sharp 
drop in the Argentine values from 2000 to 2002, when the Argentine performance 
was lower than that of its neighboring countries in each of these indicators, and 
below the 40th percentile in all variables except Voice and Accountability. The 
second tentative conclusion relates to the overall poor democratic performance 
of Argentina, even before the obvious impact of the 2000-2002 economic and 
political collapse. In spite of its superior levels of per capita income and human 
development, in 1998 Argentina already showed the lowest values among 
these Southern Cone countries in the Voice and Accountability and Control 
of Corruption indicators, and also the lowest 1996 indicator in the Control of 
Corruption variable. In no single instance were the indicators for Argentina better 
than those for Chile and Uruguay. 

We can further explore the links between political-institutional conditions 
and democratic performance by considering how changes in the political- 
institutional and economic landscapes have affected political stability and crises. 
As we saw above, certain political-institutional features, such as federalism and the 
characteristics of parties and the party system, made cooperation both necessary 
and costly. In addition to this, the prospects of cooperation were affected by 
how political and economic conditions combined at each point in time. As 
previously stated, whereas under certain circumstances cooperation within and 
among institutions was possible and gave rise to stable and predictable political 
outcomes, in other situations, prospects for cooperation consistently worsened, 
and political life was anything but predictable. We can get some sense of both 
the diversity and challenges of different Argentine situations by examining the 
position of the ruling parties or political coalitions in other key institutions 
(the legislatures and the provincial governments), how internally coordinated 
the ruling parties or coalitions were, and whether governments had to face 
severe economic downturns during their administration. Table 5 presents, in 
an extremely simplified, dichotomous way, some of the main characteristics 
mentioned above in each legislative period. It informs us of whether the party 
or political coalition of the President was the first group in the Lower and Upper 
Chambers,²² whether a majority of provincial governments belonged to the 
President's party or ruling coalition, and whether internal coordination in the 
ruling party or coalition was high.²³ This table also indicates whether the country 
faced a severe economic downturn (at least a 1% drop in the GDP) during at 
least one year of the respective legislative period,²⁴ and whether the country 
experienced a severe political-institutional crisis, one in which constitutional 
provisions regarding the occupation and transfer of national executive power 
were ignored or suspended. These were the cases of the early transfer of 
presidential powers from Alfonsín to Menem in the face of the hyperinflationary 
crisis, and, even more clearly, of the resignation of President De la Rua.²⁵ 

Table 6. Percentage of respondents who agree with the idea that democracy 
is preferable to any other form of government. Excerpted from 
Informe-Resumen Latinobarómetro 2004. Una década de 
mediciones. Corporación Latinobarómetro. Santiago de Chile. 

                 1996   1997   1998   2000   2001   2002   2003   2004 
Argentina         71     75     73     71     58     65     68     64 
Brazil            50     50     48     39     30     37     35     41 
Chile             54     61     59     57     45     50     51     57 
Uruguay           80     86     80     84     79     78     78     78 
Latin America     61     62     62     60     48     56     53     53 





It is clear from this table that there is a strong contrast between the favorable 
political conditions over which Menem presided from 1989 to 1997 and the much 
less manageable political landscape faced by Presidents Alfonsín, at the end of 
his term, and De la Rua, particularly after the Alliance reached a dramatic crisis 
in the year 2000. Even if it is true that some of these (un)favourable political 
circumstances were conditioned by the skills and successes of these presidents, the 
fact remains that the specific characteristics of political scenarios at each legislative 
period severely constrained the effectiveness of governments. Furthermore, some 
of these conditioning elements were rooted in structural features of the Argentine 
polity, as was the case with the Peronist preponderance in both provincial 
governments and the Senate. Important differences between these situations 
notwithstanding, it is perhaps not accidental that the most difficult combinations 
of political and economic crises took place in 1989 and 2001,²⁶ when Presidents 
Alfonsín and, even more so, De la Rua were in very precarious political positions. 
In the last case, even if the severe internal difficulties experienced by both the 
Alliance and Radicalism were aggravated by some of President De la Rua's 
choices, it remains clear that the difficulties posed to intra- and inter-institutional 
coordination were very strong, particularly in the economic dead-end situation 
faced by the country. 

Under economic and political circumstances as adverse as these, both the costs 
and the constraints of bargaining dramatically escalated,²⁷ and both centralized 
and decentralized external solutions to the collective action problem faced by 
politicians became impossible.²⁸ External centralized solutions were not possible 
due to the political and institutional weakness of Presidents. And external 
decentralized solutions were not available due to the absence of common political 
values or identities, and the lack of norms of reciprocity linking the political 
groups prevailing at each institutional level. These circumstances certainly 
hampered the ability to successfully respond to the evolving economic crisis. It is 
also revealing that crises that erupted under scenarios favourable to the executive's 
ability to launch coordinated responses (such as the 1995 one) had much more 
modest political repercussions. 

Table 7. Percentage of respondents who are very satisfied or rather 
satisfied with the functioning of democracy in their country. 
Excerpted from Informe-Resumen Latinobarómetro 2004. Una 
década de mediciones. Corporación Latinobarómetro. Santiago 
de Chile. 

                 1996   1997   1998   2000   2001   2002   2003   2004 
Argentina         34     42     49     46     20      8     34     34 
Brazil            20     23     27     18     21     21     28     28 
Chile              2     57     32     33     23     27     99     40 
Uruguay           52     64     68     69     55     52     43     45 
Latin America     27     41     57     37     25     32     28     25 

Overall, political-institutional features have given rise, alternatively, to both 
periods of institutional cooperation and stability, and intense inter- and intra- 
institutional disputes and decisional deadlocks. At its best, the institutional 
system has favored economic development and political stability. At its worst, the 
difficulties in achieving political coordination have aggravated economic crises 
and triggered political instability. Certain features of the political-institutional 
system, such as the horizontal and vertical division of powers, have demanded 
cooperation. Yet other institutional features (like the heterogeneous and clientelist 
nature of parties, the strength of neopatrimonial practices, and competitive federalism) 
and political circumstances (like splits within the governing party and/or coalition 
and adverse majorities in the legislatures and regional governments) have made 
cooperation difficult and costly. 

Fortunately, the poor performance of the Argentine political institutions has 
not led to huge drops in the levels of public support for democracy. According 
to the Latinobarómetro surveys (see Table 6), from 1996 to 2004 most Argentine 
citizens preferred democracy to any other form of government, in spite of the 
profound political and economic crisis experienced by the country after the 
year 2000. Although support for democracy in Argentina decreased after 2001, 
it remained higher than in Brazil (by more than 20 percentage points), Chile, 
and the whole of Latin America. 
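
The size of these gaps can be verified directly from Table 6. The snippet below is a minimal sketch in Python, using only the 2001-2004 percentages reported in that table; the names `support` and `gap` are hypothetical helpers introduced for illustration.

```python
# Support for democracy (%), 2001-2004, as reported in Table 6.
YEARS = (2001, 2002, 2003, 2004)
support = {
    "Argentina": (58, 65, 68, 64),
    "Brazil": (30, 37, 35, 41),
    "Chile": (45, 50, 51, 57),
    "Latin America": (48, 56, 53, 53),
}


def gap(reference, other):
    """Return the yearly difference, in percentage points, between two rows."""
    return {
        year: ref - oth
        for year, ref, oth in zip(YEARS, support[reference], support[other])
    }


for other in ("Brazil", "Chile", "Latin America"):
    print(other, gap("Argentina", other))
```

Under these figures the Argentine lead over Brazil never falls below 23 points in the period, and the differences with Chile and with the Latin American average remain positive in every year.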

As shown in Table 7, public satisfaction with the actual functioning of 
democracy has remained at lower levels, approaching the 50% 
threshold only in the year 1998, and reaching its lowest value in the year 2002. 
The overcoming of the crisis has led to levels similar to those of 1996. Overall, 
the contrast of widespread support for democracy and low satisfaction with the 
actual functioning of institutions reveals that normative democratic views coexist 
with predominantly negative views on the functioning of Argentine institutions. 
Fortunately, the former have proved resilient even in the face of severe governance 
crises, as they did also in the last two years of Alfonsín's presidency (Palermo 
and Novaro 1996: 95). 


Conclusions 


In combination with other structural and contingent features (ranging from the 
distribution of electoral preferences across the Argentine society and territories 
to the structural and contingent problems faced by the Argentine economy), the 
Argentine institutional map has given rise to highly variable political outcomes. 
In some periods, coordination across institutions has been facilitated by the 
control exercised by politicians belonging to the same party, most often the 
Peronist party. This type of scenario has made presidential leadership possible, 
and given rise to stability and predictability, although at the cost of reinforcing 
the institutions of clientelism and neopatrimonialism. Under other political 
configurations, coordinated policies have proved either difficult or impossible, 
and, when combined with severe economic adversities, coordination failures have 
led to the threat of political-institutional crises. 

Depending on political and electoral circumstances, Argentine presidents 
have either enjoyed important powers and exercised strong leadership, or faced 
the risk of acute powerlessness. This extreme variability of circumstances and 
institutional capabilities has prevented the development of stable presidential styles 
and patterns of relationship between the President and other key institutional 
actors. The prevalence of majoritarian inclinations and expectations in Argentine 
political culture, which is based on actual but in no way permanent political 
experiences, may have also prevented (both within the national executive and 
among the ranks of the main opposition political forces) the development of a 
more consensual presidential political style.²⁹ 

Even more importantly, this high variability of political circumstances has 
hampered the stability and predictability of Argentine economic, social, and 
political life, facilitated the recurrent deterioration of many realms of social and 
political life, and then imposed very severe costs for the recreation of stable and 
predictable institutional frameworks in the different realms of social life. Highly 
unstable situations may have also increased the rates at which political actors 
discount future benefits and strengthened their short-term focus. This may have made 
it more difficult for them to achieve cooperative solutions in political crises. 
As shown by the democratic governance indicators, deep and encompassing 
crises also increased the precariousness of the rule of law, and reinvigorated the 
clientelist and neopatrimonial informal institutions that were already present. 

Clientelist and neopatrimonial practices may have also damaged the efficiency, 
efficacy, and quality of political representation. It has been pointed out that 
complex and pluralistic institutional settings giving rise to multiple winners will 
lead to policies closer to those preferred by the median voter and to higher social 
utility than those systems based on simple formulas and giving rise to single 
winners (Colomer 2001). The Argentine case does not directly contradict this 
expectation, but shows that informal political-institutional features must also be 
taken into account when ascertaining the effects of different formal institutional 
settings. When programmatic and ideological principles fail to facilitate intraparty 
coordination and link representatives to their voters, and when the provision of 
private and/or club goods within parties and to voters through clientelist and 
neopatrimonial exchanges becomes critical instead, institutional complexity 
and pluralism will not necessarily lead to policies close to those preferred by 
the median voter. Additionally, this kind of political-institutional scenario will 
increase the costs of coordination in and across institutional arenas, and make 
it more difficult to respond effectively to deep and threatening economic and 
political crises. 

Argentine political institutions present a very mixed record. They have 
contributed to making democracy a consolidated and resilient political regime, but 
have failed to give rise to a stable political environment, and have not contributed 
to the improvement of Argentine social, economic, and political conditions. 
It is far beyond the scope and possibilities of this piece to explore the specific 
institutional changes that could promote a web of virtuous circles, and give rise 
to a more stable and better functioning democracy. Yet, more likely than not, 
positive institutional transformations will have to stimulate cooperation in and 
across institutional arenas, as well as reduce the clientelist and neopatrimonial 
practices that have characterized the workings of Argentine political institutions 
in the current democratic period. 


Notes 


1. On the political and economic factors leading to the collapse of the De la Rua government, 
see the volume edited by Novaro (2002). 


2. For an analysis of the weakening of the presidential powers that resulted from the 1994 
constitutional reforms, see Payne (1997). 


3. The weak role of programmatic factors in political parties may result from both long-term 
factors and recent experiences. For an analysis of the long-term factors affecting the 
programmatic structuration of party systems, see Kitschelt and Zechmeister (2003). Among 
the short-term factors helping disorganize previous programmatic links between voters and 
their representatives might be Menem’s policy switch in 1989. On policy switches, see Stokes 
(2002). Institutional factors affecting the current of programmatic linkages in Argentina are 
addressed below in this article. 


4. For a comparison with other Latin American parties, see Alcántara (2004:195-97). 


5. On the differing nature and implications of the programmatic and clientelist linkages 
between voters and politicians, see Kitschelt and Zechmeister (2003). 


6. Consistent with this is the fact that a majority of the Argentine politicians interviewed 
in the previously mentioned Salamanca survey thought that in fact political connections 
mattered more than professional qualifications when selecting public employees (85% of 
the Peronists, 68.5% of the Frepaso members, and 45.5% of the Radicals). 


7. A good example of the decentralized and provincial nature of the Peronist party is the frequent 
failure of Menem’s attempts to impose policies and candidates on local and provincial Peronist 
branches (Levitsky 2003: 177-81). 


8. The decentralized and factional character of parties has been strengthened by the selection 
of candidates through internal elections. In turn, success in internal elections has depended 
on access to resources and support by militants, thus enhancing the importance of patronage 
networks and clientelistic exchanges (Levitsky 2003: 178). 


9. The centralization of political power is the only typical characteristic of neopatrimonialism 
that is not present in the informal practices described above. On the features and implications 
of neopatrimonialism for democratic quality, see Hartlyn (1988: 15-6). 


10. Novaro (2002: 97) indicates that conflicts within the ruling Alliance were much more 
acute than those between the government and the Peronist party. Domingo Cavallo, minister 
of Economy during the final crisis of the De la Rua presidency, argues that Radical politicians 
were largely responsible for the institutional collapse in December 2001. See 
http://www.cavallo.com.ar/papers/golpe.html. 


11. The alleged buying of votes (both Radical and Peronist) in the Argentine Senate in the 
year 2000 triggered the political crisis that led to the resignation of the Vice-President Carlos 
Alvarez. As a result of these accusations, several Radical senators renounced their seats. See 
Novaro (2002: 85 and the following). 


12. See for instance the use that the Menem government made of financial transfers to 
regions in order to facilitate the implementation of his economic reforms (Gibson and 
Calvo 2001). 


13. See Cetrangolo (2003) and Tommasi, Saiegh and Sanguinetti (2001). See also Bonvecchi 
(2002: 127-28) on the negotiations between the national government and the provinces 
during the De la Rua government. 


14. Interestingly, whereas from 1991 to 2001 the percentage of Argentine workers employed 
by the national sector declined from 5 to 3.2%, the percentage of workers employed by the 
provinces rose from 8.6 to 9.3%. See Krueger (2002). 


15. This was particularly the case of the conflict between the Governor of Buenos Aires, 
Eduardo Duhalde, and President Menem after 1997. Duhalde's race to block Menem's 
reelection and lead Peronism in the 1999 presidential contest led to very strong increases in 
the provincial deficit. Increases in public expenditure were connected to the need to expand 
support through the use of public resources for clientelistic exchanges. On these issues, see 
Corrales (2002). 


16. For an analysis of how political factors aggravated and prolonged the Argentine economic 
crisis, see Corrales (2002). 


17. As Riker (1964: 131-35) and Stepan (2001: 354-58) have pointed out, the characteristics 
of parties and party systems have clear effects on the properties and dynamics of federalism 
(centralizing vs. peripheralizing in Riker, and demos enabling vs. demos constraining in 
Stepan). 


18. On the character and implications of the overrepresentation of certain districts in 
Argentine federalism, see the volume edited by Calvo and Abal Medina (2001). 


19. Figures excerpted from the World Bank Human Development Report. 


20. This section does not consider the World Bank indicator for regulatory quality because 
it overly favors a specific normative and technical model for economic policies that praises 
market-friendly policies and rejects what it considers as excessive regulations. 


21. See Kaufman, Kraay and Mastruzzi (2004). 


22. This way to qualitatively present the distribution of seats in both chambers can be more 
informative than indicating whether the President’s party enjoyed an absolute majority of 
seats. Due to the multi-party character of the Argentine party system, it is quite possible 
to fall below the 50% threshold and be in a very good parliamentary bargaining position 
vis-a-vis third political parties. 


23. This is the only variable that depends on strictly qualitative assessments. Internal 
coordination problems were acute and dramatic during the De la Rua Presidency. They 
were also very serious at the end of Menem’s second term, when the diverging strategies of 
Menem and Duhalde necessarily clashed. 


24. GDP drops in years in which elections took place have been assigned to the legislative 
period about to finish. For instance, the 1989 GDP drop has been assigned to the 1987-89 
period and not to the 1989-91. 


25. The resignation of Vice-president Alvarez in 2000 certainly involved a deep political crisis, 
but it did not entail the transgression of key constitutional rules. 


26. Certainly, the Argentine economic crises responded to domestic economic vulnerabilities 
and international fluctuations (like the tequila crisis in 1995 or the Russian and Brazilian 
crises in 1998 and 1999). However, the intensity, duration, and resolution of economic crises 
were also conditioned by political and institutional variables. Thus, the inability to politically 
respond to the crisis in an intertemporally consistent and institutionally coordinated manner 
must have also affected the intensity of the crisis and the prospects for economic recovery. 
Furthermore, it is also the case that the form that crises took was already the result of political 
circumstances and designs. This was clearly the case of the crucial convertibility system 
established in 1991 under the Menem Presidency. The self-tying character of convertibility 
(Palermo and Novaro 1996: 288 and the following) was also a response and a solution to 
political circumstances. It would in turn decisively condition the character and implications 
of the economic crisis under the De la Rua presidency. For an in-depth exploration of the 
complex and interactive relationships between economic conditions and politics during the 
Alfonsin and Menem years, see Palermo and Novaro (1997). 


27. For an exploration of the transaction costs of Argentine political actors during the De 
la Rua presidency, see Bonvecchi (2002). 


28. On the different types of solutions to collective action problems, see Taylor (1987: 21-24). 


29. On the tension between the actual Argentine political-institutional framework and 
the majoritarian legacies and inclinations of the Argentine political culture, see Palermo 
(2004). 


Women and Politics in Latin America: 
Perspectives and Limits of the Institutional 
Aspects of Women's Political Representation* 


ESTHER DEL CAMPO, Complutense University of Madrid 





Abstract 


This article offers a general overview of issues related to the political 
representation of women in Latin America. Specifically, it analyzes the advances 
made in the representation of women in politics during the 1990s through a 
descriptive analysis of national cases from an institutional perspective. Despite 
the importance that structural, social and psychological factors have in the 
different accounts of women's representation in developing countries, the emphasis 
here is on the effects of political-institutional reforms. The results of these 
reforms have varied considerably between and within countries. The policies 
implemented to increase female representation during the 1990s have been 
incoherent and disconnected. We can therefore deduce only that governments 
intended to enhance the status of women, while numerous political, institutional, 
economic and social constraints continue to reproduce inequalities. 


Some Initial Considerations 


During the past 20 years, the debate around the political representation of 
women has become a frequent theme of studies in sociology as well 
as in political science. Several important empirical studies, based on solid 
comparative quantitative analysis of a good number of countries, have 
been carried out. Authors have highlighted that the factors affecting women's 
representation at the national level are quite distinct from the economic, social 
and cultural status of women. Especially significant for Kenworthy and Malami 
(1999) is the structure of the electoral system, in particular proportional 
representation and closed lists. Norris, Rule and Paxton have found that multi-member 
districts seem to be more favorable to female representation (Norris 1985, 
1987; Rule 1981, 1987, 1994; Paxton 1997). Two of these scholars have also studied 
the effect that the ideological position of the party in government has on women's 
representation: if the governing party is on the left, the possibilities for women's 
representation increase (Norris 1987; Rule 1987). Also, the timing of women's 
suffrage (Rule 1981) and cultural attitudes toward the role of 
women in politics (Paxton 1997) both affect the quality of female representation. 
Many of these studies have centered on explaining "the gender gap" that exists 
between Latin America and the more industrialized democracies, while others 
that incorporate developing countries tend to insist on 
structural, economic and social issues (Matland 1998; Moore & Shackman 1996; 
Oakes & Almquist 1993). 

This article will clarify the principal institutional developments that have 
modified patterns and structures of representation of women in Latin America 
during the 1990s. As has been reiterated in recent years, the tendency to reduce 
the explanation of political processes to social, economic or cultural factors 
meant that the institutional and organizational forms of political 
life had barely been studied and were treated as having no effect. However, 
because political institutions have historically structured the 
incorporation of women into positions of representation and political 
decision-making, they also constitute a plausible alternative explanation for 
Latin-American countries. Finally, two general points need to be made clear. First, Anglophone and 
Francophone countries have not been included in this study because historically 
they have maintained institutional frameworks very different from the Spanish and 
Portuguese ones. Second, although a feminist perspective or a Gender 
Studies focus is not adopted in this paper, the relevance 
of the theoretical renovation inside Sociology should be acknowledged; it has meant, in Walloy's (1992) words, 
"the decline of the differences in the significant structure of power through an 
analysis too much focused in the fragmentations of these analysis." This means that, 
regardless of the importance that postmodern/poststructural analysis has had, we 
currently find ourselves facing a "turn towards agency" (Roseneil 1995) that 
coheres at some point with the institutional concern of this text. 

At the same time as the public agenda has incorporated the issue of gender 
inequity, international organizations have also focused on gender 
issues in Latin America. At the Fourth World Conference on Women: Action for 
Equality, Development and Peace, held in Beijing in 1995, a commitment was 
made to address the issue of gender equity. The leaders of the conference required 
that signatory countries monitor and evaluate the advances made in the representation 
and participation of women in their respective countries. Thus, Latin-American 
countries have in the last few years developed regular statistics on 
female inclusion at all levels of decision-making, from the governmental to the 
non-governmental,¹ taking as a starting point the indexes of participation and 
leadership worked out by the Women and Development Unit of CEPAL in 
collaboration with the Gender Studies Area of FLACSO in Chile. 

This essay will use part of the data compiled by CEPAL to evaluate 
participation, leadership, and gender equity in Latin America and the Caribbean. 
It is useful to differentiate between indexes of “political will” and indexes 
of “result.” The former are measures of the degree to which 
governments commit to gender equity, and they provide information about the efforts that 
are being deployed to this end. Hence, indexes of “political will” give 
qualitative information and are important to this analysis because they capture the 
creation of agencies for the advancement of women, the passing of bills 
on gender equity, the drafting of norms at the regional and 
local levels, and the formulation of specific programs. By contrast, indexes of 
“result” measure the consequences of these processes and interventions. 
In the political arena, indexes of “result” were derived 
from the results of presidential, parliamentary and municipal elections, while indexes 
of “political will” were derived from the establishment of government mechanisms 
for the advancement of women at different levels of government, including the 
provincial and local, from programs in the ministries and secretariats of state, 
and from the development of specific plans and policies for equal opportunities 
for women. At the parliamentary and political party level, indexes of “political 
will” were derived from the creation of commissions specialized in legislation 
that affects women, as well as the approval of laws and norms for positive action, 
such as the allocation of slots, in national legislation and in the internal 
statutes of parties. 


The Construction of Women as a Political Subject in Latin America 


In Latin America, the mode of exchange between the political-state sphere 
and a weak civil society, whether in terms of participation, pressure or 
cooptation, posed an obstacle to the constitution of actors independent of the 
State and political parties. Because of this, political parties and other actors 
such as unions and student movements predominated in the public domain for 
a good part of the twentieth century; in all of them the participation of women was very 
scarce. This scarcity resulted not only from the late incorporation of 
women into public life but also, and especially, from the cultures and 
organizational practices of these institutions themselves. When women were incorporated into 
political institutions, it was in isolated cases that corresponded to personal 
experiences of political and social leadership and not to the incorporation of 
women as a collective force or representative body in the political system. 
Particularly interesting are the conclusions reached by Elsa Chaney in 
the 1970s, in a study based on a survey of 81 leading women in Peru 
and 86 in Chile who held political and administrative positions at the national or 
municipal level. First, the author shows that when women entered 
government, they and their male counterparts almost invariably seemed to consider 
their intervention in public affairs as an extension of their family role. In this 
sense, “an officer of feminine sex generally defines herself as a kind of super-mother 
handling with the necessities of its large family in the large ‘house’ of the 
municipality or even the nation” (Chaney 1983:39). Second, she considers 
that the active political intervention of women in Latin America has always 
occurred in moments of crisis in the history of their nations. It is a participation of 
entry followed by retreat. “Sometimes the novelty of the right to vote and to occupy 
political positions pushes women into politics, even in countries with political 
systems too old; but, I repeat, they abdicate when the excitement declines or when 
they find that the political game is very harsh” (Chaney 1983:45). 

It is also important to stress that almost all Latin-American countries initially 
restricted the right to vote and to be elected to educated women. 
In societies with a significant indigenous population, this fact alone 
translated into a real limitation for an enormous proportion of women, especially 
in countries such as Brazil, Guatemala, Bolivia and Ecuador. 

The authoritarian regimes that predominated during the 1970s in Latin America 
slowly incorporated neo-liberal components of the new model of economic 
development. In a context where civic and political freedoms were reduced or 
eliminated, the State shrank considerably and ceased to provide 
important services to society. Social expenditure was cut, and underemployment 
and marginality increased. At this juncture, the options adopted by men and women 
differed considerably. While men tended to insist on the stability and the payment 
of wages (whether individually, clandestinely or through workers' organizations 
tolerated by the military), women invented all kinds of organizations and 
collectives, such as “popular pots” and consumption cooperatives, which, with 
considerable efficiency, combined an increase in their allowances with a decrease in 
the costs of consumption. More than survival strategies, these new practices allowed them 
to consolidate (sometimes without clear intent) locally based women's 
organizations. This new organizing, along with the emergence of other groups 
and social movements that expressed strong claims associated with the quality of life 
of the population (neighborhood organizations, feminist movements, human 
rights movements, environmental movements) and in which women formed 
the majority of participants, permitted the consolidation of a strong presence of women in both 
the public and private spheres, and extended women's citizenship to issues that 
transcended the mere electoral exercise. Women demanded the recognition 
of a presence that was in fact already being produced and, at the same time, claimed 
the institutionalization of political rights (the right to vote and eligibility for 
positions in the public sphere of decision-making), civil rights (the legal status of 
married women, equality before the law, family), economic and social rights (access 
to education and health, nondiscrimination in employment, family benefits, access to 
credit and to leisure and culture), and even reproductive rights (the right to decide freely 
and responsibly the number and spacing of children, and access to 
information, education and the means to do so). 

The return of democracy, frequently after an important social uprising against 
dictatorships and authoritarian regimes in which women and women's movements 
achieved special prominence, signified the return to the scene of traditional 
actors, mainly traditional political parties. But this democratic reconstruction 
did not merely signify a reconstitution of the classic frames of participation in these 
societies. In part, the steps taken in the direction described above provoked at least 
two substantial changes in the political arena. The first has to do with a new 
conception of territory (fundamentally in the municipal context) as a particular 
context for citizen participation; this process was reinforced by processes 
of political-administrative decentralization, which in some cases had been under way 
since the 1970s but were deepened with the re-democratization reforms of 
the new Latin-American regimes in the following decade. The second has to do with an 
increasing recognition of the presence of women in the diverse collective or public 
spaces, as well as of its complexity. 

After the transitions to democracy, a mode of participation was established 
in Latin America that combined, on the one hand, the predominant 
classical participation structures, from which women were still largely 
excluded, and on the other, new channels of expression in which women's 
incorporation found increasing space. To study the political and social 
participation of women in Latin America, it is certainly important to distinguish 
the formal political systems of power (at the national level as well as at 
the regional or local one) from the informal systems of participation and power 
in the social sphere, at the community and neighborhood levels. This 
article will center on the analysis of the former, without denying 
that the contribution of women at the social level is indisputable. 


The Political Participation of Women in Latin America in the 1990s 


Although abundant research has been completed on electoral behavior and 
the question of the women's vote, two principal features still deserve attention: 
first, a greater electoral abstention among women who have the right to vote 
(due in part to their absence from the formal political process), and second, 
the more conservative vote of the female electorate. Both facts seem to 
be confirmed in Latin America. Even in countries with traditionally high 
levels of abstention across the whole electorate, the rate at which women do 
not vote is greater than that of men (although this difference in 
participation probably has more to do with the political culture of these societies and 
with the process of historical consolidation of democratic institutions). Equally 
true is the fact that women tend to support conservative parties. 

However, the large differences between the political participation of men 
and women in Latin America are not limited to the electoral context; 
they also encompass the inequality suffered by Latin-American women 
in gaining access to political positions, whether in the State or in the 
leadership of political parties. Thus, the real problem lies in the exclusion 
or subordination of women's representation in the structure of political 
power, due to the coexistence of what Garretón (1990) calls "the excluding 
circle" (el círculo excluyente) and "the vicious circle" (el círculo vicioso). 
The first of these points to the exclusion of women from the political game, 
in the sense that a political class, largely made up of men, feels threatened 
by the incorporation of women into leadership or representation positions that 
are already scarce. The second, the "vicious circle," tends to extend and 
reproduce the first. “For these reasons, women have few opportunities to 
participate, either in a subordinate way or in the directive positions. If they 
do not have many opportunities, they lose interest. If they lose interest in 
participating, they don't pressure the government to obtain changes in the 
current situation of participation, which means that no new opportunities are 
being generated, and with it the vicious circle is sealed” (Garretón 1990: 25). 

Latin-American women entered the executive branch through 
their designation as ministers. Chile was the first country to have a woman as 
a minister of state, in the Justice portfolio (1952), and Argentina and Paraguay were 
the last (1989).¹¹ This process took place rather slowly, and it was not 
until 1989 that a woman was elected President (Violeta Chamorro in Nicaragua), 
although two women had previously held the presidency without the mediation of an electoral 
process (Estela Martínez de Perón in Argentina between 1974 and 1976, and 
Lidia Gueiler in Bolivia, from late 1979 until 1980). Women had entered the 
judicial branch earlier, since numerous women had begun careers in the 
justice system by the time the right to vote was obtained (even though their access to the 
Supreme Court was barred for many years and still is in many countries of 
the region). 

In the case of Ministries and Secretariats of State, based on country 
information for 1999, the presence of women varies between 7% and 20% of the 
total positions (in Ecuador it was about 30%, while the federal government of Brazil 
was the lowest, with 5% of the cabinet positions occupied by women). 

However, it must be highlighted that in most cases the designation of women, 
as well as men, to Ministries or Secretariats is related to their involvement with 
the political parties that make up the government. This makes political 
parties the principal mediators in the access of women to this sphere 
of state power. Briefly, in the available data on 
Latin America in the 1990s, no relation can be detected with the political 
orientation of the parties in power. Thus, for example, in countries without 
women's representation at high decision-making levels, governments are not 
distinctively conservative or social democratic in character. 


Table 1. Women Appointed Ministers or Secretaries of State, Based 
on the Most Recent Available Data (sorted by percentage, in 
decreasing order) 


COUNTRY               YEAR   TOTAL   WOMEN   PERCENTAGE OF WOMEN 
                                             IN GOVERNMENT 
ECUADOR               1999    14       4     28.5 
PANAMA                1999    13       3     23.1 
HONDURAS              1999    17       3     17.6 
COLOMBIA              1999    18       3     16.6 
HAITI                 1999    18       3     16.6 
CHILE                 1998    19       3     15.8 
COSTA RICA            1999    14       2     14.2 
PERU                  1999    15       2     13.3 
MEXICO                1998    17       2     11.8 
DOMINICAN REPUBLIC    1999    17       2     11.7 
ARGENTINA             1999     9       1     11.1 
PARAGUAY              1999    10       1     10.0 
GUATEMALA             1999    12       1      8.3 
URUGUAY               1999    12       1      8.3 
BOLIVIA               1997    14       1      7.1 
CUBA                  1999    28       2      7.1 
VENEZUELA             1999    14       1      7.1 
BRAZIL                1999    24       1      4.1 
EL SALVADOR           1999    14       0      0.0 
NICARAGUA             1999    13       0      0.0 


Source: ECLAC, 1999. 


At the level of vice-ministries or sub-secretariats, women's representation seems 
to be somewhat higher than at the level of the Ministries. However, the situation 
differs considerably from one country to another. While in Haiti the proportion of 
women by the end of the nineties was 60%, in Ecuador it barely reached 2.9% 
(CEPAL, 1999). Female representation at the level of embassies is very limited, 
and the same is true at the intermediate levels of government (surprisingly, by 
the end of the nineties, among the federal countries, Argentina, Brazil, Mexico 
and Venezuela, only Brazil had a woman as governor, in 1998). The frequent 
heterogeneity among countries, levels of government, and 
periods, added to the recurrent incoherence (countries with a high percentage of 
women's representation at some levels and a very low one at others), makes it difficult 
to find an explanatory pattern for the process of incorporation of women into the 
institutional structures of Latin-American States. 

The entry of women into national parliaments has been particularly slow, 
and even in countries that have incorporated women into parliament, the actual 
percentages of women's participation remain low. This is 
generally true of all countries, perhaps with the exception of the northern countries 
of Europe, and seems to be independent of the socioeconomic development 
of these societies (Astelarra 1990: 16). 

Among countries with a unicameral parliament, Cuba stands out in the nineties 
with 27.6% women, while Panama has only 8.3%. In countries 
with a bicameral parliament, participation in the lower chamber seems to be 
greater than in the upper chamber. In Mexico, 14.8% of senators were women 
(1999), while in the Dominican Republic there was not a single woman in the Senate 
in 1990. At the beginning of 1999 in Argentina, 27.6% of the deputies were 
women, while Paraguay (with 2%) and Haiti (with 3.6%) are found in the last 
positions. It is interesting to highlight the profile of the women gaining access 
to the parliamentary elite. A large proportion of them, even greater than 
in the case of male deputies, come from a high social class (more than 58% 
of Latin-American deputies report that they come from the upper or upper-middle 
class), enjoy a privileged and prosperous position (as reported by 
73% of women parliamentarians), have family members with political 
backgrounds or dedicated to political activity (more than 55%, and in 
23% of the cases more than two relatives, all of them men), and have a high level of 
academic education. 

In the last few years, national parliaments have formed parliamentary 
commissions focused on legislative issues related to women. The names of the 
commissions and their creation dates indicate their focus. The 
Bicameral Commission for Women's Rights in Argentina (1995), the Committee 
for Gender and Generational Issues in Bolivia (1997), the Commission 
for Family in Chile (1991), and the Commissions for Equity and Gender in the 
Venezuelan assembly (1997) seek to assist and promote women as well as to correct 
the ideological balance and the current distribution of power in each national parliament. These 
commissions have proved very important as mechanisms to legitimize 
legislation that addresses the problem of women's representation and for the 
specialized handling of the different parliamentary initiatives, among them 
special laws designed to allocate slots for women. However, these committees 
only pave the way towards correcting the gross imbalance of representation in 
the legislative branches of government. 

In general, the judicial branch has been more open to women's participation. 
Nevertheless, it shows great stratification, with women concentrated at the 
lower levels of the justice system. In nine Latin-American countries a woman 
has never been a Magistrate of the Supreme Court of Justice, while in the 
lower courts women occupy more than 60% of the positions. However, 
at these levels specialization by sex can also be seen, with more women present 
in courts dealing with juvenile and family matters and more men in labor and criminal ones 
(CEPAL-UN 1995). 

Political parties constitute one of the most important barriers to women's 
access to formal political power. Even though women have tended to 
be active at the entry level of political party activity, they very rarely exceed 
25% of the leadership positions of parties. This imbalance is reflected 
in the nomination of candidates for elected office and in 
the composition of ministerial cabinets. Normally, the presence of women 
tends to be greater in new parties or in parties undergoing renovation, and it is often 
independent of the ideological character of these parties. 

One might expect women's representation to be lower in parties on the right 
of the political spectrum and to increase toward the left. The 
Chilean case shows the contrary: the proportion of women in parliament, 
similarly low in all parties, is relatively indifferent to the parties' ideologies 
and is more closely related to their founding and subsequent evolution. 
Thus, in traditional parties with a more settled political trajectory, such as 
the Christian Democracy (Democracia Cristiana) or the Socialist Party 
(Partido Socialista), with deep cleavages and inter-party subcultures, the number 
of women is lower. However, in parties created more recently, such as the 
Party for Democracy (Partido por la Democracia), but also in conservative groups 
such as National Renovation (Renovación Nacional), the percentage of women is 
greater (Del Campo 1995). 

The presence of women in the national leadership bodies of political parties
varies considerably. It ranges between 3% and 50%, with an average of 20%, even
though women constitute between 40% and 50% of party members. The highest rate
of feminine presence at the head of a party is in the Christian Democratic Party
(Partido Demócrata Cristiano) in Paraguay (in 1997, the head of the party was
partitioned); close behind are the Humanist Party (Partido Humanista)
of Chile, with 42.8% of women in its leadership (1998), and the Christian Social
Unity of Costa Rica (Unión Social Cristiana) (PUSC), with 40% (in 1998).
Situated around 30% are the Party of Workers (Partido de los Trabajadores)
(PT) in Brazil (1998); the Party for Democracy (Partido por la Democracia)
(PPD) in Chile (1998); the Party for the Democratic Revolution (Partido por
la Revolución Democrática) (PRD) in México (1997); the National Encounter
Party (Encuentro Nacional) and the Februarist Revolutionary Party (Partido
Revolucionario Febrerista) in Paraguay (1997); and the Reformist Social Christian
Party (Partido Reformista Social Cristiano) in the Dominican Republic (1993).

One way to increase the participation of women in electoral politics is through
proposals for the allocation of “minimum quotas” of female presence
in leadership positions and on the lists of candidates for elected office.
In 1997, the World Inter-Parliamentary Conference discussed the
allocation of quotas for legislative positions and established that these should
be fixed around 30%, since it has been shown (Dahlerup Drude 1985) that
when the representation of a minority group in political institutions
(parliament, parties) rises to a rate close to 30%, the group acquires the
capacity to raise problems on public agendas and to establish alliances that
will favor it.

Two types of allocation of slots by gender are currently applied in Latin
America: party rules for allocating slots and national laws for the
allocation of slots. The latter have proved much more effective than
the former for two main reasons: first, they apply to all parties, not just
a few; second, laws for the allocation of slots are enforced by the
bureaucracy or the judiciary, while the application of party rules depends on
the interest, capacity and circumstances of the leaders of those same parties
(Jones 1997).

The first law allocating slots by gender in national legislative elections
was approved in 1991 in Argentina. The law for allocation of slots (Law 24012)
contains two important requirements: 1) that a minimum of 30% of the candidates
on the parties' closed lists in the 24 electoral districts be women; and 2) that
their position on these lists imply a real possibility of being elected.
Lists from parties that do not comply with these requirements are rejected.
Until 2001, these slots had been applied only to the Chamber of Deputies. This
has signified a fundamental shift in Argentinean politics. During the period
1983-1991, only 4% of deputies were women; in the election of May 1995 the
figure increased to 28%, and in the following elections it rose to 34%,
positioning Argentina tenth in the world ranking of female
representation in national parliaments. The percentage of women in the Upper
Chamber has gone from 5.6% (without the allocation of slots, in 1995)
to 33.3% in 2001. Currently thirteen countries in the region have
a law for allocating slots, with quotas varying between 20% and 30%;
generally, they operate through lists of candidates and imply a
real guarantee of equal opportunity to take part in political competition. In
some cases, like the Argentinean one, the law secures not only a slot but also
the electability of the slot, so that quotas become effective in practice. In
various countries, quotas are progressive and tend towards a parity system.

The allocation of slots within political parties has been debated in different
ways. In Uruguay, the Socialist Party of Workers (Partido Socialista de los
Trabajadores) approved it in 1991, while the Wide Front (Frente Amplio) voted
to reject quotas as a means to increase feminine participation. In El Salvador,
the Mujeres-94 platform proposed that the political parties contending in
the elections of 1994 adopt a quota of 50% of positions of power
for women (Moreno 1997:50). Costa Rica and Mexico have approved laws
that encourage, although they do not require, political parties to increase the
political participation of women. In both cases, percentages of feminine
representation have increased significantly: in Costa Rica, in the 2002
elections, the percentage of women in the Lower Chamber reached 35.1%, while in
Mexico it was 22.6%. In 1995, Brazil adopted a law for the allocation of slots
for the 1996 municipal elections, and Bolivia and Peru followed suit at the
beginning of 1997. The Christian Democratic Party (Partido Democrático
Cristiano) of Chile established the obligation to allocate the recognized
percentage of slots for women; its criterion was not limited to setting up an
allocation of slots for candidacies or elected positions, but ensured that
quotas would be met in each vote. The result was that in the last internal
party election, women reached approximately 40% of the local leadership
positions.

Table 3. Legislation by Quotas

COUNTRY             DATE  LEGISLATION

Argentina           1991  Law N° 24.012 (30%)
                          Ley N° 24.012 (30%)

Bolivia             1997  Law to Reform and Complete the Electoral System
                          (multinomial candidates) (30%)
                          Ley de Reforma y Complementación del Régimen
                          Electoral (candidatos plurinominales) (30%)

Brazil              1997  Law 9.504 (20% quota, with possible increase to 30%
                          in the year 2000)
                          Ley 9.504 (cuota 20% ampliable a 30% en el año 2000)

Chile               1997  Legislation Project to modify the Organic Law on
                          Political Parties
                          Proyecto de Ley sobre modificación de la ley orgánica
                          de partidos políticos

Costa Rica          1996  Law 7653 (party quotas and delegates to the Assembly,
                          40%)
                          Ley 7653 (cuota partidaria y delegaciones de la
                          Asamblea, del 40%)

Ecuador             1997  Law to Protect Labor (20%)
                          Ley de amparo laboral (20%)

Mexico              1996  Amendment of the Federal Law on Electoral
                          Institutions and Procedures (30%)
                          Modificación al Código Federal de Instituciones y
                          Procedimientos Electorales (30%)

Panama              1997  Law N° 22 (30%)
                          Ley N° 22 (30%)

Paraguay            1996  Law 834 Electoral Code (20%)
                          Ley 834 Código Electoral (20%)

Peru                1997  Law N° 26864, Article 10, paragraph 2 of the Law of
                          Municipal Elections (25%)
                          Ley N° 26864 Artículo 10°, inciso 2 de la Ley de
                          Elecciones Municipales (25%)

Dominican Republic  1997  Electoral Law 275/97 (25%)
                          Ley Electoral 275/97 (25%)

Venezuela           1997  Law on Voting and Political Participation (30%)
                          Ley de Sufragio y Participación Política (30%)

Source: ECLAC, 1999.

The most significant advance for women in the executive branch has been the
creation of government bodies specialized in formulating
public policies for the advancement of women. With the aim of giving
anti-discrimination legislation normative force, Latin-American countries have
set up government mechanisms to foster gender equity; the majority of these
specialized bodies were established between 1980 and 1990. The characteristics,
objectives, management model, regulations, institutional legitimacy,
institutional dependence, available financial and human resources, and social
positioning of these mechanisms vary considerably among countries.

The values that gave rise to these mechanisms express different degrees of
legitimacy and interlocution. In many countries the legal mandate is precarious,
since both its creation and its sustainability depend on the will of the
executive branch, and it is sometimes not incorporated in national legislation.
This is especially serious for the articulation between the mechanisms and the
demands and needs of women and their organizations.

Institutional dependence also reflects the influence and capacity of these
organizations dedicated to the advancement of women within the state. In
six countries they have the rank of ministries: Costa Rica, Chile, Haiti, Peru,
Paraguay and the Dominican Republic. In the rest, they are mainly constituted
as directorates or departments, which means a lower position in the hierarchy,
fewer resources and greater difficulties in coordinating policies. Some of these
departments depend on the presidency, and almost all depend on ministries
or vice-ministries that handle social issues.

On the other hand, the management models of these national mechanisms
vary. Argentina, Brazil, Ecuador, Mexico and Venezuela created councils and
national commissions on the advancement of women. By contrast, Central
American countries (Costa Rica, El Salvador, Honduras and Nicaragua) created
institutes for women on the Spanish model, which, except in the Costa Rican
case, are not autonomous. Cuba and Puerto Rico are different: in Cuba
the Federation for Cuban Women (Federación de Mujeres Cubanas),
a constitutionally recognized governmental organism, also plays
the role of the governmental mechanism for the advancement of women, while
in Puerto Rico there is a separate governmental commission for Women's
Issues.

Drawing on the experience of the equal-rights laws of the Nordic countries
and the Plan for Equal Opportunities in Spain, some countries of the region
have put together plans for equal opportunities that pursue the aim of gender
equity and reflect government policy in this area. Their adoption reflects
above all the political will of governments to coordinate, across departments
and ministries, the more general objectives of government with the more
specific goals of promoting the advancement of women. Except in Chile and
Argentina, all of the countries that have put them into practice did so after
the World Conference of Women. These are long-term plans, most of
them covering a period of 15 years, whose success depends again on the strength
and legitimacy of the government in charge, expressed in presidential
support and in its capacity to negotiate and establish alliances with the other
sections of the government as well as with non-state actors, both national and
international (the crucial role of international cooperation in the development
of specific programs, as in Bolivia, must be highlighted). The
variety of these institutional procedures is considerable. In Central America,
while Costa Rica and El Salvador have embarked on national plans
of action, Guatemala and Panama have passed laws that institute gender
equity, as have both Honduras and Nicaragua (CEPAL-UN 2000).

In several countries (Bolivia, Colombia, Costa Rica, Cuba, Chile, Mexico,
Paraguay, Peru, Dominican Republic) extra-ministerial committees or ministry
commissions, or both mechanisms, have been created within the central
government to address specific problems. In others (Argentina, Costa Rica and
Mexico) special national commissions have been established for the follow-up and
monitoring of sectoral agreements on issues related to education, health, work,
housing, agriculture and the protection of the rights of women and girls. In
federal countries, regional mechanisms have been created (within the state) to
foster these rights and, as in all countries, municipal offices in charge of
women's issues have been developed.


Decentralization and Political Participation at the Local Level 


Studies made during the nineties show a greater participation of women at the
local level. As has been pointed out, “The local level is the one that has
more to do with everyday life and where their presence is greater” (Saa 1993:
11). This participation takes root in barrio organizations, in neighborhood
councils, in all types of organizations related to households, in parents'
centers, in schools, in health centers, and in issues related to housing. Women
bring to these movements “an important realism in terms of the fixation of
policies,” “a style less sacramental comprising the exercise of power,” and “an
authority exercised more from responsibility and service, than from the power of
one over the rest” (Saa 1993:13).

These local spaces of participation correspond to what has been called
“communal management” (gestión comunal) (Arboleda 1993: 24). Gender
relations here express a gendered division of labor and power: even though women
make up the larger share of those in charge of communal management (that
is, they manage the habitat and survival for themselves and their families), men
hold the hegemonic power and have the communal leadership in their hands.

The process of political and administrative decentralization embodied in
deep liberal and deregulatory reforms of the state also signified a profound
democratization of local governments. This democratization was immediately
translated into popular elections of mayors, prefects and municipal presidents.
We could say that by 1990 this process had been completed in all countries of
the region. The increase in the political legitimacy of elected positions, as
well as the increase in skills and resources in the municipal sphere, made local
government one of the spheres most permeable to the action of women, also
because local government was linked to the spaces of everyday community life.

Political participation of women advanced considerably
in almost all countries by the end of the eighties. For example, in
Ecuador, the share of women among candidates for mayor at the local
level increased from 6.12% in 1978 to 14.81% in 1988, and from 6.09% in 1988
to 9.30% in 1992 for the presidency of councils (town halls or municipalities).
Women’s participation increased from 1.01% in 1980 to 11.60% in 1992 for the
commissions of province capitals and from 0.85% to 8.70% in the rest of the
cantons (Vega, 1992:40).

In Brazil, in 1992, the municipalities governed by women
represented 2.45% of all Brazilian municipalities and close to
9% of the agricultural and commercial areas (Biasi Ferrari Pinto 1993: 69). In
Paraguay, in 1991, women had been elected to the position of intendant (mayor)
in only 13 of the 206 existing municipalities, which meant that women held 6%
of the highest positions in the communes. In the communal legislatures
(municipal boards), the access of women to councils reached as high as 60% in
the town of Desmedrados (Soto 1993: 109).

As a good proportion of local-level studies have shown, the individual
leadership of women is more present in small municipalities than in larger
or intermediate ones. This could be explained by the lower number of
internal struggles within political parties over the top candidacies and,
sometimes, by the difficulties some parties have in replacing their elites
at the national level, which leads them to a process of cooptation at the local
level that normally includes a good number of women.

Another interesting fact is that new right-wing parties, created or
reconstituted after the transition, incorporated a good number of women into
their lists and leadership positions. Examples can be found in
Ecuador, where 41.37% of women councilors belonged to right-wing parties, while
only 10.34% belonged to left-wing parties (Arboleda 1993: 39). The same occurs
in Brazil, where during the period 1989-1992 the PFL and the PMDB elected a
large percentage of the women mayors (prefeitas), approximately 30% each. Behind
them were the PL (9%) and the PDS (8.1%). Meanwhile, the two opposition
parties of the moment, PDT and PT, had barely 5.4% of feminine representation.

Table 6. Women in Local Government: Mayors, Most Recent Year Available
(selected countries, sorted by percentage, in decreasing order)

COUNTRY             YEAR  TOTAL        WOMEN        PERCENTAGE  TITLE
                                                    OF WOMEN
Nicaragua           1996  145          30           20.7        MAYOR
Panama              1999  73           10           15.7        MAYOR
Honduras            1994  291          97           12.7        MAYOR
Chile               1997  341          32           9.4         MAYOR
El Salvador         1998  [illegible]  [illegible]  8.4         MAYOR
Venezuela           1998  330          22           6.7         MAYOR
Cuba                1998  169          9            5.3         PRESIDENT OF THE MUNICIPAL ASSEMBLY
Costa Rica          1998  81           4            4.9         MUNICIPAL EXECUTIVE
Colombia            1998  [illegible]  [illegible]  4.7         MAYOR
Bolivia             1997  311          12           3.9         MAYOR
Haiti               1995  157          5            3.8         MAYOR
Argentina           1992  1100         40           3.6         INTENDANT
Peru                1998  194          7            3.6         MAYOR OF A PROVINCE
Brazil              1997  5378         190          3.5         PREFECT
Mexico              1998  2418         79           3.3         MUNICIPAL PRESIDENT
Paraguay            1996  220          6            2.7         INTENDANT
Dominican Republic  1998  115          2            1.7         REPRESENTATIVE
Guatemala           1994  330          4            1.2         MAYOR
Ecuador             1997  27           0            0           MAYOR
Uruguay             1998  19           0            0           INTENDANT

Source: ECLAC, 1999.

However, one must bear in mind that these left-wing parties are the parties
of the prefeitas elected in capitals and large cities (Biasi Ferrari Pinto 1993:
89-90). Regardless of party, it is generally at the local level that the
advances in the positions adopted by women as a collective in the social
movements of the 1970s seem to be consolidated most consistently.


Conclusion: Challenges in Feminine Political Participation


Generally, scholars have identified three factors responsible for the lack
of female participation and representation in public positions: the process of
political socialization, larger structural issues, and strong discrimination
against women (Hedblom 1987). In the Latin-American case, I have tried to
highlight the institutional features that exist beyond the indisputable
constraints that operate as givens because of the structural and intermediary
factors distancing women from politics, such as occupation, education, social
organizations, and the psychosocial and socialization dimension. The
difficulties that women face while trying to gain access to political positions
are indeed considerable. In the first place, there are the political parties,
which truly act as “bottle-necks,” making access to positions of party
leadership and candidacies for representative positions more difficult. In the
second place, there is the problem of influence in the higher ranks. “Those who
exercise governmental or regional positions tend to give less authority to women
than to men. It is the problem with the entire political class, because other
leaders are less likely to recognize the authority of a woman” (Saa 1993:13).
In the third place, women have problems inside institutions, where they always
find themselves to be “in a gender minority and as a minority in party politics”
(Arboleda 1993: 41). Women find themselves confronted with significant
limitations that come with institutional difficulties, where a dominant
instrumental rationality controlled by men rules and makes women’s institutional
practice difficult. Women also have less influence in the mass media. Maria
Antonieta Saa writes, “In my position as a mayor (I was designated as mayor of
the Conchalí commune of Santiago de Chile by President Aylwin after the
restoration of democracy) I was interviewed several times, but in feminine
sections or in sections concerning the portrayal of personalities, more than in
terms of my municipal management” (Saa 1993:13).

A further reason that women are prevented from participating in political life
is that they suffer important limitations from their own families and from the
rampant machismo that exists in the culture. The social context presents many
obstacles for women. Women have less time, since most of them still carry out
all the roles in their families. Since dedicating one’s life to politics,
especially at the local level, is not well remunerated, many of them are forced
to combine these roles with other jobs.

In summary, the confluence of all these factors has contributed to reducing the
political opportunities of Latin-American women. These limitations continue
today; however, the changes that occurred during the nineties suggest
that the feminine advance in exercising leadership in democratic
institutions is going to continue as a gradual but unstoppable process in the
21st century.

Regardless of the positive evolution that took place in the nineties, women are
still underrepresented in the different spheres of power. The attempt to correct
the imbalance in the representation of women encounters strong resistance at the
structural, institutional, and psychosocial levels, making change difficult
in the short term. At the same time, the disparity of the data, the incoherence
of much of it, and the diversity between and within countries force one to think
that positive results do not always indicate a general policy that will
facilitate women's access to decision-making levels; on the contrary, it must be
pointed out that each country has its own dynamic, and institutional reforms
have yet to mature. Modernization processes and state reforms, of both
decentralization and globalization, strengthen the presence of women in centers
of public decision, but they also introduce strong tensions in relation to their
participation. The different institutional actors have not always satisfactorily
considered women's rights to participate in public debate and to exercise a
fuller citizenship than the one defined by the right to vote. In substantive
terms, it could be said that these processes have shown important deficits of
citizenship.


Notes 


1. Already in December of 1979, the United Nations General Assembly had adopted the
Convention on the Elimination of All Forms of Discrimination against Women, which all
Latin-American countries ratified during the eighties. The ratification of this Convention
revealed the political will of Latin-American countries to join the international agenda
of reaching gender equity. But it was only with the IV Women's World
Conference in 1995 that actions to promote gender equity received strong support for
their incorporation in government agendas. Both the Platform for Action (Plataforma de
Acción) that originated from this meeting and the Program for Women Regional Action of
Latin America and the Caribbean 1995-2001 (Programa de Acción Regional para las Mujeres de
América Latina y el Caribe), approved by member States during the VI Regional Conference
for the Integration of Woman in the Economic and Social Development of Latin America
and the Caribbean (VI Conferencia Regional sobre la Integración de la Mujer en el Desarrollo
Económico y Social de América Latina y el Caribe) held in Mar del Plata in September 1994,
emphasized that governments should include a gender perspective at the highest levels of
decision making in State policies, programs and projects.


2. Already at the beginning of the 20th century, four international feminine congresses were
held in Latin America, groups of women pushing for their voting rights were formed in
almost all countries, and in some of them feminine parties were even created: in Brazil (1910),
Cuba (1914), Argentina (1918), Chile (1922 and 1924) and Panama (1923). These struggles
for the feminine vote extended for more than forty years. While the feminine vote (an optional
vote for literate women) was recognized in Ecuador in its 1929 Constitution (it should not
be forgotten that this occurred under the shadow of conservative sectors of the National
Assembly and without the presence of a movement pursuing voting rights for women), the
feminine vote in Paraguay was not approved until 1969 (under the dictatorship of Alfredo
Stroessner). Most countries recognized citizenship for women between 1945 and 1955, after
many years of mobilization and actions by women's groups: Brazil (1932), Uruguay (1932),
Cuba (1934), Dominican Republic (1942), Guatemala (1945), Panama (1946), Argentina (1947),
Venezuela (1947), Costa Rica (1949), Chile (1949), El Salvador (1950), Bolivia (1952),
Mexico (1953), Colombia (1954), Honduras (1955), Peru (1955) and Nicaragua (1955). In many
countries, even in those where women struggled actively, it was the rule that voting rights
were first granted for municipal elections and sometimes, after several previous attempts,
as in the case of the Dominican Republic, women voted on two occasions (1934 and 1938), as
a preparation, before obtaining full citizenship.


3. Many of the elected women, especially at the local level, point out that they came to
power without a feminist or gender position, although it is true that they do assume as
“natural” their role of working for the practical needs of women.


4. We could add to these types of explanations those indicated by some authors, who point
out that the very characteristics of these organizations in the authoritarian context (less
requirement for specialization of functions, extension of domestic roles to the community
level) made them more permeable to feminine participation.


5. During the ‘80s and the ‘90s, feminine participation in these social movements affirmed
women's condition as citizens, giving their identity strong expressive components.
In this regard, see Jelin (1987).


6. Women were notable during the beginnings of the Latin-American workers' movements
(in Uruguay, in 1881, the Feminine Section of the Montevidean Federation of International
Workers Association (Sección Femenina de la Federación Montevideana de la Asociación
Internacional de Trabajadores) was created, while in Chile, in 1887, the Society of Women
Workers for Mutual Help and Saving Trust (Sociedad de Obreras de Socorros Mutuos y Cajas
de Ahorros) was born). Even so, and as happens in the leadership of parties, today the
feminine presence in leadership positions of unions, whether in base unions or federations,
confederations and centrals, is also very low. It is not a coincidence that the
decrease in feminine participation started with the institutionalization of women
workers' organizations and the creation of large union centrals such as the Argentinean
General Confederation of Work in 1930 or the Colombian Working Central in 1935. Apart
from Cuba and Nicaragua, the 25% barrier of feminine presence in these types of positions
is rarely exceeded, the average being around 9%. Much higher is the presence in guild and
professional associations, especially in those with more presence of women (teachers,
journalists, psychologists, nurses, etc.). In business organizations the feminine absence is
almost complete. But where women are a collective social actor is at the level of barrio,
local or regional social organizations. Countries like Colombia, Venezuela and Chile have
long-established networks of neighborhood and community organizations, fostered by
governments during the ‘60s as instruments of community development, which have always had
a large feminine participation and leadership. The ‘60s and the ‘70s saw the rise of
these citizens' movements, with an important presence of women, also in countries like Peru,
Mexico, Chile, Uruguay, Paraguay, Venezuela, Colombia and Guatemala.


7. In El Salvador, the electoral process of 1994 was the first to be held after the Peace
Treaties were signed in January 1992. These elections combined municipal, legislative and
presidential contests, and were the first elections in Salvadorian history in which
participation was guaranteed for all political forces. The significance of these elections
was greater for women, since this was the first time they could freely exercise their right
to vote and to be elected. However, abstention among citizens remained very high: while in
the 1989 elections abstention reached 48.6%, in 1994 it was 47.6%. Abstention among women
was much more pronounced than among men. Approximately 6 of every 10 Salvadorian
women of voting age did not exercise the right (LAS DIGNAS 1995).


8. In a country like Chile, with one of the longest-standing democratic histories in Latin
America until the 1973 coup d'état, electoral participation of women was not lower than
that of men, although differences regarding interest in politics did exist. Moreover, “women
care about the problematic of politics, conceived as the resolution of people's problems,
and not so interested in the activity professionally defined as politics” (Garretón 1990: 3-4).


9. Continuing with the example of San Salvador and the elections mentioned, the last
pre-electoral poll conducted by the University Institute of Public Opinion (Instituto
Universitario de Opinión Pública) (IUOP-UCA) in February 1994 clearly showed that the
feminine voting intention for the governing conservative party ARENA was 56%, three times
greater than that for its two closest adversaries, Christian Democracy (Democracia
Cristiana) (PDC) and the coalition of left organizations headed by the FMLN (18% for each
of them). These preferences could be explained by the stabilizing discourse around family
and society developed by the governing party ARENA; by the social compensation programs
deployed by the government among marginal populations, which attended to minimum
necessities closer to the everyday life of women; and by the lesser information available
to this collective, which had fewer educational resources at hand and saw itself as largely
subject to the basic cultural values transmitted through social tradition (Moreno 1997:
44-45).


10. Garretón (1990: 24) points out: “Told in a very simple way, if women's votes expand,
men's votes will not decrease. Instead, if participation of women in the scarce positions of
leadership and representation increases, participation of men will decrease.”


11. Already in 1948, a woman in Cuba was designated as a minister without portfolio.
In the count that CEPAL makes for all Latin-American countries between 1952 and 1989,
the first ministries occupied by women are concentrated in social policy, especially
Education (6), Work (3), Social Welfare (3) and Health (2). The ministries of Justice in
Chile in 1952, of Protection in Venezuela in 1968 and of Foreign Relations in Argentina in
1989 are exceptional cases (CEPAL-UN 1995).


12. Maria Estela Perón held the Presidency of Argentina between 1974 and 1976, having succeeded 
from the vice-presidency on the death of her husband, President Juan Domingo Perón. Lidia 
Gueiler was president of Bolivia for eight months (from November 1979 to July 
1980), designated by the Parliament, and was finally overthrown by a coup d'état. In March 
1990, Judge Ertha Pascal-Trouillot provisionally assumed the presidency of Haiti, to prepare 
the elections at the end of the same year. In Ecuador, Rosalia Arteaga held office for 
only three days, thanks to a National Congress mandate, in the middle of the crisis surrounding 
the removal of Abdalá Bucaram. And finally, in Panama, in the 1999 
elections, Mireya Moscoso, widow of Arnulfo Arias, was elected for the Arnulfista Party. In this 
case, as in that of Violeta Chamorro, the political capital that she enjoyed had 
been built by her husband. 


13. These data come from the study “Elites Parlamentarias en América Latina,” directed by 
Professor Manuel Alcántara, which gathered the opinions of the members of the Chambers 
of Representatives in the 18 Latin-American countries. See Mateos (1997). 


14. Notwithstanding the important deficiencies in party-level data in the region, we 
can say that by the beginning of the nineties, women's participation 
in the national leaderships of political parties ranged from zero (the UCR 
in Argentina, and the Authentic Liberal (Liberal Auténtico), Labor (Laborista) or Democratic 
Revolutionary (Revolucionario Democrático) parties in Panama) to as much as 40% in the 
National Democratic Union (Unión Democrática Nacional) in El Salvador in 1993, or 
the Colombian Liberal Party (Partido Liberal Colombiano) with 33.3% of electoral party 
positions for women. 


15. Surprising as it may seem, the first Latin-American precursor is the Peronist Party 
(Partido Peronista), which in its women's section guaranteed 33.3% of electoral party positions 
to women. 


16. In any case, the immediate effect of institutional reforms is not clear. Take 
the Paraguayan case as an example. The presence of women parliamentarians in the National 
Congress increased from 5.6% for the period of 1993-1998 to 8% for the 
period of 1998-2003, with the Senate experiencing the largest 
increase (from 11.1% to 17.8%) as an effect of the establishment of a minimum quota of 
20% for women dictated by the Electoral Code. However, in the Chamber of Deputies, 
representation was still only 2.5%, even with the quota in operation. This persistence 
of low female representation has been attributed to the existence of small 
electoral districts for the Congress, while the system of quotas is reinforced in the 
Senate by longer lists (CEPAL 2003). 


17. Brazilian law differs from the Argentinean one in two basic ways: first, the 
minimum percentage of women is 20% and not 30%, as in the Argentinean case; and 
second, Brazil chooses its municipal legislators in a single plurinominal district making 
use of the method of open lists. 


18. In the Costa Rican case, the technical process was conducted and led directly by the 
highest government authority for women, without directly including the women's 
movement, which only participated in specific deliberations. The Salvadoran case, with a 
technical profile from the beginning, was led by the First Lady, but had direct participation 
from the women's movement. In Panama, the plan is the result of an initiative coming 
from women's movements, before Beijing, in a process led and negotiated with the State and 
civil society, under the leadership of the Woman and Development Forum (Foro Mujer 
y Desarrollo). Without a doubt, the Guatemalan case is the most complex of all, because it 
combines a proposal from the government under the coordination of the ONAM and the 
First Lady, and a national policy for the development of women (EJES) that emerged from 
the National Forum of Women (Foro Nacional de la Mujer) under the umbrella of the Peace 
Agreements (Acuerdos de Paz) (CEPAL 2000). 


19. Certainly interesting is the distinction made by Fontes and da Graça (1992) between 
the different necessary roles in the community: a productive role, a reproductive role, a 
community management role and, lastly, what they call “community 
politics,” related to the formalization of community organizations as political entities, which 
translates into a position of authority from which power and status emerge. Men essentially carry 
out this last role, and only those women candidates who have held office are nominated 
for top-level municipal positions. See Angela Fontes and Maria da Graça (1992: 145). 


Democracy and New Social Forces in Bolivia* 


PILAR DOMINGO, Universidad de Salamanca 





Abstract 


The forced resignation of Gonzalo Sanchez de Lozada from the presidency in October 
2003 attests to the fragility of the process of democratization in Bolivia. A period 
of political and economic stability following the 1985 economic austerity measures 
prevailed for over ten years, but social discontent with the inadequacies of the 
political and economic model has become more evident since the end of the 1990s. 
This article charts the political, economic and social processes which have interacted 
in complex, and indeed contradictory, ways since the 1980s, advancing towards 
democracy in some important aspects, but failing to overcome structural barriers of 
exclusion, and rooted patterns of state patrimonialism. In a context of diminishing 
societal acceptance and legitimacy of the democratic institutions, new social forces 
have emerged which are challenging assumptions within the transitions literature 
regarding the path towards democratic consolidation and even the end-goal. 


Introduction 


More than a year after the resignation of President Gonzalo Sanchez de Lozada, 
in October 2003, Bolivia's political system seems a far cry from the model of 
liberal democracy propounded by modernization theory, or the more optimistic 
transitions theorists of the 1980s and 1990s. Political institutions of democratic 
representation have not been successfully stabilized despite appearances of 
political stability for over a decade after the 1982 return to 
civilian rule. Institutions of rule of law and oversight mechanisms — some of 
them newly inaugurated in the 1990s — present a mixed bag of some successes, 
but also persistent expressions of political control and manipulation. The 
political economy of structural adjustment, fiscal austerity and liberalization, 
while successful in bringing an end to hyperinflation, has failed to deliver on 
the promised "trickledown" effects for the majority of the Bolivian population. 
The heightened sense of social exclusion prompted by economic recession and 
continued structural poverty and inequality has widened the gap between 
the majority of the population and a discredited political class perceived as 
corrupt and self-serving. Disenchantment has increased to some extent owing to a 
variety of new social forces that combine old demands of social justice with 
new expressions of identity politics increasingly articulated around indigenous 
demands, and anti-globalization discourses. 

Bolivian politics, however, despite the escalation in instability and expressions 
of political violence in the last four years, does not as yet appear to exhibit the 
scale of social decomposition, and state failure that is evident in some neighboring 
countries. Nonetheless, it has become a clear example of a failing project of liberal 
democracy modeled around Dahl’s concept of polyarchy. Clean elections are 
more or less secure, but civil rights and human rights abuses have worsened in 
recent years. Corruption in public office is rife. Thus, even on some indicators of 
polyarchy, Bolivia falls short of the mark — and, even before the events of October 
2003, was showing signs of regression to authoritarian forms at some levels. On 
the other hand, democratic institutions have not broken down completely, and 
the political system has even shown itself to be permeable to new social and 
political formations. The current climate of change, possibly even at a systemic 
level with the prospects of a constituent assembly, is, indeed, fraught with high 
levels of uncertainty and a sense of instability. However, at the same time, it may 
reflect precisely the kind of resilience and openness to renewal and innovation 
that can allow Bolivia to forge its own path to democracy. Clearly, though, the 
current conjuncture is uncertain, and the political outcome unknown. 

The Bolivian experience is an example of the complexities of post- 
authoritarian political development. Contrary to the hopes of the authors 
writing in the transitions literature, modern liberal democracy is not taking 
root (Carothers 2002). Moreover, the political forms that are taking shape do 
not quite resemble liberal democracy — and in some respects some forms of the 
emerging political discourse indeed challenge key premises of liberal democracy. 
This is not to say that democratic institutions have proved irrelevant. Not only 
are they not irrelevant, in some cases certain political and institutional reforms 
have even facilitated or opened up new political spaces for the emergence of 
new political formations perhaps more in tune with popular sentiment than the 
discredited traditional political parties. Nonetheless, recent social mobilization 
and political protest politics reflect the very fragility of democratic institutions, 
and highlight the real risks of political reversals to either authoritarian forms on 
the one hand, or new — and uncharted — political waters of different forms of 
political participation on the other. 

This paper will map key processes that have contributed to forging Bolivia’s 
fragile democratic polity. It will address how these developments have opened 
the space for an open-ended political process that in some respects deviates not 
only from the desired path of democratic consolidation outlined in the transitions 
literature, but also from the more unpalatable forms of semi-democracy identified 
in the experience of various countries of the region (semi-democratic populisms, 
“delegative democracy”). The paper will analyze the emergence of new social 
forces in the context of a “failing” democratizing polity, as the political and 
economic project since 1985 has not acquired enduring levels of rootedness 
and societal acceptance. The 2003 October events that led to the resignation of 
Gonzalo Sanchez de Lozada were the culmination of an escalation of political 
violence and state repression, and reflected cumulative failings of a fragile and 
incomplete democracy. 

In hindsight, the commitment to a democratic project has been half- 
hearted and only served to benefit a ruling political class, which, despite passing 
important political and institutional reforms, ultimately proved unable to shed 
illiberal practices of a patrimonial state. The political project was complicated 
by the structural difficulties of a highly dependent economy in which neo-liberal 
measures have not resolved problems of poverty and increasing inequality, nor 
provided the prospects for sustainable long-term growth. Finally, the discontent 
that this has fuelled has propelled new political and social formations that appear 
to be displacing a discredited “traditional” political class, and moving the Bolivian 
polity towards the uncharted waters of a constituent assembly. This may prove to 
be a new political foundation, or, given Bolivia’s structural and political legacy, 
yet another project that fails to meet the expectations that it generates. 

A first section will outline the institutional development of the Bolivian 
political system in the last twenty years. This is relevant, precisely because 
Bolivia was held as an example of political stability. It was considered that the 
cumulative political reforms served to strengthen representative and accountable 
government. The results have been mixed, however, and in recent years, there 
have even been “reversals” with regard to the levels of politicization of some 
of the new institutions. The second part of the paper will briefly examine the 
economic policies since 1985, which have been a key component of state reform. 
The consequences of the economic model are closely linked to the levels of 
disenchantment with democratic forms, and the emergence of what are viewed 
as “anti-systemic” political and social groupings. The third section of the paper 
will review the nature of the key political and social actors in Bolivia. These have 
evolved in tandem with the political and economic reforms. A key aspect of the 
political process of the last twenty years has been the dominance of coalition 
politics that have been the backbone of institutional stability. At the same time, 
though, the competing political forces have changed and evolved over time. 
New actors have joined the game of electoral competition, altering the stakes 
involved. And some of these actors combine electoral strategies with street-level 
protest politics. 

The fourth part of the paper will examine the more recent expressions of 
social mobilization that have both undermined the institutional niceties of the 
polyarchical forms of government that had been taking shape, and also reflect 
widespread societal disaffection with the political system, as it stands, but not 
necessarily with democracy as a form of government. This has culminated in 
the October crisis of 2003 and subsequent levels of political and institutional 
uncertainty — albeit, still contained within constitutional limits of sorts. Here the 
recent referendum on the gas question and the forthcoming Constituent Assembly 
are key moments of political and economic redefinition. The final section of 
the paper will examine precisely these contemporary challenges to Bolivia's 
democratic process, which have to do with the prospects of adjusting economic 
and political forms to social demands without upsetting the developmental 
exigencies of the global economic setting in which Bolivia's highly dependent 
economy is inserted. 


The Politics of Stability and Institutional Reform 


COALITION POLITICS 


The stability of the political system has been underpinned since 1985 by a politics 
of coalition building to secure stable government with majority congressional 
support. As a political strategy, coalition politics emerged as the consequence 
of a combination of both institutional and political factors. 

Firstly, the political strategy of ruling through coalitions was in part the 
result of an arduous learning process forged through the tumultuous transition 
experience since 1977. Specifically, by 1985, the disastrous experience of the 
Siles Suazo minority government, beset by hyperinflation of 23,000% and a 
widespread crisis of social mobilization and political polarization had highlighted 
the necessity of ruling with a majority coalition in order to secure a minimum 
of governability. Clearly, the importance of learning processes as an explanatory 
factor to understand more long-term political development trends is extremely 
uneven. In the case of Bolivia, the learning process seems to have mattered in 
terms of prompting coalition formation after 1985. 

The multi-party system eventually also adapted to the incentives of coalition 
building, given that no party could hope to achieve much more than a 
quarter of the vote in presidential elections. (By contrast, this has not led to 
consensus-building strategies in other political systems.) The electoral system 
of proportional representation yielded multi-parties fraught with internal 
fragmentation and personal infighting during the early years of the transition. 
After 1985, the party system, while allowing for the emergence of new political 
formations, proved somewhat more stable. For a few years, it settled around 
coalitions dominated by the "traditional" political class made up of the centre-right 
MNR (Movimiento Nacionalista Revolucionario), the right wing ADN (Acción 
Democrática Nacional), an electoral vehicle for Hugo Banzer, and the somewhat 
less than social democratic MIR (Movimiento de Izquierda Revolucionaria) headed 
by Jaime Paz Zamora. 

Linked to this, coalition formation has also been prompted by the system 
for electing the president which requires a congressional majority of 50 percent 
in the event that no candidate obtains 50 percent of the vote in the presidential 
election. This is key to understanding the incentive structure for the formation 
of coalition governments (say in contrast with second round presidential 
elections). Since 1989, the congressional majorities that came together to elect 
the executive did so after a period of harsh negotiation, which 
established the terms and division of power quotas of the future government 
coalition. The system of closed lists for electing members of congress (prior 
to the electoral reform of 1996, which introduced uninominal candidates) 
facilitated coalition building between parties. Party leaders have generally been 
able to impose decisions resulting from top-level negotiation between coalition 
partners on the party rank and file. 

Finally, the lure of state patronage has proved to be a crucial factor in 
holding together unseemly political alliances. The importance of clientelism 
in understanding the institutional stability of the last eighteen years in Bolivia 
cannot be overstated (Gamarra and Malloy 1995; Gamarra 2003; Domingo 
2001). Political parties need access to power to obtain sufficient resources, and 
retain sufficient base support through rewards in the form of employment 
possibilities (pegas and peguismo), or prospects of business and moneymaking 
opportunities, in order to survive in the electoral game. The attraction of public 
office is powerful, and induces parties to enter into political alliances in which 
the terms of political negotiation have little to do with programmatic or electoral 
commitments and revolve essentially around the distribution of power quotas 
and state patronage. 

Clientelism in Bolivia penetrates public office and government at all levels, 
and both overshadows and steers other aspects of the policy making process. 
Political clientelism is a complex and multi-layered phenomenon. On the one 
hand, it fuels the perception of chronic corruption at all levels of the public 
sector and government, thus contributing to undermining the legitimacy of 
the political process. On the other hand, clientelism forms part of the public 
expectations of how governance works. The failure to “reward” loyal supporters 
with employment prospects is perceived as a betrayal and even misuse of public 
office (Lazar 2004). 

Overall, though, while clientelism has served to underpin political stability and 
hold together unlikely political partners through difficult policy-making processes, it has 
also been a key contributing factor to the process of increasing disenchantment and 
disaffection with the political system and with the institutions of representation. 
The growing social base of protest mobilization and what are called “anti-system” 
parties (currently the MAS, led by Evo Morales, and the smaller MIP, led by the 
Aymara leader Felipe Quispe) is precisely linked to the perception of corruption 
that pervades the traditional political parties — in particular, but not exclusively, 
the MNR, the MIR and ADN. (The evolution of the political actors in democratic 
rule will be discussed below.) 

Coalition government had the merit of facilitating institutional reforms, 
many of which augured well initially for the prospects of building a democratic 
polity based on accountability, broader political participation, and the societal 
embeddedness of certain political institutions. Many of these reforms have 
signified important advances in terms of strengthening not only formal 
democracy, but also building some legitimacy capital in key areas of state-society 
relations. In particular, the first Corte Nacional Electoral (CNE) appointed in 1991, 
and the office of the Ombudsman became institutions of widespread credibility 
and public trust. But the reforms overall have had mixed results. The political 
class has been reluctant to relinquish control to autonomous institutions. In some 
cases, the reforms have had unforeseen consequences for the balance of power 
between political forces — both old and new. 


Electoral Reforms 


A series of reforms of the electoral and party system during the 1980s and 1990s 
affected the political system in different ways. Proportional representation is a 
historical feature of the electoral system. Prior to the 1991 reform the formula had 
been tinkered with on a number of occasions, but on the whole, elections operated 
by proportional representation, yielding, upon the return to electoral politics in the late 1970s, a 
fragmented multi-party system. Overall, the electoral process was politicized, 
fraudulent and lacking in reliability and transparency. It was only with the reform 
of 1991, the product of a political pact among the major political parties, that 
substantial changes were introduced to improve the credibility and reliability of 
the electoral process. The CNE was established as an autonomous and bona fide 
institution; the first appointed members (with a two-thirds majority in congress) 
proved to be worthy officers of electoral control and oversight. Moreover, 
mechanisms were introduced to reduce the possibility of fraud at the various 
stages of the electoral process. 

The 1996 reform introduced important modifications to the electoral formula 
with the establishment of a mixed electoral system (inspired by the German 
model). The introduction of uninominal seats in congress has had an important 
impact on the internal workings of political parties, and has paved the way for 
new political actors such as the MAS (Movimiento al Socialismo) with direct 
appeal at the grass-roots level. 

Recent reforms, hastened through congress in August of 2004, will weaken 
political party dominance in electoral politics. The Ley de Agrupaciones Ciudadanas 
y los Pueblos Indígenas allows independent candidates to run in 
general and municipal elections without a party ticket. This represents an 
important challenge to political parties, and significantly alters the rules of the 
political game, with unclear consequences for the dynamics of coalition formation. 
The reform is in tune with a climate in which political parties are held in low 
esteem, and there is a general call for more “direct” forms of government in which 
civil society plays a stronger role in policy making. What this means in practice 
is unclear, as are long-term consequences of current reforms, which weaken 
political parties’ dominance in the political system. 

Electoral reforms, to the degree to which they have added credibility to 
electoral politics and to the vote — no small achievement given long-standing 
practices of fraud and malpractice — provided a certain degree of legitimacy to 
the political process. Also, significantly, they have contributed to opening up 
the opportunity structures for new social forces to emerge, and new expressions 
of political participation. In some respects, electoral reforms are facilitating 
new forms of political exchange whose consequences for long-term political 
development in Bolivia remain to be seen, but which are already shaping new 
political strategies. Significantly, also, the reforms seem to reflect popular demands 
regarding the direction that the political system should take. This is important 
inasmuch as it shows that, despite the traditional parties’ resistance to the 
changes (that can only be to their detriment), these parties have not been able 
to duck popular pressures. This would suggest a constant process of adaptation 
of the system, through cycles of protest, political crisis, institutional reform, and 
subsequent adaptation by the various political actors to the changing rules of 
the game. This is repeated in other institutional spheres. 


“Ley de Participación Popular” and Multicultural Government 


The political decentralization process introduced in 1994 under Sanchez de 
Lozada, in part as a concession to social pressures and the more progressive 
elements in his coalition government, has been considered a landmark move 
towards enhancing (or at least redefining) political participation, strengthening 
local government and formalizing the multi-cultural dimension of Bolivian society. 
The Ley de Participación Popular (LPP) led to a substantive decentralization of 
resources to the municipal level: it extended local government through the rural 
areas, established direct elections for municipal officers, and set up oversight 
mechanisms on the management of resources. The LPP amounted to a measure 
that facilitated the incorporation of the (largely excluded) rural and indigenous 
population into political life, and has included the legal recognition of indigenous 
communities. 

The impact of the LPP is complex, and uneven in terms of the degree to which 
it has led to more inclusive and participative government, or a better and more 
transparent management of public resources. Critical analyses of its consequences 
point to the different experiences of Participación Popular throughout the national 
territory. In some municipalities, it has indeed served to pave the way for new 
political forces; in others, it has reinforced local power structures to the detriment 
of the more vulnerable social groups. Nonetheless, it has clearly contributed to 
transforming the political landscape, and new social movements have successfully 
capitalized on the political space provided by municipal elections. 


The “Defensor del Pueblo” and Judicial Reform 


Other reforms carried out mostly in the 1990s have included a variety of measures 
aimed at strengthening oversight and accountability measures, some with far from 
negligible success (Burgos 2003). Laws and the constitutional reform of 1994 
aimed at strengthening the rule of law included the creation of an ombudsman, 
a judicial council and a constitutional tribunal, although these were only 
established as late as 1998, due mostly to political resistance and inter-party fighting. 
The ombudsman, with the first appointment of Ana Maria Romero, is a striking 
example of a new institution that very rapidly developed a keen presence in civil 
society and a very positive public image. The development of this institution as 
politically independent and truly committed to denouncing legal wrongdoing 
and human rights abuses proved to be hugely disquieting for the successive 
governments. Following the ombudsman’s first term, the appointment of a second 
defensor del pueblo became a political circus as the coalition government under 
Sanchez de Lozada went to great lengths to avoid the re-appointment of Ana 
Maria Romero, and to seek to place a politically loyal candidate. The constitutional 
tribunal has also proved to be an institution far from averse to confronting the 
interests and decisions of the executive branch in its rulings. 

Cumulatively, the effect of these reforms was to strengthen checks and limits 
on government. Overall, they were the result of a combination of concessions to 
electoral promises and political and social pressures, the consequence of inter- 
party negotiations, and at some level even some degree of sincere commitment 
by elements within the different governments to constructing rule of law. At the 
same time, though, it soon became evident that the ruling political class was less 
than keen to abide by rule of law imperatives that come with democratic rules of 
the game, and attempted in different ways to subvert these new institutions. This 
contributed to worsening the democratic deficit that has fuelled social protest 
and widespread disaffection with the ruling parties. 

Institution building, then, has advanced in contradictory ways. On the one 
hand, important reforms have put in place worthwhile mechanisms to improve 
accountability and broaden political participation. On the other hand, the 
traditional political parties that have governed since 1985 have not only proved 
reluctant to support the spirit of democratic rule encompassed in these 
reforms, but have at times been downright obstructionist in the application of 
new norms and institutions directed at more effective limited government. 


The Nueva Política Económica (NPE) and Economic Reform since 1985 


On the economic front, the neo-liberal model in place since 1985 has failed to 
provide a propitious environment of growth and development for democracy 
to take root. The Decreto 21060 passed in 1985 laid the foundations for the "new 
economic policy" or nueva política económica (NPE), introducing an orthodox 
plan of stabilization and fiscal austerity measures, which promptly brought 
an end to the crisis of hyperinflation under Siles Suazo. Subsequent reforms, 
most closely associated with Sanchez de Lozada, but not challenged in any 
fundamental way by other government administrations, have added shape to the 
NPE, deepening Bolivia's acquiescence to the "Washington consensus" through 
diverse privatization and liberalization measures. 

One cannot overemphasize the importance of coca production and 
contraband as important sources of revenue, which have helped to boost growth 
levels through much of the 1990s. The crackdown on coca production under the 
Banzer government (1997-2002) severely slowed economic growth, in addition 
to fuelling social conflict, and not only in the coca producing areas (mostly the 
Chapare and Cochabamba, which have become the political heartland of the Evo 
Morales MAS movement). 

Overall, the record of the economic model has been mixed. Keeping inflation 
at record lows is a commendable success of the NPE. At the same time, though, 
the model has seemingly not laid the groundwork for long-term sustainable 
growth or poverty alleviation. Bolivia's economy remains highly vulnerable to 
external shocks (patently evident with the Brazilian and Argentine crises). It has 
not been able to shake off an extreme dependency on foreign aid. Moreover, the 
export-led model of development upon which the NPE is structured remains 
highly vulnerable to price and demand fluctuations on the two principal exports 
— gas and soy (Crabtree 2004). 

There is consensus that the NPE has not served to reduce poverty and income 
inequalities in any structural way, despite the creation of a number of poverty 
alleviation programs (most notably the Fondo Social de Emergencia under Paz 
Estenssoro, and the BONOSOL fund under Sanchez de Lozada). Recent reports 
in fact indicate that inequality has worsened (Crabtree 2004). Moreover, the 
NPE has led to a deterioration of labor conditions (Assies and Salman 2003). 
The rural sector has remained under-resourced, and recent trends point to a re- 
concentration of land holdings, especially in the lowland areas. 

The NPE has thus also affected patterns of political mobilization and the 
shape of recent protest politics. First, the economic model dramatically weakened 
old structures of union politics centered on the COB and the miners’ union, 
following the closure of state mines in 1985; this in turn served to strengthen 
the dominance of political parties in the political game. Secondly, the impact of 
privatization on specific sectors and the change in terms of exporting natural 
gas in favor of foreign companies have prompted various incidents of social 
mobilization, with varying degrees of organization and structured strategies. 
The “water war” in Cochabamba in 2000, when the local population took to the 
streets to protest the privatization of water and the rise in water bills, is a clear 
example of societal reaction to direct consequences of the NPE on real living 
conditions. More dramatically, the “gas war” in October of 2003 culminated in 
the resignation of the President. 

Thus, opposition to the economic model not only reflects a general widespread 
perception that economic liberalization has benefited only a minority of the 
population and excluded the many (added also to the sense that privatization 
has been fraught with high levels of corruption among members of the ruling 
parties). It also expresses specific grievances by sectors of society who in very 
concrete ways feel they have lost out in the process. This is echoed at various social 
levels. Finally, it is important to point out that neo-liberalism has been imposed 
with increasing recourse to repressive measures. Every government since 1985 has 
had to resort to states of emergency to quell social protest (Assies and Salman 
2003). 


Old and New Political and Social Actors 


Over the past two decades, Bolivia has shown a combination of relative political 
party stability, and the emergence of new political and social forces. This indicates 
shifting patterns of political allegiances in the electorate, as well as growing 
disenchantment with an increasingly discredited “traditional” political class that 
has proved incapable of consolidating institutional and electoral cohesion in the 
political system. At the same time, the system has allowed for the incorporation 
of new political forces, which, moreover, have had important electoral successes, 
notably at the municipal level. 

A first point to note is that while political parties have dominated the political 
landscape for much of the last two decades, this in itself is a new phenomenon 
in Bolivian politics, as noted before. Previously, politics was brokered not 
through institutions of electoral representation, but through the military and 
union activity, namely, the COB — Central Obrera Boliviana. Before 1985, the 
democratic credentials of most political parties were highly questionable. It was 
only with the severity of the economic crisis, and the widespread exhaustion with 
the dramatic transition process that political parties emerged as the principal 
political actors. 

Electoral democracy arrived as the “only game in town” by default. The 1985 
economic measures effectively swept aside old cleavages. The COB and miners’ 
federation were severely weakened, and for a decade or so, the ideological 
polarization and zero-sum politics of the past were left behind, as consensus 
politics became the prominent strategy of the surviving political forces. Political 
parties became the principal political actors for a while, although policymaking 
has been principally a top-down affair, with decisions taken at the leadership level, 
and congress effectively turned into a rubber-stamping institution. Thus, political 
parties have been increasingly seen as channels of state patronage and little else. 
Coalition politics were dominated by three main political formations — the MNR, 
ADN and the MIR, around which successive governments were structured. The 
competition between them was fierce, but for the electorate it seemed that the 
stakes increasingly revolved around access to state patronage and little else. 

New political cleavages have shaped a variety of emerging political and social 
formations. Firstly, towards the end of the 1980s, an assortment of populist 
formations emerged, with strong appeal among the displaced urban poor and the 
informal sector (namely, CONDEPA and UCS, and to some extent more recently, Nueva 
Fuerza Republicana, or NFR). These political groupings have tended to lack clear 
political discourses, and their electoral strength lay in the direct appeal of the 
leadership figures with the electorate. In addition, they were soon co-opted into 
the logic of coalition politics; thus, their distinctive "protest vote" quality eroded 
as soon as they were seen to be replicating similar patterns of clientelist practices 
of the traditional political parties. 

A second grouping of new political formations includes a variety of rural-based 
social cum political movements, which have grown on the back of a combination 
of cleavages, organizations, and demands. Through their role in the October 
2003 events and a more radical political discourse, they represent a more forceful 
challenge to the modus vivendi of the ruling political parties. The most recent 
incarnations of these forces include the MAS, led by the charismatic cocalero 
leader, Evo Morales, and the smaller MIP, led by Felipe Quispe, an Aymara leader 
with a more explicit indigenista agenda, rooted somewhat in earlier versions of 
katarista movements. The combination of social mobilization strategies that 
these movements have adopted, with relative success and positive political results, 
marks a new form of politics, reminiscent of and no doubt rooted in an earlier 
radical tradition in Bolivia, but effectively picking up on high levels of political 
discontent in large sections of the population. 

Evo Morales and Felipe Quispe, very different leaders with 
distinctive discourses, have some common roots in earlier campesino politics. Both 
were involved in the development of the katarista movement (and subsequent 
political incarnations) in the 1970s, the CSUTCB (peasant union founded in 
1979), and in the case of Evo Morales, the organization of the coca-producers 
movement against the coca-eradication programs of the 1980s and ’90s. The 
katarista line has been more forcefully advanced by the Quispe faction of the 
campesino movement, through the development of a more explicitly indigenista 
movement advancing indigenous rights. Evo Morales, first through the ranks of 
the ASP (Asamblea por la Soberanía de los Pueblos), and then through the MAS, 
has been a forceful voice for the cocalero movement, but has also espoused a 
broader discourse of social justice. In the general election of 2002 Evo Morales 
obtained 1.5% fewer votes than Sanchez de Lozada of the MNR, raising alarm 
bells among the urban political elites. 

The rise of both these political formations responds to a series of factors. 
Firstly, it is important to note that they are not conjunctural political phenomena, 
but are rooted in the complex (and conflictive) campesino movement that gathered 
momentum with the foundation of the CSUTCB. Both Evo Morales and Felipe 
Quispe, in different ways, have won their support in hard-won grassroots power 
battles over several years. Secondly, they voice the demands of potentially large 
political constituencies (bearing in mind, however, the high levels of electoral 
volatility in the Bolivian population): the politically disenfranchised and 
marginalized rural poor. Thirdly, especially in the case of Evo Morales, they have 
tapped into the widespread disaffection with the traditional political class. 

In both cases, success has been the result of a combination of political 
strategies, which range from social protest mobilizations to formal electoral 
competition — and both with effective results. On various occasions, social protest 
has led to the government altering policy decisions — at least in the short run. 
More worrying for the traditional political class have been the electoral gains of 
what they view as “anti-system” parties, which are more difficult to discredit, 
especially in the case of Evo Morales. 

Thus, the configuration of political forces has shifted significantly over the 
last decade. It remains to be seen whether the more recent political formations 
represent a lasting feature of the political landscape. At the same time, though, 
it is evident that the traditional political parties, already by the 2002 election, 
but especially in the aftermath of the October 2003 events, are in the throes of 
deep internal crises. Also, it is worth stressing that political parties of any kind 
are viewed with suspicion in contemporary Bolivia. 


Changing Dynamics of Social and Political Actors and the Political 
System 


Recently the political landscape and the balance of powers involving various 
political forces have shifted. Different forms of political exchange have evolved 
in tandem with these changes. On the one hand, coalition politics has evolved. 
In 1985, the imperative was to secure a large enough congressional majority (of 
50%) to ensure minimum levels of governability and the possibility of policymaking. 
By the mid-1990s, the governing coalitions sought to be large enough 
(to control two-thirds of congress) for two reasons. Firstly, large coalitions were 
necessary to secure the passage not just of legislation, but also of constitutional 
reforms. Secondly, the control of two thirds of congress would give the ruling 
coalition the power to control appointments to the new oversight institutions that 
had proven to be too independent, namely the CNE, the Defensoría del Pueblo, 
and the constitutional tribunal. Thus, there has been a process of increasing 
isolation of ruling political parties from the electorate. 

At another level, new political strategies emerged because of the mobilization 
of new social forces and protest incidents. It is important to stress that protest 
politics has not always been the result of orchestrated efforts by specific social 
movements, but rather the consequence of spontaneous social mobilizations 
against specific government measures. Often, the more structured political 
movements have latched on to these protests at a later stage. The October 2003 
revolt against the Sanchez de Lozada administration in protest against the sale of 
gas to Chile, and in terms favorable to the gas companies according to the Ley de 
Hidrocarburos (Gas Law), was the culmination of a succession of popular protests 
which escalated from 2000 onwards, but which responded to different issues and 
demands. Overall, though, they express increasing opposition to the economic 
model, and disenchantment with the institutions of political representation. 
This assortment of concrete incidents of social protest, and more long term 
mobilization strategies on behalf of increasingly organized social movements, 
particularly regarding rural issues and coca production conflicts, increasingly 
came to encompass more nationwide concerns and frustrations. 

The “water war” in Cochabamba in 2000, in which the unpopular attempt to 
privatize water and raise the price of consumption led to street protests, was a 
dramatic, but by no means isolated, example of the increasing recourse to protest 
politics as a way of confronting unpopular government measures. The government 
was forced to back down. Equally, the confrontation in February 2003 between 
members of the police, who protested against a huge hike in income tax, and 
members of the armed forces left a high number of deaths, leading ultimately to 
the government reversing its tax decision. Several features stand out. Firstly, both 
the Banzer and Sanchez de Lozada administrations increasingly sought recourse 
in repressive measures. Secondly, protest politics is seen as effective where other 
channels of political mobilization fail. Thirdly, protest politics has not always 
been the result of organized political strategies. 


Challenges to the Complex Task of Constructing a Democratic Polity 


The Bolivian case is an example of political development that exhibits 
contradictory trends. Ultimately, these amount to a complex web of institutional 
and political processes which, at some level, indicate degrees of democratic 
deepening, and, at another, show the retrenchment of illiberal and, at times, 
downright authoritarian practices. 

Thus, what we have is neither a linear nor cumulative process of democratization. 
Instead we observe multilayered processes of democratization in which there are 
some advances some of the time in certain areas and reversals in others. The 
deterioration of democratic forms seriously undermines the process by which 
democratic values can be strengthened in society. 

Linked to this, it is difficult to establish which social groups and political 
formations (old or new) can be categorized as consistently democratic, or 
consistently undemocratic or anti-systemic. A cursory glance at Bolivian media 
seems to suggest that there exists a clear identification of these categories. 
Perhaps the picture is more complex than this. Most political formations show 
a combination of both: at times, they exhibit democratic attitudes and even a 
genuine commitment to democratic rules of engagement. Yet many, if not most, 
will exhibit simultaneously an assortment of illiberal practices and undemocratic 
behavior, albeit in varying degrees and with different levels of intensity. 
(Undemocratic attitudes, for instance, might prevail at times of heightened 
political conflict.) 

The point here is that the multi-layered effect of different and overlapping 
levels of democratization (with advances and reversals) applies also to the internal 
dynamics and development of the different social and political actors competing 
within the system. This produces a disjointed, if relatively stable, polity (even 
following the October 2003 events) in which illiberal practices and undemocratic 
behavior patterns are deeply rooted in the political practice of the competing 
political forces. At the same time, this co-exists in a complex way with genuine 
expressions of commitment to democratic rules of the game and a widespread 
acceptance not only that democracy is the only game in town, but also that it 
is a desirable goal. 

Another problem lies with the degree of agreement around the meaning of 
democracy. Here too the complexities of a society as culturally and ethnically diverse 
as Bolivia's, compounded moreover by extreme inequalities and growing regional 
divides, yield a broad spectrum of definitions and meanings regarding the rules 
and nature of democratic government. It is not necessarily the case that liberal 
democracy contained within Dahl's model of polyarchy responds to the demands 
and expectations of the various political groupings and social groups. Moreover, 
in the Bolivian case, there is a long-standing discourse of radical participatory 
democracy rooted in the union politics of the revolutionary period, and picked 
up in a different format in the communitarian language of indigenous politics. 

And all of this is happening in the context of a globalizing economic setting 
in which Bolivia's room for maneuver is limited. The structural legacy of extreme 
poverty and social hardship is problematic for the development of a positive 
relationship between state and society, or a culture of trust in democratic 
institutions that have yet to prove themselves. 

In this climate of adversity, it is evident that smooth democratic consolidation 
is the least likely outcome. Moreover, the impending Constituent Assembly 
can introduce an even greater sense of uncertainty than that produced by the 
successive political and social crises of recent years, finally culminating in the 
October 2003 events. At the same time, while the incidence of political violence has 
increased, it is important that democratic institutions have not broken down, and 
that constitutional forms in a general sense are being respected. The uncertainties 
of Bolivia's political development, both in the short and long-term, are still 
significantly contained within the confines of a democratic discourse. 


Notes 


1. Whitehead's analysis of the “democratic viability” in Bolivia precisely points to the need 
to view the Bolivian political process with a more flexible political lens than that allowed by 
the transitions paradigm (Whitehead, 2001). 


2. Labels are useful to distinguish real political processes from idealized models, but, as 
Carothers suggests, maybe we need to move away from the model itself in order to develop 
a less constrained understanding of the political dynamics at play (Carothers, 2002). 


3. Key coalitions since 1985 have been: a) Pacto por la Democracia in 1985 under the Victor 
Paz Estenssoro presidency of the MNR which included the MNR and ADN; b) Acuerdo 
Patriotico between 1989 and 1993 under Jaime Paz Zamora of the MIR, which included 
the MIR, ADN and CONDEPA; c) the first Gonzalo Sanchez de Lozada government of 
the MNR included a pre-electoral alliance with a katarista movement, and post-electoral 
coalition politics with an assortment of left-wing and populist parties: the MBL (an offshoot 
of the MIR) and UCS, the latter a populist party led by a beer company owner; d) the 1997 
presidential victory of Hugo Banzer of ADN, with an alliance made up of ADN, MIR, UCS 
and CONDEPA; e) finally, the coalition government behind Gonzalo Sanchez de Lozada's last 
government which included the MNR, the MIR, and the NFR — a new populist party. 


4. Much has been written on the particulars of the Bolivian hybrid form of presidential system, 
as a consequence of the format for electing the executive. See among others, Gamarra and 
Malloy (1996), Gamarra (1994), and Domingo (2001). 


5. In this regard 1985 was an exception in the sense that the coalition government, made 
up predominantly of MNR-ADN, was not the congressional majority which elected Paz 
Estenssoro of the MNR. He was elected with the support of the MIR. 


6. See Lazar (2004) for an interesting discussion on the structures of clientelism at the 
grassroots level in Bolivia. Grindle (2003) examines the use of clientelism to ensure the 
smooth passage of radical reforms in the 1990s. 


7. See Barreda and Costafreda (2003) and Lazarte (2003) for a discussion of electoral reforms 
in Bolivia. 


8. A political party law passed in 1999 has introduced, at least on paper if not in practice, 
regulations to ensure greater internal democracy in political parties, and greater scrutiny 
on party financing. It has also introduced a gender quota system for electoral and internal 
positions within political parties. See Barreda and Costafreda (2003) for a discussion of the 
law. 


9. It is worth recalling that political party dominance in the game of political power brokerage 
is a feature of the last twenty years. Before 1985, politics was in the hands of corporatist 
structures — be they military or union. Even the MNR operated not as a democratic and 
competitive party but as a corporatist body. 


10. See Lee Van Cott, Albó (2002), and Gray-Molina (2001), amongst others, for analyses of the 
impact of the LPP in Bolivian politics. 


11. See for example, Albó (2002) for a critical discussion of the LPP. 


12. Assies and Salman (2003). 


13. The most spectacular of these took the form of the “capitalization” program, by which 
proceeds from the sale of public enterprises would be used to establish a social fund. 


14. Mainwaring and Shugart have characterised the Bolivian political party system as 
“inchoate” (1995). Nonetheless, until the 2000 election, it showed signs of a certain degree 
of stability (certainly in comparison to neighbouring countries) while at the same time 
allowing for renewal and the incorporation of new political forces. 


15. It is important to note that at the level of general elections, the prospect of victory 
by Evo Morales caused widespread alarm among urban sectors and some capital flight, 
pointing to the limits of political pluralism. Democracy is allowed within limits — this was 
also clearly expressed by the U.S. ambassador who warned against the perils of voting for 
the MAS candidate. 


16. See Whitehead (2001, 2003) for a discussion of long-term cyclical patterns of top-down 
constitutional projects followed by bottom-up mobilisation processes, the most dramatic 
of which was the 1952 National Revolution. See also the chapters in Grindle and Domingo 
(2003). 


17. See Assies and Salman (2003), Calla (2003), and Van Cott (2000), amongst others, for a 
discussion of the emergence of these new political movements. 


Abstract 


The electoral victory of Luís Inácio "Lula" da Silva in the presidential elections 
of 2002 epitomized two decades of social and political transformations in Brazil. 
Nevertheless, instead of launching an alternative mode of doing politics, the program 
of the Workers' Party affirmed a state logic with a view to gradually updating 
the economic structure of Brazilian capitalism by means of successive transitions 
directed by the state, avoiding the active intervention of the subaltern classes in 
this process. In this logic are inscribed fiscal discipline, social security reform, and 
giving value to private pension funds. Such funds established a bridge that makes 
viable the organic alliance of a union bureaucracy, now the manager of these funds, 
and globalized financial capital. 


Introduction 


The election to president of former metalworker Luís Inácio "Lula" da Silva 
in November of 2002 reflected a profound desire for change in a Brazilian 
society exhausted by more than a decade of neo-liberal experiments. This desire 
for transformation was expressed so strongly that social and political analysts 
commonly referred to it. It is not only the desire for economic change, and this is 
where we begin to notice a difference among analysts. It is also a desire for social 
and political transformation, and this is perhaps more significant for the present 
analysis. The vote for Lula expressed for millions the possibility of transforming 


* Alvaro Bianchi is Professor, Department of Political Science, State University of Campinas 
(albianchi@terra.com.br) and Ruy Braga is Professor, Department of Sociology, University of Sao 
Paulo (ruy.braga@uol.com.br). This article was translated by Professor Robert Anderson, University 
of North Carolina at Chapel Hill. 


© The University of North Carolina Press Social Forces, June 2005, 83(4):1745—1762 


politics into something that can be done in the first person plural, “us,” and no 
longer exclusively in the third person, “them.” 

The symbolism of this winning candidacy is extremely strong. Lula is a 
Northeastern migrant who fled the hunger that overran his native town of 
Garanhuns, in the interior of the state of Pernambuco, together with his mother 
and seven siblings. He is a metal lathe operator, who left school after the first few 
years. He is a unionist who in the late seventies inscribed his name forever in the 
history of Brazil. Voting for him was, for many, an exercise in recreating social 
identity and of creating a political consciousness. It meant affirming something 
that is usually hidden and denied and finding oneself again. It was the revenge 
of the defeated, of the humiliated, of the despised. 

However, the very act of reconstructing the identity of the subaltern classes 
was only possible insofar as Lula's electoral strength expressed a change in the 
relation of forces in Latin America and, at the same time, was the founding 
moment of this change. A series of events surfaced powerfully and with 
multiple significance in Latin American politics in 2002: the defeat of 
the military-business class coup in Venezuela, the social rebellion in Argentina, the 
electoral performance of Bolivian peasant leader Evo Morales, and the victory of 
ex-colonel Lucio Gutiérrez in Ecuador. On the one hand, these events indicate 
the weakening of the neo-liberal project on our continent, resulting in part from 
its economic exhaustion, and, on the other hand, indicate the reactivation of 
social movements after the retraction of the '90s. 

Brazil did not find itself at the front of the continental transformation in 
the relation of forces. In going against its own historical dynamic, its rhythm 
was slower. It was the last country to get onto the neo-liberal boat, and it was 
also the last one to try to get off. In fact, the social radicalization in Argentina, 
Ecuador, Bolivia, and Venezuela preceded the events in Brazil and took the 
form of direct action by the social movements. Even when this radicalization 
was expressed through electoral victories in these other countries, they were 
preceded by important social agitation that opened the way for the rise of political 
leadership directly or indirectly identified with these movements. This was not 
the case in Brazil. 

Paradoxically, the election of a candidate whose personal trajectory is deeply 
identified with the union and political movements of the Brazilian working class 
occurred in a context in which the reactivation of these movements had not yet 
happened. It expresses, therefore, political changes that took place slowly below 
the surface of society: a profound weakening of the neoliberal model, a loss of 
faith in traditional politicians, and a silent revolt against the dominant classes 
that prospered amid the increasing misery and social inequality. 

Even so, as we were saying above, the electoral victory is also a founding 
moment of the present relation of forces. It can allow these movements, 
which had been taking shape underground, to accelerate and break through to 
the political surface, and it can transform the potential energy of these social 
movements into the mechanical energy of change. The collective identity created 
around the figure of the president-elect can put hope into motion, transforming 
it into struggle fed by the deep social resentments of the subaltern classes, by the 
desire for change, and by the perception that these classes have of the potential 
change in the relation of social forces. 

The social movements interpret the electoral victory as the beginning of an 
era of change. Thus, in its “Letter to the Brazilian People and to President Lula,” 
the National Steering Committee of the Landless Rural Workers’ Movement 
(Movimento dos Trabalhadores Rurais Sem Terra, or MST) stated: 


We have the opportunity at this time to carry out the historic task of 
implementing a true agrarian reform in order to democratize access to 
land and to eliminate hunger, unemployment, and social injustices. We 
call on all workers, male and female, on Brazilian society in general, to 
organize, to mobilize, and to help us carry out agrarian reform. A more 
just and equal Brazil is possible. And this is the time! (MST 2002) 


This is not, however, the only voice heard. The National Board of the Central 
Workers’ Organization (Central Unica dos Trabalhadores or CUT) has not made 
any call to action or mobilization. Its most important resolution was to form 
“six CUT working groups (tax and fiscal reform, union and labor reform, social 
security reform, agrarian and agricultural reform, employment and income, and 
the state and public policy),” which will have the goal of presenting proposals to 
the new government and of participating in all of the "forums of negotiation." 


From the Logic of Difference to the Logic of the State 


Indeed, the electoral victory of Lula and the Workers' Party (Partido dos 
Trabalhadores, or PT) epitomized more than two decades of social and political 
transformations in Brazil. Great transformations. In these two decades historical 
time was accelerated, fragmented, convulsed and broken, creating a country that 
would have been recognized only with difficulty in the preceding decades, even 
though its social formation was marked with centuries-old continuities. Such 
temporal acceleration was not only experienced in the great metropolises, where 
it was to be expected, but also in the rural milieu, where the slow tempo of 
natural cycles, with its seasons, its days and nights, was shaken by the intrusion 
of politics, with its own rhythm. 

Historical time gained thrust and speed from the unusual decision to seize the 
moment. On 12 May 1978, the workers at the Saab-Scania truck factory, in the 
“São Paulo ABC” (the industrial zone formed by Santo André, São Bernardo, 
São Caetano and other municipalities in the metropolitan São Paulo area), 
decided to shut down the machinery and cross their arms. Demanding a wage 
increase of about 20%, the two thousand metalworkers of the company began 
a landmark strike. They broke the cadenced rhythm of the clock, they stopped 
the tempo of production, and with this, they appropriated political time. 

The work stoppage by the Saab-Scania workers was the flashpoint for an 
intense workers’ rights movement that would spread throughout the region and 
beyond. In the course of that year, about half a million workers carried out 
strikes, and in the following year the figure reached 3,241,500 strikers, giving 
rise to a long cycle of labor struggles and organization of the working class that 
lasted practically without interruption until 1989. 

The movements that took place in the Sao Paulo ABC at the end of the 1970s 
started a broad struggle against labor overexploitation and repressive political 
legislation that subordinated the union movement to the state and restricted forms 
of workers’ representation (Antunes 1992). Marked by their spontaneity and by 
their going on the offensive, these strikes launched new trade unionist and political 
practices. Rejecting the social pacts and immobilization that had characterized 
a good part of the Brazilian Left up until then, the ABC strikes created a social 
movement founded on social confrontation and class independence. 

The birth of the PT is indissolubly connected to this strike movement. By 
the middle of 1978, the newspaper Versus had already begun to circulate the 
proposal to create a party of workers without bosses that would defy the military 
dictatorship then in power. Such a proposal gained voice in the thesis that 
the Santo André Metalworkers’ Union presented at the 9th Convention of the 
Metalworkers of the State of São Paulo, held in the city of Lins in January of 
1979 (PT 1998:47-48). 

On 1 May 1979, Brazilian Labor Day, a “Letter of Principles of the Workers’ 
Party,” was circulated. It reaffirmed the following proposal: “Repudiating all 
forms of political manipulation of the exploited masses, including, above all, 
manipulations particular to the pre-64 regime, the PT refuses to accept among its 
ranks representatives of the exploitative classes. In other words, the Workers’ Party 
is a party without bosses!” (PT 1998:53) (By this, the Letter of Principles referred 
to the paternalistic and corporatist policies of the pre-dictatorship republic.)? 

In its early years, the strategic definitions of the PT’s political project 
continuously manifested extreme generality and ambiguity. The “Program of the 
Party,” approved at the National Founding Meeting in October of 1980, proposes 
building, in the struggle against the repressive regime, “an alternative of economic 
and political power, dismantling the repressive machine and guaranteeing the 
broadest freedoms to workers and the oppressed, that may be supported in 
the mobilization and organization of the popular movement and that may be 
an expression of its right and will to decide the destinies of the country.” The 
“Program” defined this alternative power as constitutive of a society without 
exploited or exploiters and stated that its construction would go against the 
interests of major national and international capital; yet, it said little about the 
ways of arriving at that power (PT 1998:68-69). 

In its early years, what gave the Party its strength was not its strategic plan 
but rather its social force. What attracted union and youth movement militants 
was the possibility of acting effectively through political participation in a party 
that did not bear the heavy stigma of immobilization and bureaucratism that 
had drowned the Brazilian Left before the 1964 military coup. In this sense, the 
PT consolidated the process of the Brazilian working class's social and political 
reconfiguration. In the 1970s, a multiplicity of social movements contested the 
old forms of institutionalization of the politics of the subaltern classes (see Sader 
1995). What made the new party promising was its logic of difference (Keck 
1997). 

The construction of the Workers' Party represented, thus, the founding of a 
practical classism — that is, an empirical class action, without clear programmatic 
definitions — the political-organizational translation of that spontaneous strike 
movement that had shaken the military dictatorship at the end of the 1970s, 
profoundly altering the shape of the exercise of politics in Brazil (Bianchi 
2001:106-116). The marks of this practical classism were its almost instinctive 
rebuff of politics of collaboration, agreements and alliances with the bourgeoisie, 
its affirmation in its birthplace of an anticapitalist calling, and a belief in the 
miraculous power of the “bases of the party.” This practical, spontaneous 
dimension was what gave the Workers’ Party the vigor and momentum capable of 
renewing the political task of the working class and, in reaction, of the dominant 
classes themselves. It was the political affirmation of a force unimagined and 
unexpected up to that time. 

The spontaneous practices of this classism soon became a theoretical 
spontaneity. The absence of more precise strategic definitions was credited to the 
plasticity of the very movement and was presented as the greatest virtue of the 
Party. From that came the insistence that the PT was not born “ready made” and 
that its program arose “from the political practice of its social bases” and would 
be fleshed out “by the political practice of the workers.” (PT 1998:70-71) 

In Lula’s speech at the 1st National Convention of the Party in 1981, this 
denial of theory was made explicit. Referring to those who asked about the 
PT’s ideology and conception of socialism, Lula responded that, “those questions 
only serve to express mistrust in relation to the political capacity of the Brazilian 
workers to define their own path.” Further on in the same speech he presented 
what would be his own conception: 


The socialism that we want is defined by all of the people, as a concrete 
requirement of popular struggles, as a global political and economic 
response to all of the concrete aspirations that the PT might encounter. 
It would be very easy, sitting here comfortably in the precincts of the 


1750 / Social Forces 83:4, June 2005 


Senate of the Republic, for us to decide on one definition or another. It 
would be very easy and very wrong. The socialism that we want will not 
be born of a decree, neither ours, nor anyone's. The socialism that we 
want will be defined in the daily struggles, in the same way that we are 
building the PT (PT 1998:114). 


The constitution of a new societal project was reduced, thus, to a spontaneous 
movement. In the conflict of rationalities that takes place in the class struggle in 
action, what the PT had to offer was only the act of struggle itself. This absenteeist 
attitude in the conflict of rationalities did nothing more than to create a passive 
subalternity of the Party in the ideological terrain, feeding an increasingly strong 
theoretical eclecticism. Without establishing a conception of an alternative world, 
it imprisoned those energies within a trade-unionist horizon far short of a societal 
project capable of affirming a new order. The political identity the Party had 
constructed was limited. It had not been translated into a theoretical identity 
that might understand in an innovative and radical way the problems posed by 
the complex Brazilian and Latin American reality. It limited itself to some vague 
phrases, the reproduction of a shared theoretical sense. It did not produce new 
knowledge; it reproduced old truths. 

This theoretical spontaneity got its strength from the social energy of the cycle 
of strikes that began in 1978 and was limited to that energy. But the practical 
classism that fed and energized the reorganization of the subaltern classes' political 
and union movement was short-lived. It could not withstand the weight of the 
PT's electoral victories. The Party had elected only two mayors in the first election 
in which it participated in 1982. In 2000, it elected 2,485 members of municipal 
councils and 187 mayors, six of whom headed state capitals, including the city 
of São Paulo, the largest and most important in the country. In the national 
elections, the growth was also impressive: 8 federal deputies in 1982; 16 in 1986; 
35 in 1990; 49 in 1994; 58 in 1998; and 91 in 2002. 

The PT's rise within the state apparatus was accompanied by the strengthening 
of a party bureaucracy increasingly removed from the Party's bases. By the 
mid-1980s, the groups that should have organized the base militants had shown clear 
signs of atrophy. By the beginning of the 1990s, they simply no longer existed, 
having been substituted by the candidates' electoral committees, thus reproducing 
the personalization of Brazilian politics, dictated by votes for a name and not 
for a party. 

This transformation of the Party is graphically illustrated by the growing 
presence of political employees and by an accelerated decrease in the number 
of unionists at the meetings of the PT. At the 11th National Meeting, held in 
1997, 60% of the delegates were professional politicians: 18% were members of 
congress; 13% were congressional aides; 9% were paid militants of the social 
movements; 8% held positions of trust in the state and municipal governments; 
6% were paid directors of the PT; 2% were employees of the Party’s internal 
constituencies; 1% were Party aides or employees; 1% were mayors or governors. 
Only 31% were not professional politicians (Garcia 2001:93-105). 

The large space that this body of political and public employees occupied 
within the Party apparatus took its toll. Bit by bit, the practical classism diluted 
its social points of reference and retained only its pragmatic dimension (for a 
critique of political pragmatism in Brazil see Lessa 1995). The trade-unionist 
horizon had its social content modified. The logic of difference that had oriented 
the early years of the trying life of the Party gradually yielded to the logic of the 
state sustained by the large number of members of congress, mayors, governors 
and their cortege of aides. Such logic rested on the concept of an ambivalent state, 
supposedly distanced from determinations of class, the bearer of a heightened 
capacity of adaptation to the new content that its directors had incorporated. 
(For a critique of the ambivalent state, see Braga 2003.) 

Without running into ideological and theoretical barriers, the raw pragmatism 
of realpolitik flooded party life, impregnating the resolutions of its meetings and 
conventions, and especially the political practice of its directors. Adherence to a 
state logic was not, however, simple and mechanical. There were great conflicts 
within the Party, with the expulsion of at least one important current — exactly 
that which, through the newspaper Versus, had proposed the creation of the PT. 
There was controversy over the institutionalization of statutory mechanisms 
that restrict public expression of differences and the censorship of the current 
of the so-called PT Left. But after just a few years the state logic became largely 
overwhelming. 

When the Workers’ Party announced the beginning of a strategic debate at its 
1st Convention, held in 1991, it could no longer be the translation of practical 
classism. The defeat of the Lula candidacy in 1989, the defeat of the civil servant 
strike in 1990, and the first steps in the implementation of a neoliberal model 
had dissipated the energies of the social movements and paved the way for the 
consolidation of a state logic of management of politics within the party. 

Already in the debates leading up to the 1st Convention, the consolidation 
of this logic asserted itself with a force and clarity unaccustomed in the PT 
tradition. The PT's Secretary for International Relations, Marco Aurélio Garcia, 
presented its consequences with crystal clarity: “Political democracy cannot only 
be understood as a way of arriving at social democracy or a better position from 
which to fight for it. Political democracy is an end in itself: it is a strategic value 
as well as an enduring one. If this thesis is social democratic, patience: let us be 
social democrats.” (Garcia 1990) 

The final resolution of the Convention, titled simply Socialismo, was marked 
by innumerable amendments introduced by the Leftist constituencies, but even so 
it lost its fundamental character: democracy sans phrase was defined as a universal 
value. Socialism was conceived as a combination of “state planning and a socially 
oriented market,” and, finally, power to work miracles stopped being of the “bases” 
— the rank and file — and passed to a “state [that] exercises regulating activity 
over the economy through its own enterprises and mechanisms of control of the 
financial system, of tax policy, of prices, of credit, of antitrust legislation, and of 
consumer, wage earner, and small property owner protections.” (PT 1998:502) 

The theme was taken up again recently on the occasion of a series of debates 
that the Workers’ Party held on socialism. In a contribution that sums up to a 
great extent the majority view of the Party, economist Paul Singer developed a 
proposal of market socialism: 


The function of the socialist market is to make viable the freedom of 
initiative of individuals or groups with new ideas or plans. They should be 
encouraged to offer their products without restrictions and without having 
to get permission from some planning jurisdiction. [...] Competition in 
these cases should last until the consumers have decided to adopt the new 
products or stick with the old ones (Singer 2000). 


In his elaboration, the argument does not clarify either why competition should 
be held in check or why it would not give rise to relations of exploitation. 


Radicalizing Singer's argument, Lula registered in this debate the contradiction: 


Human beings are eminently competitive. To the extent 
that competitive capacity of the human being is blocked and everyone is 
set up to earn the same in a factory, that factory's chances of success are 
cut short. People are leveled from below and not from above. Socialism 
has not managed to solve this problem (Singer 2000:72). 


Evidently, there is no viable program in the absence of a subject able to bring 
it about. For this reason, at the heart of the PT project, next to the market we 
find the state. 

The solution to the problem introduced by Lula takes up once again the 
theme of state protagonism: 


The market only functions if there is a very strong state regulating that 
market and obliging it to fulfill some social contracts. The market alone 
does not resolve this. To make the market compatible with a regulatory 
state, able to guarantee people's needs, would be the ideal. How to do 
this is what is put before the PT (Singer 2000:73). 


The arguments as well as the solution presented can only remind one of Proudhon. 
Competition, and therefore the market, is presented as the mechanism capable 
of affording equality and social development. There might be, however, the bad 
side of competition: its most likely effect is to turn against those that participate 
in it. Proudhon's solution is analogous to the PT's. The French philosopher 
stated: "here, the matter of destroying competition would not be appropriate, 
something as impossible as destroying freedom; rather, it is a matter of finding 
the balance — I would say, to a good degree by policing." 

A naturalized market and an essential, permanent human trait (competition) 
together form the situation upon which socialism should be built. This is nothing 
other than the exorcism of the evil of the market by means of the state, recovering 
that which is potentially good: the opportunity to allow choices based on opinions. 
The marketplace — that is the place of freedom. The Proudhonian regulatory 
state is its guardian. 

Defined in such a way, the PT's socialism is transformed into a solid base for 
a program of capitalist government that has as its goal overcoming the crisis of 
the neo-liberal model. It is not a matter of building the future but managing 
the present, removing what is bad or undesirable. A PT government makes 
this exorcism of evil feasible, and it values what good the market presents, and 
humanizes it. Guido Mantega, the new Minister of Planning and one of Lula's 
principal aides during the electoral campaign, summed up this objective thus in 
response to what a Lula government would hope for: 


I would say that the PT is a party of the modern Left, similar to the 
Socialist Party of France, to the English Labor Party, to the Italian Left. I 
would put it on this list of parties that aspire to and long for a capitalist 
society because socialism is something totally undefined today; it no 
longer exists. We do not seek a more efficient capitalism, rather one made 
human (Mantega 2002). 


Just as in Proudhon, the dialectic is mutilated, the immanent contradiction 
becoming something that can be externalized and eliminated through state 
management. In this way the dialectic is reduced to a game of good/bad 
oppositions, capable of being eliminated through suppression of its poles. So 
suppression (as opposed to dialectical overcoming) through political cancellation 
of the bad side would permit the recreation of the situation on new bases. The 
result of this operation is a constant reproduction of the previously existing by 
means of a process of perfection and harmonization of the real. Let us recall 
what Gramsci said about the Proudhonian “dialectic”: 


There is in Proudhon a mutilation of Hegelianism and dialectic, as much 
as in the Italian moderates; therefore, the critique of this politico-historic 
conception is the same, always alive and current, as that contained in The 
Poverty of Philosophy. The philosophical error (of practical origin!) of such 
a conception consists in the fact that in the dialectic process it is presupposed 
“mechanically” that the thesis should be “preserved” by the antithesis in 
order not to destroy the process itself, which therefore is “foreseen” as 
an infinitely, mechanically, arbitrarily preset repetition (Gramsci 1977: 
1220-1221). 


If as a method we have a mutilated dialectic, as a political program we have 
passive revolution. This and no other is the content of the PT government: the 
gradual updating of the economic structure of capitalism by means of successive 
transitions commanded by the state, avoiding the active intervention of the 
subaltern classes in the process. Updating and not overcoming capitalism, as 
Minister Mantega makes clear in his declaration. Such a strategy reproduces, 
under a new political command, the history of Brazilian capitalist development, 
marked by permanent passive transformations carried out by a guiding state. Far 
from making real the hope of Brazil as a “country of the future,” such processes 
have done nothing other than recreate the past through the management of the 
present. 


Social Liberalism as “Passive Revolution” 


The perspective of passive revolution outlined above oriented the Programa de 
Governo 2002 Coligação Lula Presidente (PT 2002). Stripped of any reference 
to socialism (the word is not present in the 88-page document), the PT's program 
for exiting the neo-liberal crisis rested mainly on the broadest proposal for 
implementation of the “new social contract” under the tutelage of a Proudhonian 
state of the social liberal sort. In the future government, the social liberal state 
would assume the role of court of final appeal, guaranteeing, supposedly, the 
mediation between the imperatives of the market and the social demands, between 
the interests of the dominant and the subaltern, through the intermediary of the 
new social contract: 


The new model cannot be the product of unilateral government decisions, 
such as occurs today, nor will it be implemented by decree, in a voluntarist 
way. It will be the fruit of a broad national negotiation that must lead to 
an authentic alliance throughout the country, to a new social contract, able 
to ensure growth with stability. A premise of this transition will naturally 
be respect for the country’s contracts and obligations (da Silva 2002). 


With the diffusion of public policies appropriate to the action of a developmentalist 
manager, the state would be the moderator of conflicts, the moving force of 
“hope,” and the economy would no longer be thought of as an autonomous 
instance seeking to incorporate the working class peacefully. The social liberal state 
would shape the universe of the commodity fetish at the expense of regulatory 
initiatives over prices, wages, investments, subsidies, etc. As is pointed out in the 
Programa de Governo 2002, “the huge task of creating an economic alternative to 
confront and beat the historical challenge of social exclusion requires the active 
presence and regulatory action of the state over the marketplace, preventing the 
predatory behavior of monopolies and oligopolies.” (PT 2002:8) 

Upon stabilizing the contradictory, integrating elements of the subaltern 
classes into its apparatus, the social liberal state would be able to build a 
correlation between economic objectives and political objectives, guaranteeing 
the circularity of public policies that would rely on the economic to promote 
the social. The expansion of the domestic market would become the means of 
rescuing the economy from distress. The authority of the state would find itself 
immersed in the economy because of the management of global demand, to 
the extent that the coercive forces of the market would be preserved within the 
social liberal state: 


The state cannot limit its actions to administering the short term and 
emergency issues, but it must plan for a strategic long term vision, 
articulating interests and coordinating public and private investments that 
pour into sustained growth. This implies reactivating economic planning, 
ensuring a farther horizon for investments (PT 2002: 42). 


However, expansion of the domestic market through the intermediary of the 
social liberal state is reduced only to expansion of the financial market. The 
orientation of the Lula government strengthened the Brazilian stock market by 
instrumentalizing wage and pension funds. As was already clear in the government 
program: “pension funds have established themselves as an increasingly important 
mechanism of mobilizing wage earners’ savings. This would also create the 
opportunity of directing additional resources to special credit institutions, 
reinforcing their funds available for loan.” (PT 2002: 43-44) 

Causing the Brazilian stock market to become an effective instrument for 
leveraging productive investments illustrates in great measure the central concern 
of the Lula government: the stimulated involvement of Brazilian workers in 
the “strengthening of the future domestic market.” The PT proposal of the 
financialization [financeirização] of wages vindicates the engagement of pension 
funds, particularly the resources coming from wage funds like the Time of 
Service Security Fund (Fundo de Garantia por Tempo de Serviço or FGTS), but 
also the use of stock investments with supposed worker participation in the 
management of actually mobilized or potentially usable resources. 

It is worth recalling that the Fernando Henrique Cardoso government (1994- 
2002) was a pioneer in the use of resources from the wage funds for capitalization 
of state companies like Petrobrás [the national oil company] or privatized firms 
like the Companhia Vale do Rio Doce (the formerly semipublic iron and steel 
company). That is, the strengthening of the Brazilian capital market through 
the use of resources from wage and pension funds joins a movement already 
begun by the previous government that is not very novel in conceptual terms. 
The political novelty derives from the government's incorporation of the proposal 
from a party on the Left of financialization of the wage funds with the goal of 
making social security reform viable. 


Fiscal Discipline and Regressive Income Distribution 


Lula's Social Security reform continued the changes begun during the Cardoso 
government with a delimitation of the benefits ceiling for the private sector. The 
Lula government's reform took aim at the public sector. Proposed Constitutional 
Amendment 40 of 2003 (PEC 40/03) directed to the National Congress by the 
Lula government limited the retirement benefit ceiling and raised the minimum 
retirement age to 55 for women and 60 for men, requiring social security 
contribution of retirees and pensioners, and creating a system of complementary 
private retirement funds, among other changes in the legislation formerly in 
effect. The changes hit workers who were then in public service, eliminating 
rights already acquired and creating an exception for the judicial sector. Civil 
servants organized a strike that contested the government's policy, although it 
has not been able to overturn it. 

The defenders of the reform mustered two arguments: the first, fiscal; the 
second, social. The fiscal argument did nothing more than reproduce the discourse 
of the Cardoso government: public social security is in the red (for a critique of 
the fiscal argument in the Lula government see Andrade 2003:21-25; Granemann 
forthcoming). In 1995, Proposed Constitutional Amendment 33 (PEC 33/95), 
reforming the social security system, was founded on the thesis of structural 
bankruptcy of the system. PEC 33/95 did away with the retirement for time of 
service, raised the minimum retirement age, and established a ceiling of benefits 
for workers in the private sector of R$ 1,200.* Echoing the World Bank report 
(World Bank 1994), the Cardoso government's social security reform proposal 
asserted that "the rapid aging of the Brazilian population would impose a heavy 
fiscal burden on a relatively small work force. It is evident that that fiscal 
horizon is neither desirable nor executable." (Presidência da República 1997) 

The 20th Amendment to the Constitution, approved in 1998 during the 
Cardoso administration, initiated social security reform. In the fiscal argument, 
social security reform should lead to a complementary system of private 
retirement funds through the establishment of pension funds. Such a system 
would unburden the country and, at the same time, would constitute a domestic 
savings able to finance economic growth. What is important to point out is that 
Lula’s government kept the fiscal argument that was made in Lula’s campaign 
platform. 

In his inaugural address, Finance Minister Antonio Palocci was already 
proclaiming the argument: 


The current public account imbalance has to be faced with vigor and 
determination. A planned adjustment of the public budget is essential. 
Thus we are committed to social security reform as a priority of this 
government. What is more, we said in the campaign, we said in the 
transition, and we say now: we will create the primary surplus necessary 
to guarantee the sustainability of the public debt. This is the most direct 
way to reduce Brazil’s risk and interest rates so as to make possible the 
resumption of growth (Palocci 2003). 


In defense of the fiscal demand for reform, Lula has not stinted in openly 
manipulating the social security numbers in order to show it as running a 
deficit. The National Union of Federal Revenue Financial Auditors (Sindicato 
Nacional dos Auditores Fiscais da Receita Federal — Unafisco) and the National 
Association of Social Security Inspectors (Associação Nacional dos Fiscais da 
Previdência Social — Anfip) argued repeatedly in their publications that the deficit 
was forged discounting the allocation destined for payment of the interest on 
the foreign debt: “The total of receipts from social security contributions on the 
payroll shows a collection of R$ 136.9 billion, with expenditures for health, social 
security, and welfare of R$ 105.4 billion. One can see, then, in this year alone, 
a surplus of R$ 31.5 billion from social security was passed back to the federal 
budget to generate a primary surplus.” (Anfip 2003:21) Unafisco and Anfip also 
argued that the impact of social security reform on public finances would be, in 
the middle term, eroded by the reduction in receipts by fiscal renunciation and 
misappropriation, which reveals the fragility of the fiscal argument. 

The second argument is social. Ex-unionist and Social Security Minister 
Ricardo Berzoini has argued that reform was necessary to rebalance the federal 
budget, stopping the financial hemorrhage caused by growing expenses and 
insufficient revenue for the social security system. The alibi was furnished by 
the “excluded”: “Reform is indeed a necessity, but for the neediest population 
of the country, which does not have access to the state's public policies because 
the budget is tied up in other types of expense,” Minister Berzoini affirmed 
(Berzoini 2003:14). 

The social argument is empty. From the government's point of view, the 
resources necessary for social policies must come, then, from cuts in expenditures 
on social rights of wage and salary earners. The redistribution policies would 
proceed thus horizontally, with the transfer of income within the working class 
itself. But social security reform presented in PEC 40/03 does not contain any 
proposal for broadening the number of beneficiaries of social security. There are 
only proposals for reduction of the benefit values. In the face of the contradiction, 
the government added to the final draft an innocuous clause transferring to 
future legislation the creation of a “special system of social security inclusion for 
low income workers.”> Making social security benefits universal, a highlight of 
the structural policies announced in the Zero Hunger (Fome Zero) program, 
became thus a dead letter. 

If it is clear that these compensatory social policies do not expand rights, then 
what is their function? In the fiscal logic proper to “tropical” social liberalism, 
they merely fulfill the function of legitimating the government. They become 
reduced, thus, to the creation of clientelist networks with the goal of reducing 
social pressure and conflicts, since political engineering is a mechanism that 
preserves archaic patterns of political domination in Brazil at the same time that 
it innovates in its forms. The passive revolution à la mode brésilienne in the Lula 
government, instead of seeking its legitimacy in market stability in the forms of 
Latin American 1990s neo-liberalism, seeks to construct it from the articulation 
between market imperatives and social demands that can be channeled through 
compensatory policies (and only through those). 


Pension Funds and Patrimonial Capitalism Brazilian Style 


This analysis of social security reform is fundamental for understanding not 
only the guidelines of government policy but also its social base. We defend the 
thesis that the fiscal argument is fragile and the social argument is empty. If 
such arguments are no more than rhetoric, what are the reasons for the reform? 
These reasons can be found in the constant defense on the part of the government 
of a system of pension funds capable of energizing the stock market and the 
national economy. 

The proposal gained strength with social security reform and the 
establishment, envisioned in PEC 40/03, of a new, private Complementary Social 
Security. Representatives of the CUT and of the Union Force (Força Sindical), 
a union affiliation along neo-liberal lines, insisted in negotiations that the labor 
unions should have the right to establish social security plans for unions or 
professional organizations. Before social security reform was even finalized, the 
Council for Complementary Social Security (Conselho de Gestão da Previdência 
Complementar) authorized the creation of these “union funds.” Right away the 
São Paulo Engineers' Union (Sindicato dos Engenheiros de São Paulo) and the 
alumni association of the Getúlio Vargas Foundation (Fundação Getúlio Vargas) 
sent in their requests for recognition. The Union Force also began to negotiate 
its own fund.° 

According to Minister Berzoini and the Secretary for Complementary Social 
Security of the Ministry of Social Security, Adacir Reis, “President Luiz Inácio 
Lula da Silva has reiterated his commitment to stimulate growth in Brazilian 
social security savings, democratizing workers’ access to pension funds.” (Berzoini 
and Reis 2003: A-3) In Berzoini's estimates, “the expectation is that we can reach 
between 4 and 5 million workers. If each one of 4 million workers saves an average 
of R$ 2,000 per year for retirement, we will have R$ 8 billion in savings.” Then 
he added, “That is a useful amount of money for investment in the country, and, 
best of all, it is domestic money.” 

The estimate of Fernando Pimentel, President of the Brazilian Association of 
Complementary Social Security Companies (Associação Brasileira das Entidades 
Fechadas de Previdência Complementar) was much more daring: “If we add the 
increase in funds from the class and union organs, which are being authorized, 
to the plans that will be offered to new civil servants, we will double the potential 
from R$ 200 billion to R$ 420 billion in 2007,” he stated to the press. Pimentel’s 
calculation is bolder because he included public employees that receive above 
the new social security ceiling and that will be obliged by law to join pension 
funds operated within the system of defined contribution. The system of defined 
contribution envisions fixed payments on the part of workers, but does not 
commit itself to defined benefits at the time of retirement; or rather, what the 
worker receives will depend on the success of the fund in the market. 

These funds thus establish a powerful regressive and vertical 
transfer of income from wage and salary earners to financial centers. In this 
way, the determinants of social security reform are revealed. The reduction in 
fiscal gains that it will produce in the short term would be compensated by the 
opening up of new fronts of expansion for financial capital. For the workers 
would remain only the promise of employee shareholding, the possibility of 
benefiting indirectly from the growth of the financial market through pension 
funds controlled by the unions. 

Already in October of 2002, on the eve of the second round of the elections, 
representatives of the Lula campaign, among them future Minister Berzoini, had 
signed a joint document with representatives of the São Paulo Stock Exchange 
(Bolsa de Valores de São Paulo or Bovespa), guaranteeing that as a complement 
to a universal public social security system, incentive measures should be taken 
to establish and develop complementary social security funds. The growth of 
the accumulated savings in these funds will have an important role in financing 
productive activity, by means of relevant participation in the capital market, as occurs 
in the main developed countries. 

In the “main developed countries,” it is true, the proposal of making relations 
between the financial system and the industrial structure a sort of precondition 
for a new wage-earning pact has gained much support. In France in particular, 
“Leftists” — like the economists of the School of Regulation (see Aglietta 1998) 
— and “Rightists” — led by Alain Minc (Minc 2000) — find themselves among 
the principal defenders of the proposal of “patrimonial capitalism” as an economic 
model best able to adapt to contemporary circumstances. At the end of the 
1990s important economists, like Michel Aglietta and Robert Boyer, defended 
the proposal to create employee share ownership plans under the control of 
companies and unions as, supposedly, progress in the direction of “new social 
rights.” Employment is not guaranteed in this program but certain workers, like 
management, receive a part of the profits in the form of shareholder participation 
(for a critique of the theory of patrimonial capitalism see Husson 2001:81). This 
shared responsibility would be accompanied by workers’ share in the profits. 

Such an alternative system built on the long transition of the 1970s would 
be made viable by recourse to a new growth regime, first established in the 
United States, and would be progressively structured in Europe due to the 
internationalization of trade and the increasing individualization of the salary and 
wage earner. Aglietta qualifies this “new” capitalism as patrimonial, because of the 
role played by the extension of employee shareholding and of the importance of 
institutional investors in the governance of companies. Patrimonial capitalism, 
supported by the assets and the extension of employee share ownership, would 
be inseparable from economic globalization characterized by these three factors: 
technological changes, individualization, and financial globalization. 

“After a quarter century of crises and structural transformations, a new regime 
of growth directs the development of salary and wage earning societies,” stated 
Aglietta (Aglietta 1997). If capitalism could present the “capacity of mobilizing 
human energies to transform into growth,” the same would not occur with the 
production of a “joint coherence of the shock of individual interests.” Only 
the mode of regulation could orient the regime of growth to “improve also the 
conditions of life of the salary and wage earner.” (Aglietta 1998) The public funds’ 
option for capital, whose argument is centered on fiscal logic, must be articulated 
bearing in mind that human resources are different from nonhuman resources 
— that is, bearing in mind the argument based on social logic. 

From the renewal of the wage-earning pact would arise a sort of third way free 
both from the instabilities of neo-liberal capitalism and the shackles imposed on 
companies by outdated bureaucratic statism: "that is a capitalism of collectively 
owned funds, socialized property of the companies that could emerge through 
the intermediary of institutional investors. This evolution, if it receives the push 
from a political plan, can lead to a sort of wage-earning society markedly different 
from the market economy of the Anglo-Saxon sort, and different also from the 
European corporative capitalisms of the Fordist era.” (Aglietta 1997:462) Mediated 
by the employee shareholding, the old reformist thesis, according to which the 
collaboration of classes will guarantee common prosperity, appears more active 
than ever. If the unions of wage earners find again a power of influence over 
the division of income, they should be aware that the control of the stock of the 
company is a battle that they must pitch and win. In this way, the development of 
wage funds becomes the main institutional mediation able to produce a “virtuous” 
dynamic involving the financial sector and productive activities. 

The ideological unfolding of the economic argument implies an evident 
“appeal” addressed to the workers to defend the competitive position of their 
company and, even more, to participate actively in a system of intense competition 
involving national corporate groups and geographic blocs. Progressive rationality 
understands that the renewal of the social-democratic project must occur within 
the scope of the subjection of national societies to the materiality of capital. 
Therein lies the grand objective of arriving at a new social commitment negotiated 
among companies and unions around the proposal of employee shareholding. 

In Brazil the argument was endorsed by President Lula himself during the 
1st International Pension Fund Seminar in May of 2003: “obviously I am aware 
that the pension funds have as a basic premise the sustainability of the pension, 
of the retirement funds, for their members. Therefore the pension funds cannot 
invest to lose money. They must invest to earn money, preferably reasonably 
well, because the stronger the pension fund, the more privileges they can offer 
their members and the more influence they can have in some decisions in our 
country.” (da Silva 2003) 

From this, justifying the participation of the pension funds in the Brazilian 
privatizations was just a step: “Obviously, if I were asked at the time, I would not 
have hesitated to say that I was against the funds getting into buying Brazilian 
public assets, public assets that often at the time were contributing more to 
firing workers than to hiring workers. In the meanwhile, many of the fund and 
company interventions had good results. And we witness today some companies 
successful in their participation in the important pension funds in Brazil.” (da 
Silva 2003) 

Still during the campaign, candidate Lula presented his "Letter to the Brazilian 
People" — which critics nicknamed "Letter to the Bankers" — and in his visit 
to Bovespa, showed a strong willingness to work towards "strengthening the 
Brazilian stock market.” It was not merely anti-alarmist rhetoric as some critical 
segments within the PT itself thought at first. It was a case of something at the 
same time broader and more complex. 

It was broader in that it involved an unprecedented and far-reaching change 
in the PT’s programmatic trajectory: the support for the financial economic 
sector, previously so harshly criticized. It was complex because it involved new 
interests seated in the very heart of the PT’s bureaucratic structure: the caste of 
syndicalists and employees of the state companies’ pension funds. On the one 
hand, it is difficult to see in these interests the ready-made manifestation of the 
rise of a “new social class” composed of managers, originally from the working 
classes and the base of the CUT, and from the state company pension funds, as the 
Brazilian Marxist economist and sociologist Francisco de Oliveira has argued (de 
Oliveira 2003). On the other hand, it seems quite clear to us that the relative weight 
and the hegemonic capacity of the managerial segments (frações gestoras) of the 
working class itself that share responsibility for valuing these funds have developed 
enormously. The pension funds are thus the bridge that makes the organic 
alliance of the sectors of union bureaucracy and of financial capital viable. 

The political and social change in the Workers Party is clearly shown in the 
reform of the welfare system. However, the nature of the change is not restricted 
to this reform. It is an ambitious program to update Brazilian capitalism 
through a passive revolution led by a new layer of state managers. As indicated 
before, the history of republican Brazil may be understood as a succession of 
processes of passive revolution. The distinguishing feature of the current 
moment is the role played by new social subjects originating in 
the subaltern classes. Although it is true that in other historical moments the 
Brazilian subaltern classes acted as supporters for projects akin to this, such as 
during Getúlio Vargas’ government (1930-1945), the novelty brought about 
by the current government consists of the centrality of a union 
and party bureaucracy whose roots lie in a past marked by social struggles. 


Notes 
1. CUT. A CUT e o novo cenário político. São Paulo, 28 November 2002. 
2. Translator’s note. 


3. Quoted in Karl Marx, Miséria da filosofia. Resposta à Filosofia da Miséria do Sr. Proudhon. 
São Paulo: Ciências Humanas, 1982, p. 136. 


Diminished Democracy: From Membership to Management in American Civic 
Life. By Theda Skocpol. University of Oklahoma Press, 2003. 366 pp. Paper, 
$24.95. 


SARAH SOBIERAJ, Tufts University 


Drawing on an extensive qualitative and quantitative data archive compiled as 
part of the Civic Engagement Project at Harvard, Diminished Democracy offers 
a provocative reassessment of the origin and development of large membership 
associations in the United States (those that have enrolled at least 1 percent of the 
adult population). 

The findings challenge several circulating notions about early American civic 
life. First, Skocpol makes a compelling case that these large membership associations 
were not the product of robust, spontaneous local activity. On the contrary, she finds 
that most of the 58 organizations in her sample emerged in a top-down fashion as 
supra-local civic organizers aggressively recruited and trained leaders, establishing 
groups in communities in a way that mirrored the federated structure of the United 
States government. This historical insight presses academics and politicians alike to 
relinquish nostalgia for an idealized mythical past in which civil society was small, 
community-based, and organic. 

Skocpol challenges a second assumption, popular with conservatives, in which 
the perceived decline of civil society is attributed to the development of social 
programs that eliminated the space for such membership organizations. Skocpol 
argues that these associations flourished in large part because of their relationship 
with the state, not because they were sheltered from its auspices. For example, 
she argues that associations thrived in conjunction with the development of the 
postal service, the removal of state sponsorship of religion, and the evolution of 
government partnerships during times of war. 

In addition, Skocpol provides ample evidence in the form of illustrative 
examples (e.g., the case of the American Legion’s involvement in the passage of the 
G.I. Bill) that these associations were never merely apolitical clubs for social activity 
and charity work. Instead, she describes the integral role that large membership 
associations played in the development of social programs providing aid to farmers, 
mothers and children, and veterans. 

In addition to offering important insights into the history of large membership 
organizations, Skocpol offers a careful analysis of the relationship between 
institutional arrangements and civic life, asking not (only) how many people 
are joining associations, but rather how associations have changed and what the 
implications of these changes are for democracy. She contends that the trans-local 
federated membership associations of the past were spaces in which members 
could socialize and develop recurrent ties to others, across class lines, if not across 
boundaries of race and gender; develop citizenship skills, such as voting and 
participating in public discourse; articulate shared values; and influence elected 
officials, through interactions with powerful leaders and organizing in chapters 
that often mirrored legislative districts. For Skocpol, the very structure of these 
associations reinforces external political activity, enhancing member involvement in 
the political process. The image she creates is one of involved members developing 
relationships and civic skills that connect them to community and political life. 

Skocpol argues that the social movements of the 1960s and 1970s gave birth 
to a civic world dominated by professionally managed advocacy groups (rather 
than membership groups) and nonprofit service providers, and, to a lesser degree, 
PACs, foundations, and think tanks. The common denominator of these groups is 
the absence of active, engaged members. Here, Skocpol challenges an assumption 
of the left, arguing that the explosion of rights-based advocacy organizations has 
weakened, rather than strengthened democracy by crowding out meaningful 
participation in favor of professionally administered organizations whose members 
pay dues in exchange for representation in the political process. While the number 
of associations has grown dramatically since the 1960s, Skocpol maintains that 
these specialized organizations are narrow and instrumental rather than broad 
and expressive. 

While Skocpol corrects our revisionist understanding of large membership 
associations and artfully describes the role that social structures have played in 
their decline, there are components of the book that are less compelling. Skocpol 
uses an impressive data archive to bring the classic associations to life, but the 
image of the contemporary associational arena is less thoroughly documented. 
The portrait of professionally managed citizen groups lacks the rich dimensionality 
that characterizes her analysis of the membership associations. If our notions of 
membership associations were ripe for debunking, it seems equally plausible that 
our caricature of advocacy organizations may also need reassessment. Certainly, 
these organizations have relatively nonparticipatory members, but do these 
organizations offer something to the democratic process worth recognition? For 
example, might advocacy group advertisements, public events, and publications 
help facilitate political discussion among nonmembers? 

I was also left wondering if new associations that may be less visible in these 
data might be filling the void left by large membership associations. The activist 
group Critical Mass serves as an instructive example. One Critical Mass Web site 
(http://www.critical-mass.org) states, “The ‘org’ domain name notwithstanding, 
Critical Mass is not an organization, it’s an unorganized coincidence . . . this is not 
the official Critical Mass [W]eb page, because there is no official Critical Mass [W]eb 
page. There are, however, a bunch of unofficial [W]eb pages.” There are no officers, 
there are no members, but there are Critical Mass groups in over one hundred cities 
globally that take group bicycle rides the last Friday of each month. Much of what 
Skocpol wants to see happening in civil society from political discussions and the 
development of civic skills, to the formation of recurrent ties, a trans-local structure, 
and connections to broader social and political movements (e.g., environmental, 
anti-globalization) is happening through Critical Mass, but without members 
or bylaws, such an unconventional “organization” is invisible. This is merely one 
example, but we must remain vigilant about rendering new associational forms 
visible that may fail to fit prevailing understandings of what associations look like 
and how they behave. 

In closing, Diminished Democracy offers a fresh and deeply insightful 
examination of the rise and fall of large membership associations that troubles many 
current debates, and offers a normative call to assess the toll that the deterioration 
of such communities of interest and influence may take on democratic life in the 
United States. 


Smart Mobs: The Next Social Revolution: Transforming Cultures and 
Communities in the Age of Instant Access. By Howard Rheingold. Basic Books, 
2002. 266 pp. Paper, $16.00. 


NALINI P. KOTAMRAJU, University of California at Berkeley 


Howard Rheingold’s latest book is not intended to be scholarly research so it is 
not entirely fair to assess it by strict academic standards. That being said, what 
then should sociologists take from this book? The book’s greatest strength is that 
it focuses on digital technology, in particular the mobile Internet. This technology 
is increasingly relevant not to an elite few, but to people who use communication 
technology to maintain familial relationships, conduct business (both legal and 
illegal), and engage in paid labor, and, of course, to the academics who study such 
behaviors. Rheingold goes beyond the desktop computer and the mobile phone 
to render visible the digital technology of which non-technologists are usually not 
aware. If Rheingold’s account of cultures and communities in the age of instant 
access reads like science fiction, it is in part because the reality of digital technology 
— what it can do — bears more than a passing resemblance to science fiction 
novels. 

Smart Mobs provides an eight-chapter walking tour of what Rheingold 
enthusiastically calls “the most relevant social science observation on mobile 
telephone use” (24) on the levels of individual personality, immediate social 
network and neighborhood, and society. The introductory chapter introduces 
Rheingold’s fascination with the interactions of physical space, people, and mobile 
phones. In the second chapter, Technologies of Cooperation, Rheingold attempts 
to put mobile phones in the context of a few theories of social groups informed by 
social network analysis, biology, psychology, and economics. In the third chapter, 
Rheingold provides clear explanations of technologies related to mobile phones 
like peer-to-peer computing, distributed processing, and grid computing. His 
fourth chapter focuses on sentient technologies, so called because of the embedded 
chips that can “sense, receive, store, and transmit information” (85), thus enabling 
scientists to link geography and information in novel ways. 

The chapter that will interest a sociological audience is the fifth one, in which 
Rheingold claims that the “most profoundly transformative potential of connecting 
human social proclivities to the efficiency of information technology is the chance 
to do new things together, the potential for cooperating on scales and in ways never 
before possible” (114). He describes the import of reputation and its managers 
for digitally-based systems like eBay. In the sixth chapter, Rheingold reviews the 
implications, including policy, of the transition from wired to wireless technologies. 
In chapter seven, he provides a few concrete examples of what he would characterize 
as smart mobs, pointing to the 2001 presidential election in the Philippines, in 
which residents and activists relied heavily on text messages sent on their mobile 
phones to coordinate protests. In his eighth and final chapter, Rheingold discusses 
what he sees as the implication of the “always on” nature of the mobile Internet, 
focusing only, as technologists always do, on the specter of surveillance. 

Smart Mobs’ greatest strength, its focus on digital communication technology 
and social action, also turns out to be its greatest shortcoming. Sociologists who 
study community, social movements, participatory democracy, cooperation, 
and perhaps even information and communication technologies (ICTs), will 
be frustrated by Rheingold’s privileging of technology over the at least equally 
important contexts in which technologies exist. This problem is apparent in the very 
idea of a smart mob, the most sociologically intriguing aspect of the book, which 
is neither defined nor discussed in enough detail to satisfy a sociological audience. 
Rheingold characterizes smart mobs as “consisting of people who are able to act 
in concert even if they don’t know each other . . . [they] cooperate in ways never 
before possible because they carry devices that possess both communication and 
computing capabilities” (xii). Rheingold purposely intends the concept of a smart 
mob to be visionary and provocative, reminding readers that “smart mobs aren’t a 
‘thing’ that you can point to with one finger or describe with two words, any more 
than ‘the Internet’ was a ‘thing’ you could point to” (182). 

However, for a sociologically informed audience, the discussion of new forms 
of social groups without even an acknowledgement of well-known research on the 
state of U.S. communities, the rise of ethnic-based groups in Europe, or the variable 
forms of family, is troublesome. Similarly, many sociologists will find it troubling 
to think about cooperation and collaboration in terms stripped of any notion of 
power that stems from people’s positions in society. Rheingold, like many other 
commentators on digital technology, gravitates to research in biology, psychology, 
and economics, fields that lend themselves to the study of individuals and less to the 
complexity of human societies. Core social structures such as social class, religion, 
nationality, gender, ethnicity, and race — important because they constitute the 
contexts of these technologies — do not appear in any significant way in the book. 

Rheingold passionately believes that this technology can and will foster social 
revolutions and his Web site (http://www.smartmobs.com) collects such stories 
and accounts. His claim that the mobile Internet and other digital technology 
enable “people to act together in new ways and in situations where collective 
action was not possible before” (xviii) remains to be corroborated by systematic 
research. He rightly reminds his audience that digital technology might be making 
new forms of social organization possible, encouraging sensitivity to the presence 
of technology that the field of sociology certainly needs. The intricacies of how 
much digital technologies integrate, modify, or bounce harmlessly off existing 
social practices are a subject that the field of sociology is beginning to investigate, 
albeit slowly. While Rheingold's agenda is laudable, pointing out the possibility of 
the mobile Internet and other technologies to facilitate positive (and, in less detail, 
negative) collective action, the limitation of the book is its uninformed model of 
how societies work in general. 


Shared Beginnings, Divergent Lives: Delinquent Boys to Age 70. By John H. Laub 
and Robert J. Sampson. Harvard University Press, 2003. 338 pp. Cloth, $49.95. 


MEGAN M. SWEENEY, University of California, Los Angeles 


Why do some adolescent offenders find their way to a crime-free adult life of stable 
employment and happy marriages, while others persist in criminal behavior long 
into adulthood? In Shared Beginnings, Divergent Lives: Delinquent Boys to Age 70, 
Laub and Sampson blend diverse methods of data collection and analysis to study 
continuity and change in criminal behavior over the adult life course. Laub and 
Sampson develop a general theory of crime which emphasizes informal social 
control, routine social activities, and human agency. They aim to account both for 
persistent offending and for desistance from crime, and argue that their findings 
“are more general than specific with respect to place, historical time, gender, and 
race” (283). 

The research effort undertaken for Shared Beginnings, Divergent Lives is 
ambitious and impressive. Laub and Sampson follow up Sheldon and Eleanor 
Glueck’s original sample of delinquent youth who were sent to two Massachusetts 
correctional schools during adolescence. They conduct death record and criminal 
history searches for all 500 original sample members. They also complete detailed 
life-history interviews with a subset of 52 of these men, now approaching age 70. 
This tracing effort is particularly remarkable given that over 35 years had elapsed 
since these individuals were last contacted by the Gluecks. Moreover, sample 
members all had a criminal past (and as Laub and Sampson note, sometimes also 
a “criminal present”), bringing into question their eagerness to be found in the 
first place. 

Laub and Sampson begin their analysis with a detailed quantitative examination 
of age-crime trajectories. They note considerable stability in the determinants of 
crime across the life span, but also individual variability in the age-crime relationship 
which is not well explained by traditional indicators of childhood risk. Although 
the overall level of offending does tend to be greater in a high than low childhood 
risk group, the shape of the age-crime profile is largely indistinguishable between 
these two groups of adolescent offenders, suggesting similar causal pathways 
towards desistance. Persistent offending is not well predicted based on indicators of 
childhood or adolescent risk. Although it is important to keep in mind that Laub and 
Sampson's sample includes only serious adolescent offenders, their analysis offers 
a significant challenge to a developmental taxonomy approach to understanding 
patterns of offending. These results are certain to stimulate additional research, 
using population-based samples, on the causal determinants of desistance. 

Laub and Sampson turn next to their richly detailed life-history narratives. 
Here they particularly demonstrate the importance of turning points associated 
with life events such as entering a high-quality marriage, reform school, or military 
service. Many results from the in-depth interviews are further confirmed in a final 
quantitative analysis in which within-individual variations in crime are considered 
simultaneously with between-individual differences in the propensity to offend. 
Reflecting on both their qualitative and quantitative results, Laub and Sampson 
offer a more nuanced perspective on how turning points function in the desistance 
process than in their previous book, Crime in the Making: Pathways and Turning 
Points Through Life. They point to direct causal effects of situations and contexts, 
but also allow for selection effects and for personal agency. For example, Laub 
and Sampson argue that marriage can promote desistance by providing access to 
support and other resources (including direct monitoring of behavior) offered 
by a spouse, restructuring routine social activities, and promoting an increased 
sense of responsibility. But they also acknowledge that individuals vary in their 
responsiveness to the potential turning point offered by marriage, such that men 
who make an inner resolve to change are “. . . better positioned situationally to 
respond to the monitoring and control and the love and social support around 
them” (249). Like employment and military service, marriage can be a catalyst for 
change, but it is not deterministic. 

Some readers may question the generalizability of these findings, given that 
Shared Beginnings, Divergent Lives reflects the lives of white men who grew up in 
Depression-era Boston. Yet the interesting issues here are not whether Laub and 
Sampson’s data are too geographically specific or too old to be of interest (as Laub 
and Sampson note, we do not yet have 70 years of life data for more recent cohorts 
of adolescent offenders), but whether and how explanations for continuity and 
change in criminal behavior may vary across structural contexts characterizing 
particular places and historical periods. Do marriages of the early 21st century 
tend to restructure routine social activities to the same extent as those of the 1950s 
and 1960s? Does military service tend to provide the same catalyst for life change 
in a post-G.I. Bill era? Assessments of the robustness of these findings across 
subpopulations, including those defined by race and gender, might further confirm 
the authors’ assertion of the general applicability of their theory. Identifying points 
of variation (or stability) across demographic subpopulations, place, and historical 
time will provide important theoretical leverage on trajectories of crime. 

In sum, Shared Beginnings, Divergent Lives offers considerable insight into 
continuity and change in criminal behavior over the adult life course. It is certain to 
spark debate, as well as further research, on the determinants of criminal offending, 
and should be required reading for anyone with interests in this area. 


The Starting Gate: Birth Weight and Life Chances. By Dalton Conley, Kate W. 
Strully and Neil G. Bennett. University of California Press, 2003. 258 pp. Cloth, 
$55.00; paper, $21.95. 


Conceiving Risk, Bearing Responsibility: Fetal Alcohol Syndrome and the 
Diagnosis of Moral Disorder. By Elizabeth M. Armstrong. Johns Hopkins 
University Press, 2003. 304 pp. Cloth, $42.95. 


MAIREAD MOLONEY, University of North Carolina at Chapel Hill 


Fetal Alcohol Syndrome (FAS) and low birth weight (LBW) are two health 
outcomes typically diagnosed at the outset of life and are intricately linked to 
maternal health, socioeconomic status and race. Not all babies are at equal risk 
of being born LBW and not all mothers are at equal risk of bearing children with 
FAS. The two books reviewed here, Conceiving Risk and The Starting Gate, examine 
FAS and LBW respectively, in an effort to disentangle the complex relationship 
between social and biological influences on health outcomes and life chances. The 
authors of these works utilize different theoretical and research frameworks in 
order to investigate the causes and consequences of their chosen health outcome. 
I will provide a brief overview of each work and conclude with a summary of the 
findings and implications. 


The Starting Gate 


About 7 percent of children born in the United States are labeled LBW at birth; 
these babies are more likely to be African American and poor. LBW babies, defined 
as any child weighing less than five and a half pounds at birth, are at an increased 
risk of infant mortality. Those who do survive are more likely to face a host of 
negative outcomes including poor health and chronic illness as well as physical, 
mental, and intellectual disabilities. Behavioral disorders, lower IQ scores, lower 
educational attainment, and chronic illness tend to persist into adulthood. 

Conley, Strully, and Bennett use data from both the Panel Study of Income 
Dynamics (PSID) and the 1995-97 Matched Multiple Birth Data Set to investigate 
the biosocial interaction between economic resources and inherited risk. The data 
confirm a hereditary link between LBW parents and LBW offspring. However, there 
is a strong indication that higher family income may all but eradicate the hereditary 
consequences of LBW by providing additional resources pre- and postnatally. 
Resources may include better prenatal care, better nutrition, less exposure to a 
potentially harmful environment, and access to superior educational resources. 
The authors conclude that children predisposed to LBW are faced with either a 
“double advantage” or a “double disadvantage,” depending on the socioeconomic 
state of the family into which they are born. 

Conley, Strully, and Bennett follow their statement of findings by proposing 
public policy advocating for the label of LBW to be taken into account when families 
seek state or federal aid. They suggest that throughout their educational careers, 
children who are born LBW should have this status noted by their teachers. They 
write, “Attention to birth weight status may help parents, teachers and physicians 
recognize and refine treatments of disability and developmental delay,” but do not 
explicate how such a policy would be effectively implemented. 


Conceiving Risk, Bearing Responsibility 


The slogan of the National Organization on Fetal Alcohol Syndrome (NOFAS) 
embodies, to a large extent, the attitudes commonly held in our society about 
the interaction between alcohol and pregnancy. “No safe time. No safe amount. 
No safe alcohol. Period.” These beliefs are, Armstrong argues, social constructions 
that ignore the facts. In truth, FAS occurs in the children of only a small minority 
(5%) of women who are heavy drinkers — those who consume upwards of 14 
drinks per week during pregnancy — and is highly correlated with race, smoking, 
advanced maternal age, malnutrition, and poverty. In the vein of Freidson and 
Zola, Armstrong asserts that medical authority has edged beyond its bounds of 
scientific expertise and applied diagnostic criteria to capture a socially deviant 
behavior — drinking alcohol during pregnancy. 

Armstrong provides an historical overview of attitudes toward alcohol use and 
pregnancy as well as qualitative data highlighting the lack of cohesion amongst 
physicians regarding the diagnostic criteria, prevalence and cause(s) of FAS. 
Armstrong also examines the drinking behaviors of women using data from the 
National Maternal and Infant Health Survey (NMIHS). She finds that nonpregnant 
women who are poor, minority, and less educated — while less likely to drink in 
general — are also less likely to alter their drinking behavior once they become pregnant. 
Thus, of women who are minority and poor, those who drink heavily tend to do 
so regardless of pregnancy status. This may partially explain why this subset of 
women is at the greatest risk for bearing offspring afflicted by FAS. 

Armstrong is critical of broad public policy interventions. She asserts that 
by ignoring the social inequality and risk factors that are at the root of FAS and 
promoting a discourse of individual responsibility and blame, society essentially 
overlooks the women and children most in need of education and assistance. 
Socially constructed notions of appropriate behavior offer only judgment and 
fail to consider why a woman would continue to drink heavily throughout her 
pregnancy. As Armstrong notes, "It's as if a woman's drinking problems have no 
social meaning until she becomes pregnant." 

In Conceiving Risk, Bearing Responsibility, Armstrong does an excellent job 
of viewing the social construction of a medical disorder using the tenets of 
labeling theory. In contrast, it would have behooved Conley, Strully, and Bennett 
to consistently emphasize that LBW is a biological state resulting from health or 
development issues and is not a medical diagnosis of disease or disability. The 
authors call for the construction of a label for individuals born LBW, meant to 
increase their access to public assistance. Link and Phelan, as well as others, have 
demonstrated that labeling a disorder, even when it is being done so that the 
individual can receive services beneficial for their condition, can produce stigma 
that reduces life chances. Further, the label of LBW would most often be applied to 
individuals who are already at social disadvantage — those who are minority and 
poor. In sum, while both FAS and LBW are acknowledged to have roots in social 
and economic disadvantages, only Armstrong fully explores how our propensity 
to apply medical labels to social phenomena is worked out within a particular 
cultural context. Analyzing and understanding the subjective elements inherent to 
any medical diagnosis or label is an invaluable exercise for any sociological thinker 
and one that may be accomplished through exploration of either of these books. 


Altered Lives, Enduring Community: Japanese Americans Remember their 
World War II Incarceration. By Stephen S. Fugita and Marilyn Fernandez. 
University of Washington Press, 2004. 253 pp. Cloth, $45.00; paper, $24.95. 


JONATHAN MARKOVITZ, University of California, San Diego 


In the early pages of Altered Lives, Enduring Community, Fugita and Fernandez 
note that the vast majority of Issei (members of the immigrant generation of 
Japanese Americans who were, in the authors’ words, “forcibly removed from their 
Pacific Coast homes and incarcerated in desolate camps in the interior of the 
country” where they were held for an average of two to three years) have died. But 
the Issei made up only about one-third of the incarcerees. The other two-thirds 
were the second-generation, American-born Nisei, who are, for the most part, now 
in their 70s, 80s, or 90s (3). Fugita and Fernandez argue, rightly, that there is an 
“imperative to preserve the experiences” of this population, and that the Nisei's 
advanced age lends urgency to the task. The authors note that, while there is an 
expansive literature describing the process of expulsion of Japanese Americans 
from the West Coast, and documenting the conditions of the internment camps, 
there have been remarkably few efforts to collect “detailed information from a 
representative group of former incarcerees” (4). The timeliness of this volume 
is therefore beyond question, and in a post-9/11 climate the need for thoughtful 
reconsiderations of histories of U.S. civil liberty violations directed towards recent 
immigrants and their communities only makes this book even more welcome. 
Altered Lives, Enduring Community is not another historical account of the 
internment years (though it does provide complex and nuanced historical context), 
but is instead primarily an attempt to assess the role that policies of exclusion 
and incarceration during World War II had for Japanese Americans during the 
postwar years, as they tried to "reestablish themselves in American society" (6). 
Fugita and Fernandez draw upon a remarkably rich resource for their analysis: the 
Densho project (the full title is “Densho: The Japanese American Legacy Project”). 
The Densho project was founded in Seattle in 1996 to document oral histories of 
Japanese Americans who had faced internment during World War II. The oral 
histories (which were videotaped) were supplemented in 1997 with a representative 
sample survey of Japanese Americans living in King County, Washington, which 
includes Seattle. Fugita was one of the people involved in designing and directing 
the survey (6). Fernandez and Fugita rely most extensively upon the survey data, 
though they draw upon the oral histories to illustrate and contextualize key points 
in often heartbreaking ways. One especially poignant example: Frank Fujii, a Nisei, 
and a child in 1941, is quoted describing his memories of the knock on his door 
after the bombing of Pearl Harbor. He opened the door to find two white men 
identifying themselves as FBI agents, asking for his father. “And I say, ‘Oh Dad’s 
here somewhere.’ And I get him and they took him. I didn’t see him after that for 
three-and-a-half years” (49). One minor frustration of the book is that the authors 
chose not to tap into the oral histories as often as they might have. The 
excerpts that are included are invariably powerful, and including more could only 
have added resonance and emotional depth to the major claims of the book. The 
authors do, however, direct us to the Densho project website (www.densho.org) 
where more of this material is available. 

The book includes chapters on Japanese American life before World War II, 
during incarceration, and during and after resettlement. The authors provide 
excellent literature reviews in every section, and make good use of the Densho 
survey data to shed light upon aspects of incarceration that have not received 
much popular or scholarly attention. The discussions of the difficulties during the 
resettlement period are most effective, as the authors address not only widely noted 
problems of economic deprivation and discrimination, but also the geographic 
dispersal of former internees (many of whom were initially not permitted to return 
to the Pacific Coast), and the need for rebuilding entire communities in the face 
of tremendous barriers. The kinds of internal struggles faced by resettlers (e.g., 
how to avoid drawing attention to their ethnicity which had been so thoroughly 
demonized, and how to deal with the stigma attached to incarceration), are only 
one part of the story captured here but generally ignored in the literature. 

Because the authors take pains to note that a major issue surrounding the bulk 
of work on the incarceration has to do with the difficulty of knowing how many of 
the historical accounts are representative of incarcerees in general, it is necessary to 
note that the small sample size of the Densho survey (only 183 respondents, when 
there were well over 100,000 incarcerees) makes it difficult to see this study as truly 
addressing the need for representativeness that the authors identify. (The sample is 
too small to provide confidence of representativeness even for Seattle-area Nisei.) 
But the authors’ arguments about the dynamics and effects of the incarceration 
period ring true, and are supported by their data in compelling ways, even if the 
representativeness cannot be fully established. Certainly, the failure to prove that 
the data of the Densho project is representative does nothing to diminish the 
importance of this book. 
Social Dynamics of the Life Course: Transitions, Institutions, and 
Interrelations. By Walter R. Heinz and Victor W. Marshall. Aldine De Gruyter, 
2003. 306 pp. Cloth, $55.95. 


J. BRIAN BROWN, The Ohio State University 


Heinz and Marshall's edited volume Social Dynamics of the Life Course: Transitions, 
Institutions, and Interrelations arose from the proceedings of a symposium at the 
University of Bremen entitled “Status Passages and Risks in the Life Course.” 
The contributors make this volume an enjoyable international affair designed to 
expose the reader to the history of life-course research, the dominant perspectives 
of today, and the challenges of life-course research in an increasingly global society. 

In the first section, “Tradition and Innovation in Life-Course Research,” the 
contributing authors lay the theoretical foundation for life-course research. Marshall 
and Mueller begin the section with an overview that compares and contrasts the 
United States and European life-course perspectives. Next, Krüger uses the example 
of gender and the division of labor to argue that we must focus on how the areas 
of the life course are gendered to explain ongoing sex differences in an increasingly 
individualized life course. In the next chapter, Elder draws from his understanding 
of period effects to emphasize the growing importance of geographical effects, 
especially in cross-national research. Period and geographical effects, he concludes, 
are not just methodological obstacles; they are worthy of substantive examination. 
Heinz concludes the section by detailing a mixed-methods panel survey design that 
could prove effective at answering questions about the dynamic between individual 
agency and institutional/structural influence. 

The second section, “Life-Course Transitions and Sequences,” begins with 
Sackmann and Wingens’ theoretical overview of life-course “trajectories” versus 
“transitions,” which sets up their proposal for a typology of sequence types that 
bridges the gap between those two concepts. The following chapter by Buchmann, 
Kriesi, Pfeifer, and Sacchi is the first traditionally empirical contribution. They 
use women’s employment interruptions in Switzerland to demonstrate the utility 
of both organizational and segmented labor market theories in understanding 
women’s life-course transitions. Continuing this theme, Schaeper and Falk 
examine Germany’s post-unification gender order and the effect unification had 
on (formerly) Eastern and Western German women’s life-course experiences in 
the labor market. The theoretical clarity, historical significance, and multifaceted 
empirical evidence make this chapter worth singling out as the strongest overall 
contribution. 

To begin the third section, “Institutions and the Life Course,” Weymann extends 
the examination of the unification of Germany with a more general discussion of 
life-course policy, institutions, and changing individual life-course options. In the 
next chapter, Leisering and Schumann use Germany’s social welfare system as an 
example to examine the role of institutional control, or regulation, over the life 
course. Furstenberg, turning the attention to the United States, emphasizes the role 
of social class on the relationship between institutions and the life-course. 
All of the authors in the fourth and final section, “Interrelations and the 
Life Course,” attempt to conceptualize “linked” or “interdependent” life-course 
experiences. Whereas early life-course research focused on an individual's history 
of successive life-course stages, this new direction in life-course research attempts to 
observe patterns that emerge when the life courses of specific individuals are linked 
empirically. For example, one can examine the career paths of dual-earner couples, 
as Moen did in the first chapter of this section. Without a couple-level framework, 
she argues, it is difficult to understand the roles of conflict and adaptation in 
the individual life course. In the next chapter, Drobnič expands on the previous 
chapter using survival analysis (i.e., hazard models) to illustrate how the presence 
and characteristics of a husband affect wives’ employment transitions. Finally, 
Born takes a different perspective on “linked lives” and descriptively illustrates 
intergenerational convergence in attitudes toward gender equality that are not 
observed in actual behavior. 

The editors of Social Dynamics of the Life Course: Transitions, Institutions, and 
Interrelations clearly thought through the organization and content of this volume. 
Every chapter finds its place, and most segue seamlessly to the next, which makes 
reading it from cover to cover an intellectually cumulative experience. Further, as 
I progressed through the chapters, I realized that many of the contributions focus 
specifically on gender inequality and the division of labor. As a result, this volume 
should be of special interest to those researchers involved in that field of research 
and/or instruction. I also highly recommend this for use in a graduate-level seminar 
on the life-course or gender stratification. 

This volume makes it apparent that life-course theories have become far 
more advanced than the methods currently employed to test them. Indeed, 
there is no shortage of proposed typologies and linear regression models with 
dichotomous variables to test them. The final section of this volume only adds to 
the methodological challenge. Therefore, my only frustration with the volume is the 
absence of discussion on how we might use already available advanced quantitative 
methods to test these complex theories of interrelatedness. 


The Unthinkable Revolution in Iran. By Charles Kurzman. Harvard University 
Press, 2004. 287 pp. Cloth, $27.95; paper, $19.95. 


JOHN FORAN, University of California, Santa Barbara 


This is perhaps the best work on the Iranian revolution that I have seen, in one 
way or another better than those of Misagh Parsa, Said Amir Arjomand, Mansoor 
Moaddel, Ervand Abrahamian, or Nikki Keddie, the best English-language social 
scientists and historians who have written book-length studies on the revolution. 
Of them all, Charles Kurzman’s is the richest, most compelling account and 
interpretation, tapping more primary and hard-to-get sources than any other work 
on the revolution, and evincing a real gift for interpreting the meanings of people’s 
acts and words. And make no mistake, this revolution presents multiple layers of 
complexity; as one Iranian woman put it, “If you ask a thousand Iranians about 
the revolution, you will get a thousand different stories and explanations, and each 
is correct in its own way” (84). 

Kurzman’s general narrative and organizational strengths shine throughout the 
book, which is innovatively written and organized around competing theories. The 
standard explanations stressing the significance of the condition of the economy, 
the role played by U.S. influence, the internal cultural and organizational work of 
mobilization, and the limits of military repression are considered in turn. Without 
discarding any of them entirely, Kurzman subjects each to a barrage of awkward data 
and recalcitrant anomalies that reveal its limitations. On this basis, and because 
all prediction of revolutions is retrospective, he feels he has undermined most of 
the existing explanations. 

The book builds up to a great final chapter stressing the significance of 
individuals’ perceptions in deciding to join mass movements, showing how these 
change in the face of uncertainty. The argument goes something like this: as the 
shah wavered in his will to carry out full-on repression of the protests, their size 
swelled as people felt there would be safety in numbers. The influential free-rider 
problem is turned on its head by Kurzman and other “critical mass” theorists who 
argue instead, “What if people want to engage in collective action but only let 
themselves do so when it looks safe and seems likely to succeed?” (132). The key 
to this opening up of the moment to hitherto unthinkable alternatives is a feeling 
of the viability of those alternatives, and the making of “a viable movement” (the 
title of the chapter): “The point is not for some social scientist to look back, after 
the fact, and declare a movement to have been viable. The point is that people 
make such judgments in real time, during moments of confusion, and that these 
judgments can be self-fulfilling” (136). 

Kurzman considers this an anti-explanation because “[i]nstead of seeking 
recurrent patterns in social life, anti-explanation explores the unforeseen moments 
when patterns are twisted or broken off. . . . What is left when we part from 
retroactive prediction? Understanding” (138). The theoretical implications are 
profound, centering the actors' perceptions and their agency as key principles of 
social life, and lending support to Giddens's notion of structuration, but going 
beyond the theorist's abstractions and untested theory by the force of a rich 
empirical analysis. Foregrounding the confusing anomalies of lived experience in 
times of change allows Kurzman to speculate even further that "viability offers a 
window onto the whole of social life" (170), including the reproduction of routine 
social reality, putting the reader in mind of the ways the current Bush regime preys 
on the public's desire for a viable way of life in troubled times. In the Iranian context, 
“So long as revolution remained ‘unthinkable,’ it remained undoable” (172). When 
enough people came to think the unthinkable, the revolution triumphed, though 
no one had predicted it. 

Kurzman’s thesis is a strong one that takes on existing theories of revolution and 
dominant styles of sociology more generally in a way that will draw discussion. I 
don't agree with all that I read, but I found the argument highly thought-provoking. 
The main problem I see with this approach is that by eliminating factors, taken 
singly, the author may overlook their combination(s) in producing results (to his 
credit, he attempts to address this problem). He makes, I think, a similar mistake 
when he compares Iran with other oil-producing nations that experienced no 
upheaval to discard an economic interpretation of the revolution (91ff), rather 
than considering the full constellation of factors — the state, political culture, 
or world-system — that might have explained these varied outcomes. Another 
option would be to develop more complex concepts that cut across the categories 
of economy, politics, and culture, such as my own “political cultures of opposition” 
to account for the varied motives for participating, or “dependent development” 
to make sense of a complex, contradictory political economy. A third is to engage 
more with the interesting work being done on emotions by an emerging generation 
of social movements and revolutions scholars. Such approaches would go against 
the grain of Kurzman’s anti-explanation, and it would be interesting to know how 
he would respond to them. Not all sociologists will embrace this book and its ideas, 
but this only testifies to the novelty and force of the arguments made. 


Cultural Trauma and Collective Identity. By Jeffrey C. Alexander, Ron 
Eyerman, Bernhard Giesen, Neil J. Smelser, and Piotr Sztompka. University of 
California Press, 2004. 314 pp. Cloth, $65.00; paper, $24.95. 


BRIAN STEENSLAND, Indiana University 


Is it possible that the tragic events of September 11, 2001, might not have 
traumatized the American people as they subsequently did? In developing a theory of 
cultural trauma, the authors of this volume suggest just that possibility. Alexander 
and his colleagues adopt an unapologetically social constructionist orientation to 
argue that events, even those on the scale of the 9/11 attacks, are not inherently 
traumatic. Rather, the cultural templates through which they are experienced 
render them so. This approach to trauma redirects our attention from looking 
at the nature of events to looking at their collective representation. 

The book is a collective enterprise, but each of the six chapters (plus an epilogue) 
is written by a single author. Alexander sketches a theory of cultural trauma and 
offers this definition: “Cultural trauma occurs when members of a collectivity 
feel they have been subjected to a horrendous event that leaves indelible marks 
upon group consciousness, marking their memories forever and changing their 
identity in fundamental and irreversible ways” (1). The analytical linchpin here is 
the relationship between cultural trauma and collective identity: no threat to group 
identity, no cultural trauma. This defines many types of tragedy as something other 
than cultural trauma, and thus beyond the scope of the book. For example, slavery 
was not culturally traumatic for blacks until the creation of an African American 
identity during Reconstruction against which slavery was retrospectively cast as 
traumatic. Neither was the Holocaust initially traumatic for Jews (much less, the 
West) until it was later coded and narrated as an assault on collective identity. 
Some may find this focus on the relationship between trauma and collective 
identity restrictive, since it brackets the significance of other sources of subjective 
experience. Certainly blacks were in some sense traumatized by slavery before it 
became a touchstone for collective identity decades after Emancipation. Yet it is 
threats to collective identities that the authors seek to elucidate. 

Alexander distinguishes the book's explanatory approach from two existing "lay 
theories" of trauma — Enlightenment and psychoanalytic versions — that each 
suffer from the “naturalistic fallacy”: the assumption that events themselves create 
trauma. While Alexander’s past work has been self-consciously neo-Durkheimian, 
his theory of cultural trauma contains both Durkheimian and Weberian elements. 
He discusses cultural classification, collective memory, and symbolic pollution, but 
also identifies carrier groups, institutional settings, and stratification hierarchies 
as essential elements in explaining the origins and routinization of narratives of 
trauma. In his empirical chapter on the creation of the Holocaust as a universal 
“trauma drama,” these Weberian strands play second fiddle to a reading more firmly 
grounded in literary theory and French structuralism. Nevertheless, the synthetic 
theoretical framework is spelled out clearly. 

The other co-authors highlight and extend various facets of the theory. Smelser 
explicates how emotions provide a bridge between cultural and psychological 
levels of trauma, and how affective states create the conditions in which particular 
claims about trauma resonate. Eyerman inverts Alexander’s analytic framework by 
placing collective identity in the foreground. He examines the formation of African 
American identities in the late 1800s against the backdrop of an emergent narrative 
on the trauma of slavery, and then charts how this trauma narrative changed in 
response to diverging black identity movements during the subsequent century. 
Giesen’s chapter on German national identity after the Holocaust takes a similar 
interest in trauma as a touchstone for identity formation and change. Sztompka 
outlines an ideal-typical sequence through which traumas unfold and applies it to 
the collapse of the communist system in East-Central Europe. 

Smelser’s epilogue applies a cultural trauma framework to the events of 
September 11, 2001. Written just four months after the attacks, it demonstrates the 
difficulty of applying a constructionist approach to events that are still so fresh. He 
successfully shows how strands of American culture mediated the perceptions of 
the attacks, but unfortunately does not bolster the strongest premise of the book 
— namely, that a counterfactual narrative existed that could have rendered the 
attacks as non-traumatic. Smelser in fact describes the events of 9/11 as containing 
elements of the “quintessential cultural trauma,” a description dangerously close 
to the “lay trauma” theories the authors eschew. The book’s strength, however, is 
to provide a framework in which such a counterfactual becomes conceivable. For 
example, if opinion leaders had promoted a convincing narrative that Americans 
“deserved” the attacks or if the nation’s values were construed as so fragmented 
that there were few widely shared sentiments to disrupt, then 9/11 might not have 
become a cultural trauma. In illuminating such possibilities and providing heuristics 
for their analyses, Alexander and his colleagues have opened up an important line 
of inquiry into threat, collective identity, and disruptions of the normative order. 
The Man Who Shocked the World: The Life and Legacy of Stanley Milgram. 
By Thomas Blass. Basic Books, 2004. 360 pp. Cloth, $26.00. 


THOMAS F. PETTIGREW, University of California, Santa Cruz 


This book offers a well-written biography of Stanley Milgram (1933-84), the 
social psychologist who conducted the widely known experiments on obedience. 
Thomas Blass, a social psychologist himself, has made Milgram and his work his 
career focus in recent years. Dozens of intensive interviews, including two with 
the reviewer, and years of studying manuscripts and letters led to this book. 

Blass views his subject as an eccentric visionary. He met Milgram only once, 
but he captures much of this difficult man as I remember him. I knew Stan well, 
first as a fellow doctoral student and then as a fellow faculty member in Harvard’s old Social 
Relations Department. 

Clearly, Milgram is the author’s hero — especially for the relevance of the 
obedience research for the Holocaust. Nevertheless, to Blass’s credit, he describes 
Milgram’s dark side — a “complex, sometimes enigmatic, individual — . . . 
controlling and domineering — a prima donna who always had to have his way” 
(182). Blass speculates that continued drug use, especially to overcome writing 
blocks, offers a possible explanation for these characteristics. But I have no 
knowledge of this possibility. I just recall him as routinely cruel to graduate students 
in a seminar we co-taught. In any event, Blass feels Milgram’s “genius” outweighed 
these personal flaws. This view mirrors Milgram's defense of the questionable ethics 
of his obedience studies — that the importance of the findings outweighed any 
potential harm to the uninformed subjects. 

I have only two caveats to make of this interesting book. First, the description 
of Milgram's greatest setback — Harvard's denial of tenure — is flawed. The 
prolonged meetings over this decision were less concerned with Milgram, as Blass 
writes, than they were about other possibilities for the position. And Milgram knew 
long before the announcement that he would not receive the appointment. These 
errors probably derive from Blass's faithfully recording the comments of Roger 
Brown, Milgram’s major supporter who remained bitter over the decision. He cites 
Brown as saying that opponents of Milgram’s promotion wrongly “attributed to 
him some of the properties of the [obedience] experiment” (153). But this ignores 
the fact that Milgram’s authoritarian behavior, especially toward graduate students, 
reinforced his critics’ qualms. 

Second, from the book’s title to repeated claims for Milgram’s ideas, there is 
some hype and hyperbole. Milgram’s thesis on conformity was not “. . . the first time 
an objective technique was used to study cross-cultural differences in behavior” 
(54), nor was the “stimulus overload” concept original to Milgram. And putting the 
computer to use in replicating Asch’s conformity study had been devised by Richard 
Crutchfield and others long before Milgram tried it. Moreover, his many interesting 
field studies of urban life conducted in the 1970s, while new to psychology, retraced 
similar studies by symbolic interactionists. 

Yet Milgram possessed truly innovative research skills. He made a specialty of 
creating high-impact studies with behavioral dependent variables that convincingly 
demonstrated the power of situations to shape behavior. Milgram also employed 
clever follow-up studies — 19 variations in his obedience research alone — that 
systematically tested plausible rival explanations. My favorite involved his repeating 
his obedience study in a ramshackle setting in downtown Bridgeport, Connecticut.
When his results almost matched his Yale laboratory results, he was able to dismiss 
the prestige of the university as a major factor. Here Milgram was inspired by 
the many repetitions by Solomon Asch, his mentor, of Asch's famous conformity
experiment. It is commonplace today in social psychological journals to see several 
cumulative experiments reported on the same hypothesis — testimony to this 
Asch-Milgram contribution to social psychological method. 

However, Milgram did not significantly add to the theoretical development of 
his discipline. A great variety of studies in social psychology, from Asch’s conformity 
work to Philip Zimbardo’s prison study, have revealed the power of the situation.
Blass correctly notes, “. . . theoretical integration . . . was not his forte” (212). He
was far more fascinated by the social phenomena themselves than with their larger 
theoretical significance. Even when relevant theory already existed, he largely 
ignored it. For instance, Milgram popularized the "small world" phenomenon — a 
special case example of sociology's social network theory. But he did not formally 
link his phenomenon to the theory. Consequently, Milgram's contributions to 
social psychology are almost entirely empirical. 

In his final chapter, Blass sums up Milgram’s “legacy.” One aspect of this legacy 
involves the ethical problems in the obedience research. Together with other studies 
using uninformed subjects, this work led to today’s standards for the treatment 
of human subjects in research. But Blass stresses the use in the popular culture 
of Milgram’s studies — especially the work on obedience and the “small world” 
network ideas. Undoubtedly, Milgram was the most successful self-promoter that 
social psychology has ever had. He produced popular articles, books, and films to 
spread his ideas. For a discipline that is largely out of view, even to other social 
scientists, this was no small accomplishment. 

The most important application of his ideas concerns the implications of the 
obedience research for the Holocaust. At first reading, his findings on the power 
of orders from an authority appear to support Adolf Eichmann’s defense of “only 
following orders.” But Milgram rightly held that this defense does not absolve Nazis 
from having transformed the obedience “. . . potential into actual brutality and
actual slaughter” (270). Yet there is an inconsistency here. If following such orders
leads to brutality and evil, then how could his having his subjects commit what 
was apparently a brutal act not be harmful for the subjects? In a thoughtful final 
discussion, Blass points out that “the banality of evil” revealed by the obedience 
research is an insufficient explanation for the Holocaust. There was also anti- 
Semitism dripping in hatred by the S.S. 

Beyond any special interest in Milgram, then, social scientists involved in 
experimental ethics, obedience, and other Milgram topics will find this book a 
rewarding read. 


Measuring Racial Discrimination. By National Research Council. The National
Academies Press, 2004. 317 pp. Cloth, $44.95. 


DEVAH PAGER, Princeton University 


Discrimination has long been a fascinating but frustrating subject for social 
science. It is fascinating because it represents a powerful mechanism underlying 
many historical and contemporary patterns of inequality; frustrating because it 
is elusive and difficult to measure. The impressive new volume from the National 
Research Council, written by a team of interdisciplinary experts (chaired by 
Rebecca Blank), takes stock of the existing approaches to measuring racial 
discrimination, guiding readers through the possibilities and potential pitfalls of 
a wide range of available alternatives. Although focused on racial discrimination 
(primarily against African Americans), the insights of this book can be readily 
applied to the study of discrimination more broadly (e.g., by gender, national 
origin, age, etc.). Indeed, this volume is likely to become required reading for all 
those interested in the subject of discrimination, broadly defined. 

The book is divided into three sections. Part one focuses on concepts, with 
definitions of race, discrimination, and theories of discrimination. A majority of 
research on discrimination (and, correspondingly, a majority of the approaches 
discussed in this book) focuses on discrimination in one context at one time. The 
authors encourage researchers to begin thinking more dynamically. Here, and in 
a final chapter, they urge the development of models of discrimination capable 
of going beyond the individual level and single points in time to instead
consider the organizational/structural processes that constitute forms of racial 
discrimination as well as the cumulative features of discrimination across multiple 
domains and over time. 

Part two represents the heart of the volume. After a brief overview of the 
problems of causality and counterfactuals, the following three chapters explore the 
major approaches to measuring racial discrimination — including experimental 
methods, large-scale statistical analyses, and attitudinal and behavioral indicators. 
For each method, the authors present a thorough overview of the key design 
features, highlighting the various strengths and weaknesses, primarily with respect 
to the problems of causal inference and generalizability. These chapters are written 
at a high level of sophistication, but central points are explained clearly and will be 
readily accessible to a wide readership of researchers and students. 

The first of the three methods chapters focuses on experimental approaches, 
which offer a clear view of causal mechanisms by randomly assigning subjects to 
treatment conditions and holding constant all other influences. These studies are 
considered the “gold standard” for establishing causal effects; their key limitation, 
however, is the problem of limited (unknown) generalizability from lab settings 
and undergraduate research subjects to the real settings in which discrimination 
emerges. Field experiments, used most often to study discrimination in housing 
and labor markets, relax certain controls over environmental influences in order 
to better simulate real-world interactions. While retaining the key experimental 
features of matching and random assignment important for assertions of causality,
this approach relies on real contexts (e.g., actual real estate markets or employment 
searches) for its staged measurement techniques. Field experiments, however, are 
often costly and difficult to implement, and can only be used for selective decision- 
points (i.e., hiring decisions but not promotions). 

The next chapter in this section discusses statistical analyses of secondary 
datasets. Such analyses have advantages of large numbers, allowing researchers 
to take into account a wide range of influences and interactions. The primary
limitations of this approach, however, are threats to causality: omitted variable 
bias (failing to control for a variable related to both race and the outcome variable) 
and sample selection (when certain segments of the relevant population are over- 
(under-) represented in the sample). This chapter is the most technical of the 
volume, including discussions of statistical techniques for strengthening causal 
inference such as instrumental variables, propensity scores and matching, panel 
data methods, and natural experiments. 

The last of the methods chapters deals with attitudinal indicators from surveys 
and interviews. Questions measuring prejudice and other attitudinal indicators 
of race relations, as well as individuals’ own experiences with discrimination, can 
provide a valuable source of information. This chapter presents an overview of the 
major sources of survey data, discusses the various measurement problems common 
to survey research (reporting error, influences of survey design, poor association 
with corresponding behaviors, etc.), and proposes several potential solutions. 

This section ends with a chapter on racial profiling as a key illustrative example 
of the complexities in conceptualizing and measuring discrimination. 

Finally, part three of the volume addresses issues of data collection and research. 
At various points in the book, and in this final section in particular, one could 
develop the impression that this volume is primarily intended for funding agencies 
and government administrators. In a series of bullet point recommendations at the 
end of each chapter, an emphasis is placed on where resources should be invested 
to develop better measures or to support promising methodologies for measuring 
racial discrimination. While it is useful to have a roadmap for future development in the
field, this volume is likely to have far more of an impact on research practitioners 
and teachers, for whom this state-of-the-art handbook of methods for measuring 
racial discrimination will be a treasured resource. 


Race, Culture, and the Revolt of the Black Athlete: The 1968 Olympic Protests 
and Their Aftermath. By Douglas Hartmann. The University of Chicago Press, 
2003. 344 pp. Cloth, $58.00; paper, $22.00. 


BRYAN E. DENHAM, Clemson University 


At the 1968 Olympics in Mexico City, United States sprinters Tommie Smith and 
John Carlos used their time on the victory dais less to celebrate their athletic 
accomplishments than to draw attention to continuing racial inequities in the 
country for which they competed. In what became known as the “black power
salute,” both athletes, heads solemnly bowed, held a black-gloved fist in the air
during the playing of their national anthem, angering public officials in the United 
States as well as those from other nations whose primary responsibility, as they saw 
it, was to keep the Olympics free of political demonstrations. Douglas Hartmann, 
an associate professor of sociology at the University of Minnesota, suggests in 
Race, Culture, and the Revolt of the Black Athlete that this demonstration was not 
a militant act on the part of two radicals, as Olympic and public officials would
repeatedly portray it, but a much deeper gesture intended to symbolize an
ongoing struggle for civil rights among African Americans in the very land that 
reaped economic and political benefits from their athletic talents. The "salute" 
was not a spur-of-the-moment act of defiance, the author suggests, but the “final 
exercise” of a failed African American boycott of the Mexico City Games. 

Hartmann offers a well-researched, balanced treatment of the 1968 
demonstration, situating the protest in socio-historical context and not pulling 
punches where others might. As an example, while he credits those who really did 
take a bold and brave stand that year — who used their success in sport to draw 
attention to the plight of African Americans, and who risked the loss of lucrative 
contracts in professional sports following their amateur careers — the author also 
recognizes instances of self-serving demagoguery, sketchy details of certain events, 
and dubious assertions of accomplishment. He also points to one of the great 
ironies of the Olympic Project for Human Rights (OPHR), from whose organizers 
the idea for a boycott had originated: While the OPHR purported to advance the 
cause of civil rights, some of its organizers were not above attempting to intimidate,
embarrass, or humiliate athletes and coaches who did not support a boycott, 
portraying them as race-traitors and “Uncle Toms.” Additionally, black women 
were not welcome to participate in the highly masculinized, almost misogynistic, 
OPHR movement for equal rights under the law. Such ironies help to explain why 
the OPHR ran out of steam before the Olympics took place, leaving it to athletes, 
as individuals, to protest. 

Hartmann also discusses a more general irony of the Olympics, observing that 
while athletes from throughout the world assemble every four years for the Games, 
and that such an assembly purports to celebrate diversity across cultures, the athletes 
tend to be identified only with the nations they represent. “Olympic symbology,” 
the author suggests, “provides no formal space for representing various nonnational 
social categories such as race, religion, region, ethnicity, or gender, the collective 
identities and social solidarities that for many Olympic participants define most 
fundamentally who they are, the very essence of their being” (17). Thus, minority 
athletes, who must cope with social and economic inequality on a daily basis, are 
expected to suspend that reality and accept the “honor” of representing their country 
on an international stage. When members of marginalized groups do speak out, 
public officials often characterize such communication as the efforts of “radicals” 
or “unlawful extremists,” ultimately minimizing possibilities for change. 

Perhaps the greatest tragedy of the 1968 demonstration is the fact that little 
has changed in the world of sport; in some respects, the situation has worsened.
Athletes of color, although handsomely compensated at the elite levels of sport, 
continue to serve as athletic commodities, either helping a team win or being 
forgotten about altogether. Moreover, relatively few African Americans can be 
found in the coaching and executive ranks of sport, such that the hiring or firing 
of a coach in a prominent position immediately becomes a national news item; 
one can observe the hiring of Sylvester Croom at Mississippi State or the firing of 
Tyrone Willingham at Notre Dame as recent examples. Thus, while many sport 
and public officials in the 21st century portray the 1968 demonstration by Smith 
and Carlos as courageous in the face of imminent criticism, and indicative of how
individuals can use their expressive rights toward meaningful change, the reality 
is that very little has changed both within sport and in the larger society, which is 
presently as polarized as it has ever been, if not more so. 


Handbook of the Sociology of Religion. Edited by Michele Dillon. Cambridge 
University Press, 2003. 481 pp. Cloth, $90.00; paper, $30.00. 


WENDY CADGE, Harvard University and Bowdoin College 


Michele Dillon's recently edited Handbook of the Sociology of Religion is a tour 
de force of current thinking and recent scholarship in the sociology of religion. 
The book's 34 contributors cover a wide range of historical, thematic, theoretical, 
and methodological issues in 28 chapters, each of which was originally written, 
rather than being reprinted from elsewhere, for inclusion in this volume. Rather 
than simply reviewing current literature, each chapter represents the authors’
engagement with the topic based on their own current and previous research in 
the area. All of the book's chapters are substantial and engaging, and no review can 
do justice to the breadth of issues addressed in a volume of this magnitude. 

The Handbook opens with a series of chapters by Michele Dillon, Robert
Wuthnow, Robert N. Bellah, Peter Beyer, and Grace Davie about religion as an 
object of sociological inquiry. Sociologists of religion think of religion as a social
fact, Dillon argues, to focus on “understanding religious beliefs and explaining how 
they relate to worldviews, practices, and identities, the diverse forms of expression 
religion takes, how religious practices and meanings change over time, and their 
implications for, and interrelations with, other domains of individual and social 
action” (7). Grace Davie reviews the development of the sociology of religion 
from the discipline's founding fathers to the present in “The Evolution of the 
Sociology of Religion: Theme and Variations,” and Robert Wuthnow considers
what it means to study religion sociologically by responding to objections and 
misunderstandings about the discipline, theory, and method in “Studying Religion, 
Making it Sociological.” Robert N. Bellah and Peter Beyer present arguments about 
the definition, conceptualization, and importance of religion in “The Ritual Roots of 
Society and Culture” and “Social Forms of Religion and Religions in Contemporary
Global Society.” Both Bellah and Beyer point to broader questions about the
relationship between religions as conceived and practiced locally, nationally, and 
globally that run throughout the volume as a whole. 

The second section considers key theoretical and methodological issues in the 
sociology of religion. Chapters by Roger Finke and Rodney Stark, "The Dynamics 
of Religious Economies,” and Philip S. Gorski, “Historicizing the Secularization 
Debate: An Agenda for Research,” speak to theoretical questions about secularization 
that have been central to the field for many decades. Finke and Stark contrast
the old secularization approach with their new approach centered on religious 
economies, while Gorski argues that neither of these perspectives is likely correct 
because of the limited time periods and geographic regions from which evidence 
has been gathered. Rather, Gorski develops sociopolitical and religiocultural 
perspectives which he regards as promising alternatives to the secularization 
paradigm. In the methodological chapters, Michael Hout shows how demographic 
methods and thinking have contributed and can contribute to knowledge in the sociology of
religion in “Demographic Methods for the Sociology of Religion,” and Mark Chaves 
and Laura Stephens explore one topic, church attendance, in “Church Attendance 
in the United States” in which different methods of data collection have led to 
different findings over time. The last two chapters in this section by Patricia M.Y. 
Chang, “Escaping the Procrustean Bed: A Critical Analysis of the Study of Religious 
Organizations, 1930-2001," and Wade Clark Roof, “Religion and Spirituality: 
Toward an Integrated Analysis,” point to limits in the current analytic schemas 
for thinking about religion and organizations and spirituality respectively and 
propose novel approaches. 

The remaining chapters in the volume are divided into four thematic sections 
focused around religion and the life course; religion and social identity; religion, 
political behavior, and public culture; and religion and socioeconomic inequality,
each of which addresses how religion interacts with broader issues in public life. Chapters
by Darren Sherkat, Michele Dillon and Paul Wink, and Michael McCullough and 
Timothy Smith in the section about religion and the life course respectively examine 
religious socialization in families, religion and spirituality in late adulthood, and 
the relationship between religion and health. Penny Edgell looks more specifically 
at congregations in her chapter in this section, considering how they respond 
to members’ needs over the life course. The thematic section focused on social 
identity includes chapters by Nancy T. Ammerman; Helen Rose Ebaugh; Arnold 
Dashefsky, Bernard Lazerwitz, and Ephraim Tabory; Lynn Davidman; and Mary 
Jo Neitz that examine how religious identities are created inside and outside of 
national, ethnic, religious and nonreligious institutional contexts by immigrants and 
native born people in a wide range of religious traditions. Ammerman proposes a 
novel approach to questions of religious identity in her chapter, which opens and 
anchors this section. 

The last two thematic sections focus on the relationship between religion 
and broader public life. Chapters by Jeff Manza and Nathan Wright; Rhys H. 
Williams; Fred Kniss; N.J. Demerath III; and John R. Hall focus on religion and 
voting behavior, religion and social movements, religious conflicts in the broader 
moral order, religion and civil society, and religion and violence, primarily in the 
American context but with some cross-cultural comparisons. The volume’s final
essays by Richard Wood, Milagros Peña, and Omar McRoberts consider religion
and socioeconomic inequality with primary attention to faith-based organizing,
advocacy groups, and congregations. 

Taken together, the chapters in this volume are an excellent overview of current 
research and thinking in the sociology of religion. With appropriate supplemental 
case materials, this book could be the central text in an upper level undergraduate 
course in the sociology of religion and will be a valuable resource for lecture 
writing. This volume will also be core reading for graduate students and should 
be on the shelves of anyone interested in the sociological study of religion in the 
United States. 


Congregations in America. By Mark Chaves. Harvard University Press, 2004. 
291 pp. Cloth, $29.95. 


COURTNEY BENDER, Columbia University 


In the last decade government initiatives and federal welfare laws have expanded 
the role of religious organizations in social service provision, prompting 
widespread debate about the appropriate role of religious organizations in 
social service provision and in public life. Pundits and religious leaders alike 
have taken every imaginable side of the issue, and sociologists have not
been far behind. At least a dozen monographs and numerous sociological papers 
directly address related issues, ranging from case studies of church soup kitchens 
to surveys on clergy’s perceptions of the consequences of taking government 
funds. Despite the wide range of issues addressed, most sociological studies
(like the debates and government initiatives) assume that religious groups, and
in particular congregations, have an interest in providing social service, or in 
fact do so. 

Enter Mark Chaves’ excellent new book Congregations in America, which 
demonstrates that this assumption is incorrect. Congregations, the most ubiquitous 
religious organization in American life (and arguably the seat of Americans’ 
religious activities), may “host” a homeless shelter in their facilities, or a small group
of congregants may actively be involved in sponsoring a soup kitchen or other
specific social service programs. But on the whole, congregations are not as involved
as proponents would like, or detractors fear. Congregations, Chaves tells us, focus 
on worship and religious education, and commit most of their resources to those 
pursuits. That these findings are worth printing says volumes about the influence 
of popular debates on research on religion, and Chaves’ book reads in part as 
an argument for a return to studies in religion that are not directed by fleeting 
political concerns. “Overstating the extent of congregations’ social service work 
misleads about the nature of congregations,” he argues, stating that “we should 
not let notions of what congregations ought to look like influence our assessment 
of what they do look like” (93). 
While the immediate value of Congregations in America lies in this and similar
correctives, its lasting value rests in numerous observations about many aspects of 
American congregational life. Chaves’ purpose is to understand what “congregations
do,” and his account is built on findings from the 1998 National Congregations Study (NCS).
The 1998 NCS is the first survey of its kind to systematically sample American 
congregations on a national scale, and for this reason alone deserves attention. 
Much of what Chaves reports admittedly aligns with what sociologists of religion
already know from previous (smaller scale) statistical analyses, historical research, 
or individual congregational case studies. While it is supremely useful to have the 
statistics in hand, it will come as no surprise that the majority of congregations 
are small, employ few people, and have very small endowments, or that political 
activities (similar to social service provisions) remain “peripheral” for the vast 
majority of conservative and liberal congregations. 

That said, many of Chaves’ findings are strikingly “new.” For example, Chaves 
finds that congregations play an important role in sustaining American art worlds 
and “. . . facilitate art, and perhaps, on occasion, even beauty, more commonly and
more intensively than they pursue either charity or justice” (201). Americans are
more likely to attend a concert or play in a congregation than in any other setting, 
regardless of whether they are regular churchgoers. Congregations present manifold 
opportunities for individuals to participate in the arts: indeed, they provide 
many more opportunities to hone "artistic skills" than “civic skills.” In making 
observations about the arts (and, in another chapter, on worship styles), Chaves
draws attention to realms of religious life that have received comparatively little 
attention of late. Both chapters suggest new avenues for research and a framework 
that tempers the unabated (and perhaps misplaced) fascination with the role of 
religion in politics. 

Congregations in America is an extremely well made and clearly written book.
Its many positive attributes should not, however, keep readers from heeding 
the cautions that Chaves notes in the final chapter regarding what the survey 
itself might obscure about American congregational life. Built into the survey, 
for example, is an assumption that all congregations act with similar “agency,” 
masking how congregations are differently constrained or enabled by their 
relations within religious (or political, or social service, or arts) organizational 
fields. Likewise, this focus on congregations begs for future research that assesses 
congregations’ powers and impact compared with other religious bodies, including 
denominational structures, parachurch organizations, religious media outlets, and 
the like. This raises the further question of whether congregations are the most 
appropriate sites in which to find answers to the questions about the impact of 
religion in social services or public life, and whether such questions can be answered 
by studying congregations as isolated, noncontextualized entities. To Chaves’ 
credit, Congregations in America demonstrates the necessity of asking these
questions in future research while simultaneously deepening our understanding
of congregational life in contemporary America. 


Between Sundays: Black Women and Everyday Struggles of Faith. By Marla
F. Frederick. University of California Press, 2003. 263 pp. Cloth, $50.00; paper, 
$19.95. 


JOHNNY E. WILLIAMS, Trinity College (Hartford, Connecticut) 


Increasingly social science scholars are exploring culture's role in facilitating social 
activism. Marla Frederick's Between Sundays: Black Women and Everyday Struggles of
Faith contributes to this effort through an examination of religiosity's effect
on African American women's experiences and actions. Women's expressions 
of faith reflect what Frederick refers to as "spirituality" — the understanding 
of God and God's work in their everyday lives. Constructing religiosity in this 
way allows Frederick to conceptualize the agency religion inspires in a way that 
illuminates how multiple political and ideological meanings interact within the 
church's domain to frame action in both political and social relief terms (a point 
of view contemporary sociologists of religion generally accept). This nuanced 
construction of agency makes it possible for Frederick to think of the church 
as a “public sphere” in which women are encouraged to build community in
ways that inspire social activism. Data for the study was drawn from participant 
observation and ethnographic research conducted in Halifax County, North 
Carolina, between 1997 and 1998. 

Borrowing from the resource mobilization social movement theory, Frederick 
maintains that contexts and cultural processes influence how women interpret 
and act on their religiosity. She empirically explores how eight religious women's 
faith moved them to construct meanings about the inequality they confronted in 
their daily lives. Frederick found that two organizations, the Concerned Citizens of
Tillery, North Carolina (CCT), and the church were pivotal “public spheres” for 
transforming religious meaning into social action. Specifically, she documents 
how the CCT’s and the church’s institutional settings nurtured interpersonal 
bonds, meanings, confidence and obligations that engendered in women political 
efficacy. Similar to Higginbotham (1994) and Payne (1995), Frederick argues that 
institutions are cultural reservoirs used by women to construct meanings to shape 
their moral and political views about the kind of society that should exist. According 
to Frederick, the CCT’s and the church’s support of women’s movement into public
discourse encouraged women to interpret their involvement in the “public sphere” 
as missionary work and “to pursue acts of faith that are both highly public and 
highly personal ... with the aim of improving themselves and their communities.”

Frederick’s demonstration of how the personal translates into confrontations
with injustice, however, is tedious and inconsistent. For instance, though her 
exploration of televangelism is interesting, its contribution to our understanding 
of the dynamics of African American women’s activism is limited. This aspect of 
her analysis simply reveals that multiple interpretations of religiosity exist, some 
of which can be construed to discourage sociopolitical action. Since multivalent
religious points of view are common among religious adherents, Frederick’s
analysis of televangelism would have been more illuminating if it had explicitly
dissected how women's involvement in CCT and church activities influences their
interpretations of televangelism's message of individual prosperity. Do
some women view personal transformation as a tool for confronting injustice? Why 
or why not? Addressing these questions may disclose how participation in CCT and 
church activities exposed women to various social networks and interpretations 
of religiosity that can encourage or discourage social action. 

Frederick’s contention that personal transformation leads to advocating 
political and economic change is the most empirically troubling portion of her 
examination of religiosity in African American women’s everyday lives. Frederick 
argues that women’s faiths are expressed in two forms of personal activism: tithing 
and redefining intimacy, both of which lead women to constantly critique their 
behavior. Tithing, Frederick contends, leads to tension between the desire to 
spend and the obligation to tithe, thus inclining women to interrogate their own 
consumerism. The data regarding this process is problematic since it does not 
demonstrate how women’s religious critiques of consumption and materialism 
translate into political and economic activism. This same empirical disconnect 
applies to Frederick’s examination of African American women's sexual politics. 
Frederick begins her analysis of sexual politics suggesting that her data indicate that 
women use their spirituality to set boundaries for when and how others engage their 
bodies. She demonstrates this well with historical and primary data detailing how 
African American women’s readings of scriptural dictates about sexuality help them 
sustain the value of their bodies in the context of oppression. However, since this 
analysis is concerned primarily with understanding how women’s interpretations of 
scripture dealing with sexuality transform their personal behavior, we get virtually 
no sense of how women’s internalization of these sacred texts leads to community 
building and social action. 

Despite these problems, Frederick’s book makes an important contribution 
to our understanding of how culturally mediated interpretive processes engender 
efficacy. The book’s main value lies in its challenge to us to think critically about 
how institutional culture in oppressive contexts shapes religious consciousness. 


The Paradox of American Unionism: Why Americans Like Unions More Than 
Canadians Do But Join Much Less. By Seymour Martin Lipset and Noah Meltz, 
with Rafael Gomez and Ivan Katchanovski. Cornell University Press, 2004. 226 
pp. Cloth, $32.50. 


CLEM BROOKS, Indiana University, Bloomington 


Union density, the proportion of union members among employees within 
a country, has been a central focus in scholarly efforts to understand the 
historical and cross-national variability of labor movement strength, and its 
interrelationship with public policy-making. So vital has the study of union 
density been to the fields of political economy and comparative politics that 
the basic contours of trade union growth and decline during the past century 
are by now well-established. In the decades after World War II, spurred by 
changing levels of unemployment and such institutional developments as the 
establishment of centralized wage setting, trade union organizations flourished 
in nearly all capitalist democracies, yielding by 1980 an average density of over 
50 percent. But much like social policy, union density varies considerably across 
national borders and world regions, ranging during the 1990s from an average 
approaching 90 percent within Scandinavia to 23 percent in Switzerland, and as 
low as 14 percent in the United States. Indeed, the persistence of high levels of 
union density within Nordic Europe (and also Belgium) attests to the strength 
of labor movement organizations within polities where centralized wage setting 
remains firmly established. Elsewhere, however, the postwar pattern of union 
growth has given way in many cases to substantial decline, particularly within 
liberal democracies such as the U.K. and the U.S. (and in several continental 
European democracies such as Austria and the Netherlands). 

Given, then, the varieties of capitalism with respect to union density, the 
publication of Lipset and Meltz’s The Paradox of American Unionism provides a 
further and finer-grained perspective on the North American context. Investigating 
union growth and decline within Canada versus the United States, Lipset and 
Meltz seek to develop a new theoretical explanation for the comparatively stronger 
position of Canadian unions during the current historical period. Whereas union 
density in Canada has varied about a mean of roughly 30 percent since the 1960s, 
American union density has experienced a steady decline from a postwar high of 
approximately 35 percent to a current low of 14 percent. 

What explains the decline of American unions during the contemporary 
historical era? Lipset and Meltz revisit some well-established mechanisms, 
including labor law and the activities of national government, but their primary 
focus is on political-cultural factors. Summarizing results from their 1996 survey, 
Lipset and Meltz argue that egalitarian values and support for public provision are 
considerably more established within Canada in comparison to the United States, 
contributing to higher levels of union density. They also present some evidence that 
non-unionized Americans express slightly higher levels of preference for joining a 
union in comparison to their Canadian counterparts, but it is the broader pattern 
of cross-national differences in levels of public support for government regulation 
and provision that Lipset and Meltz identify as critical to understanding Canadian 
union strength. 

Given the focus of past research on political institutions, partisan governance, 
and macroeconomic factors, a theoretical approach to union density emphasizing 
mass opinion or cultural factors raises intriguing and unresolved issues. One of 
the most pressing questions concerns Lipset and Meltz’s explicit characterization 
of “values” as quite static, even largely unchanging over the course of the past 
century. Yet this view rests uneasily with the thrust of contemporary theory and 
research on mass opinion and its linkages to policy outcomes. Here, mass opinion 
is conceptualized as the aggregation of policy attitudes among individuals, and 
thereby subject to a variety of causal influences across countries and over time that 
exert some significant (if, at times, cross-cutting) pressures toward change. Among 
key mechanisms are life-course dynamics, individuals’ degree of participation in 
major institutions, and their proximity to the discursive environment provided 
by social networks and movement organizations. While the net influence of such 
factors on opinion formation may yield a degree of stability in the aggregate, 
these processes are multi-causal and contain an intrinsically random component, 
yielding probabilistic rather than deterministic predictions concerning over-time 
and between-country variation. Over long stretches of historical time, trends in 
the level of such factors, including with respect to population-wide or generation- 
specific responses to policy innovation and political conflict, tend to exert pressures 
toward change in aggregate opinion, and these expectations are broadly consistent 
with findings from past research on opinion trends within specific countries. 

The choice of a specific set of micro-foundations has far-reaching consequences 
for understanding the influence of mass opinion or culture-related factors on 
macro-outcomes such as trade union density and public policy. Indeed, going 
beyond probabilistic tendencies involving path-dependency, a static conception of 
“values” may ultimately risk restatement of the end of ideology thesis. The scenario 
that higher levels of union density in Canada versus the U.S. are influenced by the 
distribution of policy attitudes within each country is worthy of further study, yet 
skeptical scholars may have fewer grounds for taking this scenario seriously if it is 
conjoined with a static interpretation of values. 

In raising questions concerning the linkage of mass opinion or political-culture 
factors to union density, Lipset and Meltz’s book offers important contributions 
to the study of unions, but their regional focus also generates some predictable 
limitations. For example, the difference between Canadian union density at 30 
percent and U.S. union density at 14 percent is dwarfed by further comparisons 
with Sweden and Finland at 88 and 79 percent. Underlying this larger pattern of 
cross-national variation are established factors relating to political institutions and 
partisan governance, and notwithstanding their differences, Canada and the United 
States share most of the ideal-typical attributes of liberal democracies, possessing 
neither centralized wage-setting arrangements nor a social democratic party. At a 
time in which empirical theories of democracy are re-emerging within the study 
of comparative politics and public policy, The Paradox of American Unionism may 
be viewed as offering a challenge to historical-institutionalist and class-centered 
accounts of union density to not reject out of hand causal mechanisms relating 
to mass opinion or cultural factors. Evaluation of political-cultural approaches to 
understanding union density may further benefit from a broader focus across a wider 
range of developed democracies. 


Monitoring Sweatshops: Workers, Consumers and the Global Apparel 
Industry. By Jill Esbenshade. Temple University Press, 2004. 288 pp. Cloth, 
$69.50; paper, $22.95. 


ROBERT J.S. ROSS, Clark University 


Imagine a study of a crime wave. 

In a random survey of 67 potential places where people might be victimized, in 
56 percent of the places, people have suffered losses by theft. The annualized total 
thefts for this random sample are close to $610,000. The data for this reviewer's 
thought experiment are taken from Jill Esbenshade’s summary of the Department 
of Labor’s compliance monitoring surveys that measured the degree to which 
Los Angeles cutting and sewing shops paid the minimum wage or complied with 
overtime requirements of the Fair Labor Standards Act when they were monitored 
by the manufacturers. 

It is not likely that average annual thefts of $16,000 per small establishment 
would be seen as effective law enforcement.¹ The national average of workers per 
cutting and sewing shop is 17,² so the average individual loss per worker is $956 each 
year. If this is extrapolated to the Los Angeles garment workforce, estimated at 
the time of Esbenshade's research at 100,000 workers, the losses in wages are over 
$95 million annually. If citizens with typical access to politics or communications 
media suffered losses of $1000 per year, it would be an outrage, a scandal of major 
proportions. 
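
The arithmetic behind these figures can be retraced in a short sketch. The inputs are the numbers quoted above; the derivation itself is an assumption on the editor's part (the per-shop figure is taken to be the total divided by the number of surveyed shops with violations), and the variable names are illustrative only.

```python
# Inputs quoted in the review; the way they are combined is an assumed reading.
total_annual_losses = 610_000   # dollars, "close to $610,000" for the sample
shops_surveyed = 67             # shops in the random sample
share_with_violations = 0.56    # share of shops where losses were found
workers_per_shop = 17           # national average cited in the review
la_workforce = 100_000          # estimated Los Angeles garment workers

# Assumed step: spread the total over the shops where violations occurred.
shops_with_violations = share_with_violations * shops_surveyed
loss_per_shop = total_annual_losses / shops_with_violations

# Per-worker loss, then the reviewer's extrapolation to the whole workforce.
loss_per_worker = loss_per_shop / workers_per_shop
citywide_loss = loss_per_worker * la_workforce

print(f"Loss per shop:   ${loss_per_shop:,.0f}")
print(f"Loss per worker: ${loss_per_worker:,.0f}")
print(f"Citywide loss:   ${citywide_loss:,.0f}")
```

Under this reading the per-shop loss comes to a little over $16,000 and the per-worker loss to about $956, which is consistent with the rounded figures in the text.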

In the early 1990s, Department of Labor staff proposed using the “hot cargo” 
provision of the Fair Labor Standards Act (FLSA) to get the brand-name firms 
and retailers to require cutting and sewing contractors to enforce the labor laws 
— minimum wage, maximum hours, overtime, child labor, no home sewing. When 
the new Clinton administration took office, Secretary of Labor Reich and the head 
of the department's Wage and Hour Division, Maria Echeveste, grasped onto this 
idea as a means of combating sweatshops even in the budget-cutting atmosphere 
of the day. The hot-cargo provision of the FLSA allowed the secretary to prevent 
interstate shipment of goods produced under conditions that violate the Act. Reich 
and Echeveste began to require of firms caught using contractors who broke the 
law that they agree to inspect their contractors and certify that their contractors 
had obeyed the law. And so was born the late 1995 “Trendsetters list” — the press 
immediately dubbed it the “good guys” list — composed of firms who had signed 
agreements to monitor their contractors for labor law compliance — “compliance 
monitoring.” 
Reich and Echeveste were undoubtedly sincere in their desire to use scarce 
resources to prevent labor law violations. The Ford Foundation gave the program 
an Innovation award — for it promised to "leverage" scarce government resources 
in a “new” way to accomplish a socially applauded goal. This is in the context of 
a secular decline in enforcement capacity. Under Eisenhower (1957) the ratio of 
Wage and Hour Division investigators to employees was 1:46,000; by 1996 it was 
over 1:150,000.³ The Republican budget cutters were forcing the anti-sweatshop 
Department of Labor to invent strategies that privatized law enforcement. 

Compliance monitoring did reduce FLSA violations. In Southern California, in 
1996, “[t]he percentage of garment firms found in compliance with FLSA minimum 
wage requirements increased nearly 20 percentage points (from 39% to 57%). Firms 
found complying with FLSA overtime requirements increased 23 percentage points 
(from 22% to 45%).” So contractors violated the minimum wage law in 43 of 100 cases 
and failed to pay overtime in 55 of 100 cases, despite monitoring. 
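
The residual violation rates follow directly from the quoted compliance percentages. A minimal sketch, using only the figures in the quotation (the dictionary layout and names are illustrative assumptions):

```python
# Percent of firms in compliance (before, after), as quoted in the review.
compliance = {
    "minimum wage": (39, 57),
    "overtime": (22, 45),
}

# For each rule: percentage-point gain and the share still in violation.
summary = {
    rule: (after - before, 100 - after)
    for rule, (before, after) in compliance.items()
}

for rule, (gain, still_violating) in summary.items():
    print(f"{rule}: +{gain} points, {still_violating}% still in violation")
```

The minimum wage gain works out to 18 points, which the quoted source rounds to "nearly 20."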

Jill Esbenshade brought her experience as a union organizer to her scholarship 
on the question of “voluntary” codes of conduct. Her book, a revised doctoral 
dissertation, is a focused and effective examination of the code of conduct/compliance 
monitoring strategy. The work has historical depth and intellectual clarity. 

The monitoring strategy stands in contrast with two different approaches. One, 
domestic, would hire investigators and boost sanctions to the level of a credible 
deterrent; the second would use the enforcement powers implicit in the new 
international agencies of trade and enforcement (e.g. the WTO's trade sanctions) 
to punish labor standards violators. 

Esbenshade has critical insight into the sociological difference between labor 
law enforcement and corporate code compliance monitoring: 

[T]he workers have no influence over the practices of monitoring... 
Enforcement, which relies on the power of local governments, and organizing, 
which depends on the power of workers, is floundering while corporate campaigns 
and monitoring, which rely on publicity and consumer/investor power, are on 
the upsurge. . . . globalization has undermined the ability of local governments to 
enforce their own laws. Workers' power has been . . . decreased in the . . . the power 
of investors and consumers has been greatly increased. (55) 

At the end of her detailed investigation, Esbenshade writes: “[P]rivate monitoring 
has not been successful in ameliorating the sweatshop problem because of flaws 
in the way monitoring is carried out and the conflicts of interest involved, and 
the structure of the industry such as secrecy, mobility, workers' vulnerability and 
pricing” (198). 


Notes 
1. These data are extrapolated from Esbenshade's data. 


2. Robert J.S. Ross. 2004. Slaves to Fashion: Poverty and Abuse in the New Sweatshops. University 
of Michigan Press. 


3. Ibid. 
Erratum to: 

Reynolds, Jeremy. “You Can't Always Get the Hours You Want: Mismatches 
between Actual and Preferred Work Hours in the U.S.” Social Forces 81(4):1171- 
99. 


In Table 4 of Jeremy Reynolds’s article, “You Can’t Always Get the Hours You 
Want: Mismatches between Actual and Preferred Work Hours in the U.S.,” the 
rows of coefficients for professional occupation and white collar occupation were 
placed in incorrect columns. Also, the first coefficient in the row for professional 
occupation should have been reported as 2.02, not .02. The correct version of 
the table is reprinted here. 


Erratum to: 

Fisher, Dana R., and William R. Freudenburg. 2004. “Post Industrialization and 
Environmental Quality: An Empirical Analysis of the Environmental State.” Social 
Forces. Volume 83(1):157-88. 


In the published version of this paper, Table 3 incorrectly reported the first 
set of regression coefficients for the Ecological Footprint variable. Although 
the significance level was correctly reported, the error might have conveyed to 
some readers the impression that there was a potentially important degree of 
fluctuation in coefficients between the full model and the subsequent models 
— an indicator of potential problems of multicollinearity — when the actual 
standardized and unstandardized coefficients were -.165 and -.304, respectively, 
making them almost identical to the coefficients in the second model. When the 
full set of coefficients is considered, it can be seen that, even though the table 
reports a total of 54 coefficients, there are no cases where the coefficients change 
signs, and there is only one case where the standardized coefficient changes by 
as much as 0.1: the change from +.656 to +.547 for Motor Vehicle Travel, per 
capita, between the third and fourth models. 


© The University of North Carolina Press Social Forces, June 2005, 83(4):1793-1796 
TABLE 4: Multinomial Logistic Regression of Mismatches on Personal and Job Characteristics 

Model 1 Model 2 Model 3 Model 4 Model 5 

Wants Fewer Hours / Wants More Hours (two columns under each model) 


Worker characteristics 


Women 
Single woman 97- 107 
Dual-earner (no children) 2625 245 
Dual-earner with children 1.05 24t 
Single mother 108 1.65 
Men 
Breadwinner (no children) 7.84* 6.04 
Dual-earner (no children) 1649 197 
Breadwinner with children 69 2.65 
Dual-earner with children Ja 129 
Single father 21232 159 
College degree Tyee Qe 
Age ISS RS 
Age² 1.00* 1.00 
Black Ot TAI 
Work values 
Importance of work TO IOT 
Job security 952 1:00 
High income 1-118 105 
Opportunity for advancement 70 148 
Interesting job 1357 S9 
Control over hours 124 1.01 
Job characteristics 
Job rewards 
Job security 1.01 78 
High income 76+ | .64* 
Opportunity for advancement 94 1.66** 
Interesting job 85 .66* 
Control over hours 1.41 1.20 
Part-time 57 2.54T 
Managerial occupation 1.09 32t 
Professional occupation 2.021 S 
White collar occupation 79 .43* 
Union member 92 85 
Organization size 1.02 83+ 
Weekly hours 110725075 1.01 .99 T0280. — 6r 
Pseudo R² 04 3 29 
—2 Log-likelihood 211.17 671.97 618.59 


1.08 1.40 
3.80*  .50 
1.57 25 
1:36 . 1.99 
8.25* 4.40 
69) 92515 
92. 472] 
84 195 
A21. 2402 
1897 -20° 
119% 96 
1.00* 1.00 
86 141 
JA* .1.10 
104 117 
1-17 . 1.89* 
DI A 
1.34 .90 
121 140 
.88 78 
SA. SO 
99 16r 
79 22 
Ia 137 
43 333%* 
.62 .64 
99 7209 
Dr x62 
94 95 
1.00 .80* 
1.01 98 
37 
577.47 
357 


More Fewer More 


1179 1556 
416** 1156 
1.70 31 
149 243 
8.76* 4.98 
.0 2.04 
89 4.18T 
85> US 
137 2:15 
1791 20 
1.19] 98 
1.00* 1.00 
81” 128 
TI KOE 
1025 WIG 
110 1.4T 
62¢ 1.40 
1.38 89 
121 10 
84 .70T 
91 : 1241 
.76 UO 
1.537 1.40 
.44  329* 
57 .45 
92. 209 
S2] 389 
93 91 
m LS 
1.01 .99 
.35 
585.74 
357 


N 357 


Note: The numbers in the table are exponentiated coefficients. 